ATTITUDES AND NORMATIVE BELIEFS AS FACTORS
INFLUENCING BEHAVIORAL INTENTIONS¹


ICEK AJZEN² AND MARTIN FISHBEIN


University of Illinois 


Descriptions of four hypothetical situations involving risk included information
that the subject’s own estimate of the probability of success (P) was high or low
and that the probability of success as estimated by his close family and friends (P_o)
was high or low. The former manipulation was found to influence the subject’s
attitude toward the risky act (Aact), while the latter manipulation affected his
normative belief (NB), that is, his belief that his close family and friends expected
him to perform the risky act. Consistent with a theoretical model, Aact and NB were
found to predict behavioral intentions. Aact carried more weight than NB in
determining intentions, and, consistent with expectations, a second study demonstrated
that a change in P influenced intentions, while a change in P_o did not.

In recent years a series of investigations has
provided substantial support for Dulany’s
(1968) theory of propositional control (Dulany,
1961, 1964, 1968) and for Fishbein’s (1967)
extension of this theory to social behavior (e.g.,
Ajzen, 1971; Ajzen & Fishbein, 1969, 1970b;
Carlson, 1968; Fishbein, Ajzen, Landy, &
Anderson, 1970). The present paper was an
attempt to provide answers to some of the
problems that have emerged in these investigations.
Our concern was mainly with Fishbein’s
formulation of the theory (for a discussion
of the relationships between Dulany’s
theory and Fishbein’s extension, cf. Fishbein,
1967).

The immediate concern of the theory is the
prediction of behavioral intentions (BI) which
are assumed to mediate overt behavior. According
to the extended model, an individual’s
intention to perform a given act is a joint
function of his attitude toward performing that
behavior (Aact) and of his beliefs about what
others expect him to do in that situation.
These normative beliefs (NB) are in turn multiplied
by the individual’s motivation to comply
with the norms (Mc). Symbolically, the central
equation of the theory can be expressed as
follows:

$$B \sim BI = [\mathrm{Aact}]\,w_0 + [\mathrm{NB}(\mathrm{Mc})]\,w_1 \qquad [1]$$

1 The authors are indebted to Don E. Dulany, Harry
C. Triandis, and Alan W. Wicker for their critical comments
on an earlier draft of this paper.

2 Requests for reprints should be sent to Icek Ajzen,
who is now at the Department of Psychology, University
of Massachusetts, Amherst, Massachusetts
01002.


In Equation 1, B = overt behavior; BI = behavioral
intention; Aact = attitude toward the
act; NB = normative belief; Mc = motivation
to comply with the normative belief; and
w_0 and w_1 are empirically determined weights.

It can be seen that behavioral intentions 
(and corresponding overt behaviors) are a 
function of the weighted sum of two com- 
ponents, Aact and NB(Mc). The relative 
importance of these two components, that is, 
their weights, is expected to vary with the kind 
of behavior that is being predicted, with the 
conditions under which the behavior is to be 
performed, and with the person who is to 
perform the behavior.³ Unlike traditional approaches
to attitude, Aact is the person’s
attitude toward performing a particular act in 
a given situation with respect to a given object, 
and not his attitude toward the object or class 
of objects per se. 
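As a rough illustration of Equation 1, the sketch below computes a predicted intention score as the weighted sum of the attitudinal and normative components. The function name, the input scores, and the weights are hypothetical and were chosen only to show the arithmetic; in the theory the weights are determined empirically rather than fixed in advance.

```python
def predicted_intention(aact, nb, mc, w0, w1):
    """Weighted sum of Equation 1: BI = [Aact] * w0 + [NB(Mc)] * w1."""
    return aact * w0 + (nb * mc) * w1


# Hypothetical scores and weights (assumptions, not values from the study).
# Same component scores, but different relative weights of the components.
attitude_dominant = predicted_intention(aact=2.0, nb=1.0, mc=1.0, w0=0.8, w1=0.1)
norm_dominant = predicted_intention(aact=2.0, nb=1.0, mc=1.0, w0=0.1, w1=0.8)

print(attitude_dominant)
print(norm_dominant)
```

With identical component scores, the predicted intention differs only because the weights differ, which is the sense in which the relative importance of the two components can vary across behaviors, conditions, and persons.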

A person’s attitude toward a specific act is 
proposed to be a function of the act’s perceived 
consequences and of their values to the person. 
Consistent with Fishbein’s earlier theorizing 
(Fishbein, 1963), and similar to formulations 
proposed by other theorists (e.g., Atkinson, 
1957; Peak, 1955; Rosenberg, 1956; Rotter, 
1954), Aact is conceptualized in terms of an 
expectancy-value model⁴:

$$\mathrm{Aact} = \sum_{i=1}^{n} B_i a_i \qquad [2]$$


3 For a discussion of these and related issues, see
Fishbein (1967) or Ajzen and Fishbein (1970a).

4 For a comparison of various expectancy-value
models, see Feather (1966).
where B_i refers to the individual’s belief about
the likelihood that the behavior in question
will result in Outcome i; a_i is the person’s
evaluation of (or attitude toward) Outcome i;
and n is the number of beliefs.

Equation 2 is clearly very similar to the 
subjective expected utility (SEU) model of 
behavioral decision theory (cf. Edwards, 1961). 
The SEU of a given alternative is a function
of the subjective probability that certain outcomes
will follow the act (SP_i) multiplied by
the respective subjective values (or utilities)
attached to these outcomes (U_i). The products
are summed over all possible outcomes of the
act, as in Equation 3:

$$\mathrm{SEU} = \sum_{i=1}^{n} SP_i U_i \qquad [3]$$

While there are certain differences between the
models represented in Equations 2 and 3
which need not concern us here, ΣB_ia_i is
essentially equivalent to ΣSP_iU_i. Both models
specify that for any individual, the attractiveness
of a given act is a function of the summed
products of the subjective probabilities and
utilities he assigns to the act’s outcomes.
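The summed-products form shared by Equations 2 and 3 can be sketched in a few lines. The function name and the example probabilities and evaluations below are hypothetical; they are not data from the study.

```python
def expectancy_value(beliefs, evaluations):
    """Sum of belief-by-evaluation products, as in Equations 2 and 3."""
    if len(beliefs) != len(evaluations):
        raise ValueError("each belief needs exactly one evaluation")
    return sum(b * a for b, a in zip(beliefs, evaluations))


# Hypothetical act with two outcomes: subjective probabilities of each
# outcome and the value attached to each outcome (assumed numbers).
subjective_probabilities = [0.7, 0.3]
utilities = [3, -2]

seu = expectancy_value(subjective_probabilities, utilities)
print(seu)
```

The same function serves for Aact (beliefs B_i and evaluations a_i) and for SEU (probabilities SP_i and utilities U_i), which is the sense in which the two models are essentially equivalent.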

In most empirical investigations, ΣSP_iU_i is
based on a limited number of potential outcomes
of an act. A general measure of Aact,
on the other hand, should also be influenced by
other consequences not considered in the measurement
of ΣSP_iU_i. A direct measure of Aact
will therefore tend to be more general than
most measures of ΣSP_iU_i, and the correlation
between Aact and ΣSP_iU_i will not be
perfect.

A good deal of information, then, is available 
concerning Aact, the theory’s attitudinal 
component. Much less, however, is known 
about NB(Mc), the normative component. It 
would be desirable to have more information 
about the origins of normative beliefs and 
about their relations to variables other than 
those of the present theory. It is worth noting 
that the concept of NB, normative beliefs, is 
much more restricted than what is usually 
meant by social norms. It refers to a specific 
behavioral expectation attributed to a given 
social agent. While a social norm is usually 
meant to refer to a rather broad range of 
permissible, but not necessarily required, 
behaviors, NB refers to a specific behavioral
act the performance of which is expected or
desired under the given circumstances.

A belief about a specific expectation of this 
nature may have its origin in innumerable 
inference processes. It is here proposed that a 
person’s normative beliefs are at least in part 
a function of the perceived attitude of relevant 
others toward the act in question. This perceived
attitude will be denoted Aact_o. Like a
person’s own attitude, Aact_o should also be
determined by subjective probabilities and
utilities of the act’s perceived consequences.
But, of course, for Aact_o we have to consider
the person’s perception of the relevant others’
subjective probabilities (SP_o) and utilities
(U_o). A sum of the products of these quantities
(ΣSP_oiU_oi) should correlate positively with a
more general, direct measure of Aact_o.

On the basis of these considerations it is
proposed that Aact and NB can be manipulated
in the following manner: (a) Variations
in SP directly influence ΣSP_iU_i and thus
produce related changes in Aact. (b) Manipulations
of SP_o directly influence ΣSP_oiU_oi and
thus produce related changes in Aact_o and NB.
However, such manipulations of the attitudinal
or the normative component will not
necessarily produce differences in behavioral
intentions. Whether or not a change in Aact or
NB will significantly affect intentions will
depend on the relative weights of these two
components in the prediction of intentions. The
higher the weight of a given component, the
more should change in that component be
reflected in behavioral intentions.


METHOD 


Four different hypothetical situations, somewhat 
similar to the situations in the choice-dilemmas task 
(Kogan & Wallach, 1964), were formulated for use in 
the present experiment. In each of the situations, an
element of risk was involved in making a behavioral 
choice. 

An attempt was made to create situations such that 
in two of them Aact would be more important than NB 
in predicting behavioral intentions, while in the other 
two, NB would be the more important determinant. 
The description of each situation included two levels of 
probability indicating (a) the subject’s own estimate of 
the likelihood that the proposed action would lead to 
success and (b) the likelihood of success as estimated by 
his close family and friends. For a given subject in a 
given hypothetical situation these probability levels 
were either both 70%, both 30%, or one 70% and the 
other 30%—a total of four possible conditions. 


The four hypothetical situations in their four different 
conditions of risk were given to 56 undergraduate 
students, both males and females. The subjects were 
randomly divided into the experimental conditions on 
the basis of a Latin square design such that a given 
subject responded only once to each hypothetical situation
and only once to each of the four conditions. Thus,
there were 14 subjects in each situation-condition 
combination. The four hypothetical situations were 
presented to the subjects as follows: 


Situation 1: Investment


Over the last few years you have done some part- 
time work and you have saved the amount of $1,000. 
You would now like to invest this money profitably. 
It has come to your attention that a plot of real 
estate in a developing residential area is available
for the amount of money in your possession. If a 
planned building project proves successful, the plot 
of land might double its present value. However, 
if the project fails, you would lose your entire
investment.


Situation 2: Operation


You have been informed by your physician that
you have developed a severe liver ailment. The
disease is sufficiently serious to force you to change
many of your life habits: reducing your work load,
drastically changing your diet, giving up your
favorite leisure-time pursuits. Your physician suggests
that a delicate medical operation could be
attempted which, if successful, would completely
relieve the liver condition. But its success cannot be
assured, and, in fact, the operation might prove fatal.


Situation 3: Renting 


As part of your job with a national company, you 
have to leave town for 2 years. You are therefore 
looking for someone to rent your house for the period 
of your absence. In response to your advertisement in 
a local newspaper a Negro family applies for the lease. 
Since your house is located in a white suburban 
neighborhood you find yourself in a difficult situation. 
Your neighbors may readily accept the Negro 
occupants, and there will be no problems. However, 
they might react negatively, and could even go so 
far as to attack the Negro family or damage your 
home. In this case you would find it extremely 
difficult to return to your home and live with your
neighbors.


Situation 4: Transplant 


Your brother whom you deeply admire is in 
precarious health. Within the next few days a kidney
transplant has to be performed. As a member of the 
family you are a suitable donor, and you are ap- 
proached by the physician to donate one of your 
kidneys. Although there is no immediate danger to 
your own life, the transplant would of course have 
some effect on your health, and would restrict your 
way of life. If successful, the transplant would save 
your brother’s life. However, its success cannot be 
assured and, in fact, your brother might die anyway. 
Each situation was followed by a paragraph ascribing
a probability of success to the subject as well as to his
close family and friends. The following is an example for
the first situation; the brackets were, of course, omitted
from the description when presented to the subjects.


Personally you have [high/low] confidence in the
planned building project, and you believe that it has
a __% chance of success. However, before committing
yourself, you talk about the situation with a number
of people, including your close family and friends.
It turns out that most of the people you consult have
[much more/the same/much less] confidence in the
project’s chances of success; most of them believe
that the project has a __% chance to succeed.

As a result of these discussions you [increase/
retain/reduce] your estimate of the project’s likelihood
of success to be __%.


To form the four conditions of risk, values in the 
blank spaces were varied such as to produce final chance 
levels (own-other) of 70%-70%, 70%-30%, 30%-70%, 
or 30%-30%. In the conditions where the subject and
his close family and friends agreed, initial chance levels 
were set at 70% or 30%, and they remained at that 
level. In the conditions where original probability 
estimates of the subject and his close family and friends 
were in conflict, the subject’s initial estimate was set at 
80% or 20%, and as a result of his discussions he 
reduced his estimate to 70% or increased it to 30%. 

The four hypothetical situations were arranged in 
random order. Each situation was immediately followed
by a questionnaire designed to assess the variables of
interest.


Questionnaires 


The following is an example of the questionnaire for 
Situation 1. The same items were modified for use in all 
four situations. 

1. BI: behavioral intentions. The concept “I would
buy the plot of land” was rated on a 7-point scale
ranging from “extremely probable” to “extremely
improbable.”

2. Aact: attitude toward the act. The concept
“buying the plot of land” was rated on the following
four evaluative semantic differential scales: foolish-wise,
good-bad, harmful-beneficial, rewarding-punishing.
The sum across these four scales served as an index
of Aact.

3. NB: normative beliefs. The concept “Most of the
people consulted would expect me to buy the plot of
land” was rated on a 7-point scale ranging from
“extremely probable” to “extremely improbable.”

4. Aact_o: perceived attitude of relevant others. The
concept “Most of the people consulted would think that
buying the plot of land is” was rated on the same four
evaluative semantic differential scales used to assess
Aact. The sum across the four scales again served as an
index of Aact_o.

5. Subjective probabilities of success and failure.
Toward the end of the questionnaire appeared the
following measure of subjective probabilities:
When you answered the various questions above,
what did you think was the probability that the 
planned building project would succeed? 

We know that some people will have actually 
used the probability given in the description of the 
situation, while others will have actually used a 
different probability. It makes no difference to us 
what the actual probability was that you used, but it 
is essential that we know this probability. 


(a) SP: the subject’s own subjective probability. “I
thought that the building project’s chances of success
were __%.” (b) SP_o: the perceived probabilities of
others. “I thought that the people I consulted believed
the building project’s chances of success to be __%.”⁵


6. Utilities of success and failure. (a) U: the subject’s
own utilities were measured on the following two
scales:


You buy the plot of land, and the planned building
project is a success.
This course of events is:

Extremely Good ________ Extremely Bad


You buy the plot of land, and the building project
is a failure.
This course of events is:

Extremely Good ________ Extremely Bad


(b) U_o: the perceived utilities of others. These were
measured in a manner comparable to the subject’s
own utilities. The statements to be rated were: “If I
bought the plot of land, and the planned building project
was a success (or failure), most people consulted would
consider this course of events to be. . .”


On the basis of Items 5 and 6, ΣSP_iU_i and ΣSP_oiU_oi
were computed for each subject.⁶
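The computation from Items 5 and 6 can be sketched as follows. In line with Footnote 5, the probability of failure is taken as the complement of the reported probability of success. The function name is hypothetical, and the scoring of the good-bad utility scales from 1 (extremely bad) to 7 (extremely good) is an assumption made only for this illustration; the text does not state how the scales were scored.

```python
def success_failure_sum(sp_percent, u_success, u_failure):
    """SP * U(success) + (1 - SP) * U(failure) for a two-outcome act.

    sp_percent is the reported chance of success in percent (0 to 100).
    """
    if not 0 <= sp_percent <= 100:
        raise ValueError("sp_percent must lie between 0 and 100")
    sp = sp_percent / 100.0
    return sp * u_success + (1.0 - sp) * u_failure


# Assumed scoring: 7 = extremely good, 1 = extremely bad (hypothetical).
own_sum = success_failure_sum(70, u_success=7, u_failure=1)
others_sum = success_failure_sum(30, u_success=7, u_failure=1)

print(own_sum)
print(others_sum)
```

The same function applies to the subject’s own values (SP, U) and to the values he attributes to others (SP_o, U_o); only the inputs change.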

The reader will note that no measure of Mc, the 
motivation to comply with social norms, was obtained. 
The reason for this omission is that in previous investigations
(e.g., Ajzen & Fishbein, 1969, 1970b) the
measures of Mc were unsatisfactory. Since the present
hypotheses could be tested without reference to Mc, no
such measure was included in the questionnaire.⁷


RESULTS 
Prediction of Aact


Column 1 of Table 1 presents the correlations
between Aact and ΣSP_iU_i for each of the four


5 Subjective probabilities of failure were assumed to
be equal to 1 - SP and 1 - SP_o.

6 It should be noted that the measurement procedures 
used in the present experiment do not meet the usual 
requirements of a formal mathematical model made by 
decision theorists. Thus, our measures can at best be 
taken only as approximations of subjective expected 
utilities. 

7 In a recent study, Ajzen (1971) showed that more
adequate measures of Mc could be secured. 


TABLE 1


CORRELATIONS BETWEEN AACT AND ΣSP_iU_i AND
BETWEEN NB, AACT_o, AND ΣSP_oiU_oi


Situation  | Aact-ΣSP_iU_i | NB-Aact_o | NB-ΣSP_oiU_oi | Aact_o-ΣSP_oiU_oi
Investment | .814**        | .?13      | .757**        | .?13
Operation  | .613**        | .697**    | .617**        | .626**
Renting    | .587**        | .626**    | .454**        | .425**
Transplant | .299*         | .354**    | -.023         | .107


Note. N = 56.
* p < .05.
** p < .01.


situations. As expected, all four correlations
were significant. It is of interest to note that
the lowest correlation was obtained in Situation
4, the kidney transplant. This is quite reasonable,
since one would expect consequences other
than simply the probability of the transplant’s
success or failure (the factors making up the
present measure of ΣSP_iU_i) to influence Aact.
In general, however, there is support for the
notion that Aact is a function of a person’s
beliefs about the consequences of performing
a given act and his evaluation of these
consequences.


Prediction of NB 


Columns 2, 3, and 4 of Table 1 present
intercorrelations of NB, Aact_o, and ΣSP_oiU_oi.
Consistent with expectations, NB was significantly
related to Aact_o in all four situations.
Further, with the exception of Situation 4, the
kidney transplant, ΣSP_oiU_oi was significantly
related to Aact_o and to NB.


Manipulations of Aact and NB 


An attempt was made to influence Aact and
NB by manipulating the subject’s probability
of success (SP) and his perception of the probability
of success held by his close family and
friends (SP_o). Table 2 shows the effects produced
by variations in the “objective” probabilities
of success (P and P_o) provided in the
descriptions of the hypothetical situations. The
means and analyses of variance presented in
Table 2 are based on data from all four situations.⁸
Looking at the first two rows in Table 2,
it can be seen that, as expected, variations
in P and P_o significantly affected SP and SP_o,
respectively. Further, while variations in P did
not influence SP_o, manipulating P_o had a
significant effect on SP. Thus, while there is
evidence that the subject tended to bring
his own beliefs into line with those of his
close family and friends, there is no evidence
that he “distorted” his perception of the beliefs
of others in order to make them more consistent
with his own views.⁹

These findings, in conjunction with the
correlations presented in Table 1, account for
the effects of manipulating P and P_o on the
remaining variables in Table 2. As predicted,
variations in P had significant effects on
ΣSP_iU_i and on Aact. Similarly, variations in
P_o had significant effects on ΣSP_oiU_oi, Aact_o,
and NB. Further, just as manipulating P_o
significantly affected SP, the manipulation of
P_o also had significant effects on ΣSP_iU_i and
Aact. In contrast, manipulations of P did not
influence SP_o, and they also had no significant
effects on ΣSP_oiU_oi, Aact_o, or NB.


TABLE 2


MEAN SP, SP_o, ΣSP_iU_i, AACT, ΣSP_oiU_oi, AACT_o, AND NB SUMMED OVER
SITUATIONS AND ANALYSES OF VARIANCE RESULTS


           | Group M, by conditionᵃ            | ANOVA: F levels, by manipulation
Criterion  | 70-70  | 70-30  | 30-70  | 30-30  | P           | P_o       | P × P_o
SP         | .692   | .644   | .424   | .331   | 204.662**   | 13.622**  | .393
SP_o       | .652   | .325   | .628   | .321   | .559        | 276.975** | .308
ΣSP_iU_i   | 5.180  | 4.757  | 3.843  | 3.355  | 108.246**   | 10.356**  | .062
Aact       | 21.785 | 20.268 | 17.500 | 16.125 | [illegible] | 4.419*    | .014
ΣSP_oiU_oi | 4.834  | 3.421  | 4.863  | 3.470  | 2.612       | 101.294** | .156
Aact_o     | 20.750 | 12.964 | 21.232 | 14.571 | 1.219       | 72.920**  | .391
NB         | 5.429  | 3.625  | 5.161  | 3.214  | 2.904       | 106.071** | .166


a The first number is the subject’s objective probability of success (P); the
second number is the probability attributed to the subject’s close family and
friends (P_o).
* p < .05, df = 1/54.
** p < .01, df = 1/54.


8 Separate analyses of variance for each situation
yielded results comparable to the overall analyses and
are not presented due to space limitations. However, as
might be expected on the basis of the low correlations
in Situation 4, the kidney transplant (see Table 1),
variations in P and P_o did not significantly affect
Aact or Aact_o in this situation.

9 It should be noted that these adjustments in SP
were over and above those provided for in the description
of the situations. As will be recalled, the subjects
were told that they had already adjusted their estimates
of the chances for success after discussing the situation
with their close family and friends.


Prediction of BI 


According to Equation 1, BI should be 
predictable from Aact and NB. Table 3 
presents product-moment correlation coeffi- 
cients, standardized regression weights, and 
multiple correlations of Aact and NB on BI 
for each of the four situations. It can be seen 
that the multiple correlations were high and 
significant in all four situations. However, it
can also be seen that our a priori attempt to 
manipulate the relative weights of the two 
components was not very successful. Although, 
as expected, the NB-BI correlations were 


TABLE 3


CORRELATIONS, REGRESSION COEFFICIENTS, AND
MULTIPLE CORRELATIONS OF AACT AND
NB ON BI


           | Correlation coefficient | Regression coefficient |
Situation  | Aact-BI | NB-BI         | Aact-BI | NB-BI        | R
Investment | .841**  | .349*         | .833**  | .019         | .841**
Operation  | .794**  | .243          | .775**  | .142         | .806**
Renting    | .840**  | .500**        | .801**  | .074         | .851**
Transplant | .538**  | .434**        | .446**  | .296*        | .607**


Note. N = 56 in each situation.
* p < .05.
** p < .01.
TABLE 4

GROUP MEANS OF BI AND ANALYSES OF VARIANCE RESULTS

           | Group M, by conditionᵃ        | ANOVA: F levels, by manipulation
Situation  | 70-70 | 70-30 | 30-70 | 30-30 | P        | P_o         | P × P_o
Investment | 4.929 | 5.214 | 3.788 | 2.071 | 28.676** | 3.186       | 6.245*
Operation  | 5.928 | 5.257 | 4.500 | 4.214 | 9.864**  | 1.096       | .122
Renting    | 5.643 | 5.071 | 4.143 | 3.500 | 11.495** | [illegible] | .006
Transplant | 6.643 | 5.429 | 5.500 | 5.357 | 2.648    | 3.307       | .061

Note. N = 14.

a The first number is the subject’s “objective” probability of success (P); the
second number is the probability attributed to the subject’s close family and
friends (P_o).
* p < .05, df = 1/52.
** p < .01, df = 1/52.


higher in Situations 3 and 4 than they were in 
Situations 1 and 2, it was only in Situation 4 
that the weight of the normative component 
reached statistical significance (p < .05), and 
even here more weight was carried by the 
attitudinal component. Indeed, in all four 
situations, the subject’s intentions were pri- 
marily determined by the attitudinal com- 
ponent. Thus, while we would expect variations 
in Aact to produce significant differences in BI 
in all four situations, variations in NB should 
have little or no effect on BI in Situations 1, 
2, and 3, although such variations may produce 
a small effect in Situation 4. 
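For standardized predictors, the squared multiple correlation equals the sum, over predictors, of each standardized regression weight times that predictor’s zero-order correlation with the criterion. The sketch below applies this standard identity to the Investment row of Table 3 as a consistency check on the tabled values. The function name is hypothetical, and because the tabled coefficients are rounded to three decimals, the result can only be expected to agree approximately.

```python
import math


def multiple_r(betas, criterion_correlations):
    """Multiple R from standardized weights and zero-order correlations.

    Uses the identity R^2 = sum(beta_j * r_yj) for standardized predictors.
    """
    r_squared = sum(b * r for b, r in zip(betas, criterion_correlations))
    return math.sqrt(r_squared)


# Investment row of Table 3: regression weights and correlations with BI
# for Aact and NB, in that order.
investment_r = multiple_r(betas=[0.833, 0.019],
                          criterion_correlations=[0.841, 0.349])

print(round(investment_r, 3))  # 0.841, matching the tabled R
```

Nearly all of the squared multiple correlation in this row comes from the Aact term (.833 × .841), which is another way of seeing that intentions in this situation were primarily determined by the attitudinal component.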


Manipulation of Behavioral Intentions 


It will be recalled that the manipulation of P
significantly influenced Aact but not NB, while
manipulation of P_o influenced both components.
However, results in Table 2 indicated
that varying P_o had a much weaker effect on
Aact than on NB. Thus, variations in P are
expected to produce strong effects on intentions
in all four situations. Variations in P_o, however,
should have relatively little effect on intentions
in Situations 1, 2, and 3, although such variations
may have some effect in Situation 4.

Table 4 presents group means of intentions 
for each of the four hypothetical situations, as 
well as the results of two-way analyses of 
variance. There it can be seen that variations 
in P were reflected in behavioral intentions in 
all situations except Situation 4. It will be 
recalled that the manipulation of P did not 
produce significant differences in Aact in 


Situation 4 (see Footnote 8), and thus no effects
on intentions could be expected. Further,
variations in P_o did not significantly affect
intentions although the results approached
significance in Situation 4 (F = 3.31),
the only situation in which the normative
component was found to carry a significant
weight in the prediction of intentions.

Although the above study demonstrated that 
variations in behavioral intentions and in the 
model’s components could be produced by 
manipulating a subject’s own beliefs (SP) or
his perceptions of the beliefs of relevant others
(SP_o), we felt that the demonstration of
actual changes in behavioral intentions as a
result of changes in the predictor variables (i.e.,
Aact and NB) was important enough to
warrant a small study.


CHANGE STUDY 
Method 


A total of 70 subjects participated in the study. They 
were given Situation 1 in the 70-70 condition. After 
they had read the description of the hypothetical 
situation, they completed the questionnaire described 
above. The subjects then engaged in an unrelated 
activity for approximately 15 minutes. Following this 
interval they were told that they were to consider 
Situation 1 again, but that they would be given some 
additional information about it. This information appeared
immediately after the description of the hypothetical
situation.


TABLE 5


GROUP MEANS OF SP, SP_o, AACT, NB, AND BI AND ANALYSES OF
VARIANCE RESULTS: CHANGE STUDY


Dependent | Normative message | Attitudinal message | ANOVA: F levels
variable  | Before | After    | Before | After      | Message  | Time      | Message × Time
SP        | 70.903 | 65.112   | 70.351 | 48.837     | 38.147** | 132.552** | 43.952**
SP_o      | 69.678 | 46.728   | 69.246 | 68.736     | 66.579** | 138.521** | 126.739**
Aact      | 21.671 | 17.135   | 22.136 | 14.158     | 4.508*   | 308.510** | 23.333**
NB        | 5.671  | 3.371    | 5.506  | 4.964      | 19.806** | 154.662** | [illegible]
BI        | 5.679  | 4.320    | 5.580  | 2.945      | 14.847** | 303.479** | 30.954**

Note. N = 35.

a The first number is the subject’s objective probability of success (P); the
second number is the probability attributed to the subject’s close family and
friends (P_o).
* p < .05.
** p < .01.


Thirty-five subjects, selected at random, received a
“normative” communication, while the remaining 35
subjects were given an “attitudinal” communication.
These communications were designed to reduce either
the subject’s subjective probability of success (attitudinal
message) or the probability attributed to his
close family and friends (normative message). The two
communications were worded as follows:


[Attitudinal message]. At this stage you make a 
much more extensive investigation of the proposed 
project, and you come across some information that 
considerably lowers your estimate of the project’s 
chances to succeed. You again decide to consult 
with your family and friends, and you find that they 
do not share your concern. Indeed, they retain their 
confidence in the project and still believe it has a 
70% chance to succeed. You, however, now believe 
that the project’s likelihood of success is only 30%. 


[Normative message]. At this stage, however, some
of your friends make a much more extensive investiga- 
tion of the project and come across some information 
that considerably lowers their estimates of the 
project’s chances to succeed. They now believe that 
the project’s likelihood of success is only 30%. You, 
however, do not share their concern, and you retain 
your confidence in the project. That is, you still 
believe that it has a 70% chance to succeed. 


Thus, in the attitudinal condition, the subject’s objective
probability of success (P) was now reduced to 30%
and that of his close family and friends (P_o) remained
at 70%. In the normative condition, these probabilities
were 70% and 30%, respectively. After reading the
persuasive communication, each subject completed the
same questionnaire he had previously responded to.


Results 


Based on the hypotheses previously outlined,
it can be predicted that (a) the change in P
(from 70% to 30%) will be reflected in Aact
and (b) the change in P_o (from 70% to 30%)
will be reflected in NB. The impact of this
change in one of the components on intentions,
however, will depend on that component’s
weight in the prediction of behavioral intentions.
Since in Situation 1, used in this change
study, behavioral intentions were shown to be
largely controlled by attitudinal considerations
(see Table 3), it is to be expected that the
attitudinal message (reducing SP and thus
Aact) will produce greater changes in BI than
will the normative message (reducing SP_o and
thus NB).


Two-way analyses of variance were performed,
using SP, SP_o, Aact, NB, and BI as
dependent variables. The two main effects
were message type (attitudinal or normative)
and time (before or after the persuasive communication).
Time was treated as a repeated-measures
factor, since the same subjects were
used before and after the communications. The
results of these analyses as well as the cell
means are presented in Table 5, where it can be
seen that changes in the probabilities supplied
to the subjects affected their subjective probabilities
in the expected manner. The normative
message, directed at SP_o, did indeed
reduce SP_o to a greater extent than did the
attitudinal message, as indicated by the highly
significant Message Type × Time interaction.
Similarly, the attitudinal message reduced SP
(at which it was directed) significantly more
than did the normative message.

The reduction of SP or SP_o had, as expected,
marked impacts on Aact or on NB, respectively.
That is, the attitudinal message (which
reduced SP) produced a considerable reduction
in Aact; the normative message (directed at
SP_o) had a much smaller effect on Aact. This
conclusion is supported by the highly significant
Message Type × Time interaction for
Aact. Similarly, changes in NB were produced
primarily by the normative message rather
than by the attitudinal message, as was again
witnessed by the significant interaction effect.
Finally, there was also a significant Message
Type × Time interaction for BI, indicating
that, as expected, the attitudinal message had
a much stronger impact on behavioral intentions
than had the normative message.

The two main effects, message type and time,
were of less importance in the present analyses.
The results indicate that both message type
and time had significant main effects on the
dependent variables.
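The direction of the Message Type × Time interaction for BI can be read directly from the cell means in Table 5. The sketch below computes the pre-post change under each message and the difference between the two changes. It is a descriptive contrast only, not the analysis of variance reported in the table, and the variable and function names are hypothetical.

```python
# BI cell means from Table 5 as (before, after) for each message type.
bi_means = {
    "normative": (5.679, 4.320),
    "attitudinal": (5.580, 2.945),
}


def pre_post_change(cell_means):
    """Change from the first to the second administration (after - before)."""
    before, after = cell_means
    return after - before


changes = {message: pre_post_change(means)
           for message, means in bi_means.items()}

# Difference between the two changes: negative values mean the attitudinal
# message lowered intentions more than the normative message did.
interaction_contrast = changes["attitudinal"] - changes["normative"]

for message, change in changes.items():
    print(message, round(change, 3))
print("contrast", round(interaction_contrast, 3))
```

Both messages lowered mean intentions, but the drop under the attitudinal message is roughly twice the drop under the normative message, which is the pattern the significant interaction reflects.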


To summarize, the change study confirmed
the results presented earlier. Using a sample of
subjects in a pre-post design it was again
shown that variations in subjective probabilities
tended to be reflected in the appropriate
variables of the present theory. Thus,
a change in the subject’s probability of success
(SP) led to a similar change in Aact, while the
reduction of SP_o (the subjective probability
attributed to relevant others) resulted in a
reduction of NB. Finally, it was shown that
the effects of these changes on behavioral
intentions depended on the regression weights
of Aact and NB in the prediction of BI.
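
The dependence on regression weights can be illustrated with a small sketch. It assumes the linear form BI = w_aact × Aact + w_nb × NB; the weights and change scores below are hypothetical values chosen only for illustration and are not taken from the study.

```python
def predicted_bi_change(delta_aact, delta_nb, w_aact, w_nb):
    """Change in predicted BI under the assumed linear model
    BI = w_aact * Aact + w_nb * NB."""
    return w_aact * delta_aact + w_nb * delta_nb


# Hypothetical weights in which Aact dominates the prediction of BI.
W_AACT, W_NB = 0.7, 0.1

# Hypothetical change scores: each message lowers its own target by 2 units.
attitudinal = predicted_bi_change(-2.0, 0.0, W_AACT, W_NB)
normative = predicted_bi_change(0.0, -2.0, W_AACT, W_NB)

print(f"attitudinal message: {attitudinal:+.2f}")
print(f"normative message:   {normative:+.2f}")
```

With weights of this kind, an equally large reduction in NB moves the predicted intention far less than the same reduction in Aact, which is the pattern the summary describes.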


DISCUSSION AND CONCLUSIONS 


One purpose of the present study was to 
obtain information about possible origins of 
normative beliefs (NB). It was hypothesized 
that one of the factors influencing a person’s 
beliefs about what others expect him to do is 
his perception of the other’s attitude toward 
the act in question (Aact_o). Evidence supporting
this hypothesis was obtained, and it was
further shown that manipulations which pro- 
duced differences or changes in this perceived 
attitude were also found to be reflected in the 
measure of NB. This finding suggests that some 
of the knowledge that has accumulated in the 
attitude area can perhaps be applied to the 
study of normative beliefs. For example, just
as a person’s attitude toward a given behavior
was related to his beliefs about the consequences
of performing that behavior and his
evaluation of those consequences, his perception
of attitudes of relevant others was related
to his perception of their beliefs about the
consequences of performing the act and his
perception of their evaluation of those consequences.
Thus, it was possible to influence a
person’s NB by manipulating his perception
of the beliefs of others about the consequences
of performing a given behavior. It may also be 


perception of others’ evaluations of these 
consequences was manipulated. 


person’s behavioral intentions in the hypothetical
situations were related to his attitudes
toward the behaviors in question and to his
normative beliefs about them. However, it was
found that in all of the hypothetical situations
Aact carried most of the weight in the prediction
of BI; the normative component had a significant
regression weight only [...]
situation.

Induced changes in Aact had significant
effects on BI, while manipulations of NB did
not affect BI significantly. This finding was
anticipated because of the importance of Aact
in determining BI in these situations. The
theory would also predict that in situations
where NB is of primary importance, changes in
this component would affect BI, while manipulations
of Aact would not. This part of the
hypothesis could not be tested in the present
study because in none of the situations employed
were intentions clearly under the control
of normative beliefs. Although previous
research has shown that intentions and overt
behavior can be changed by manipulating
normative beliefs (cf. Ajzen, 1971), it still has
to be demonstrated that manipulation of Aact
will not change intentions in those situations
where intentions are primarily under normative
influence.


INCREASED DELAY OF GRATIFICATION IN YOUNG
PRISON INMATES THROUGH IMITATION
OF HIGH-DELAY PEER MODELS¹


JEROME S. STUMPHAUZER²


Los Angeles County-University of Southern California Medical Center


Bandura and Mischel changed the delay orientation of children by exposing 
them to adult models who displayed opposite delay orientations. In the present 
study these findings were extended to a young prison inmate population. Forty 
18- to 20-year-old inmates who demonstrated immediate gratification orienta- 
tions served as subjects. Two older inmates with prestigious work details were 
models. Two measures of delay of gratification were used at all three phases
(premeasure, postexposure to model, and follow-up) of the study: a series
of 14 choices between something they could have immediately, or something
more valuable for which they would have to wait, and a hypothetical
money-saving measure. Subjects actually received either an immediate or delay
reward at each of the three phases, and the effect of this differential
reinforcement was assessed as well. Exposure to high-delay peer models did increase
delaying behavior, and the effect was maintained 4 weeks later. This increase
in delay of gratification did generalize to two levels of the money-saving
measure. Differential reinforcement was found to have no systematic effect.
Thus, further support for the social learning view of self-controlling
behavior was found. Implications for behavior modification with
juvenile delinquents were found as well.


The ability to work and wait for larger
rewards, later in time, is stressed in virtually
all discussions of normal personality development
(e.g., Bijou & Baer, 1961; Freud, 1946;
Mischel, 1966; Singer, 1955). To learn to
delay immediate gratification in favor of 


1 This paper is based on a dissertation submitted
the requirements for the PhD degree. An earlier
abbreviated version of this paper was read at the
April 1969 meeting of the Florida Psychological
Association, Orlando. These findings have been
published in a privately circulated technical bulletin
series of the Federal Correctional Institution,
Tallahassee, Florida: FCI Research Reports, 1969,
1(6), 1-67. The author wishes to express his appreciation
to Wallace A. Kennedy for his guidance
throughout all phases of this research; to A. Cooper
Price for his support and cooperation at the Federal
Correctional Institution; and to Albert Bandura
and Walter Mischel for their encouragement, helpful
suggestions, and modeling. This study was supported
in part by Grant NL-135 from the National Institute
of Law Enforcement and Criminal Justice. This does
not necessarily indicate the concurrence of the institute
in the statement or conclusions contained herein.

2 Requests for reprints should be sent to Jerome
S. Stumphauzer, Department of Psychology (Unit
III), Los Angeles County-University of Southern
California Medical Center, 1934 Hospital Place, Los
Angeles, California 90033.


later, more valuable reward is an important
part of the socialization process. The criminal
or psychopathic personality may best represent
a failure in this development. Shapiro
(1965) suggested that “the psychopath is the
very model of the impulsive style... his
aim is the quick concrete gain [p. 157].”
Often, juvenile delinquents are also characterized
by this inability to delay gratification.
McCord and McCord (1964) consider the
delinquent “like an infant, absorbed in his
own needs, vehemently demanding satiation
[p. 9].” Empirical support for the view
that delinquents are immediate gratifiers is
offered by Mischel (1961a). Very simply, he
offered children a series of real choices between
something they could have immediately
or something more valuable for which they
would have to wait. He found that delinquents
showed a preference for smaller but
immediate rewards. Similarly, Rosenquist and
Megargee (1969) asked delinquents and nondelinquents
what they would do with various
sums of money (25¢, $2, $20, and $200). As
expected, nondelinquents were more likely to


quents were more likely to say they would
spend it immediately on pleasurable items.


Delay of gratification, measured by series 
of choices, has received a good deal of re- 
search attention from Mischel and his col- 
leagues (Bandura & Mischel, 1965; Mischel, 
1958, 1961a, 1961b, 1966; Mischel & Gilli- 
gan, 1964; Mischel & Grusec, 1966; Mischel 
& Metzner, 1962; Mischel & Staub, 1965). 
Stumphauzer (1970b) systematically altered 
the percentage of delay choices in young 
prison inmates through the use of contingent 
social approval. The present study is an 
extension of the Bandura and Mischel (1965) 
findings. Using the series of choices measure, 
Bandura and Mischel selected two groups of 
children, one showing a tendency to choose 
immediate rewards and the other a tendency 
to choose delayed rewards. Subjects from each 
group were exposed to live models, symbolic 
models, or no models (control group). Adult 
models, also in a series of choices, displayed 
delay orientations opposite to those of the 
subject. For example, subjects originally 
showing preference for immediate rewards ob- 
served a model who preferred delayed, more 
valuable rewards. Both live and symbolic 
modeling were effective in modifying delay 
orientations, although changes induced by 
the live model were somewhat more stable 
over time. 

The Bandura and Mischel study shows 
the modification of delay orientations using 
modeling techniques. Since delinquents and 
youthful offenders tend to show immediate 
gratification orientations, as demonstrated by 
both Mischel (1961a) and Rosenquist and 
Megargee (1969), it was the plan of the 
present study to systematically replicate and 
extend the Bandura-Mischel findings with a 
youthful offender population. Live models
were used with young prison inmates indicating
an immediate gratification orientation.
An additional measure of delay orientation,
other than that displayed by the models, was
needed to examine response generalization. A
measure similar to that of Rosenquist and
Megargee was developed for use as an additional
measure of delaying behavior. Thus,
examinations of both generalization over time, 
provided by a follow-up measure, and gen- 
eralization to another but similar response 
were made. 


While a number of studies demonstrate 
control of delinquent behavior by reinforce- 
ment variables (Stumphauzer, 1970a), some 
support for the use of modeling techniques for 
the modification of delinquent behavior is 
available as well. In a “token economy” for
delinquents, Cohen, Filipczak, Bis, and Cohen
(1966) found unexpectedly that delinquents
began to imitate the experimenters. The subjects
went so far as to “spend” some of their
points to acquire clothing similar to that of
the staff. Unfortunately, the acquisition of
these imitative behaviors was simply noted,
and the imitative behavior received no
systematic investigation. Truax, Shapiro, and
Wargo (1968) exposed delinquents about to 
begin group therapy to tape recordings of 
other ongoing groups in the hope that they 
would vicariously learn what was expected 
of them in the group. Results, in the form 
of changes in self-concept for Minnesota 
Multiphasic Personality Inventory profiles, 
suggest that the vicarious experiences of 
group therapy did not have positive effects. 
No mention was made of behaviors observed 
in the tapes and subsequently performed by 
the delinquents. Similarly, Sarason (1968) 
demonstrated the effects of modeling experi- 
ences in groups of institutionalized delin- 
quents, but his dependent measures were only 
indirectly related to the modeling. Hartmann 
(1969) did show the acquisition of aggres- 
sive behavior (ostensibly delivering shocks to 
another person) in delinquents through imita- 
tion. In the present study it was necessary to 
consider who would make a particularly good 
(efficient) model for delinquents or young 
prison inmates to imitate. In reviewing the 
literature, Bandura (1962, 1969) has found 
effective models to be attractive, rewarding, 
prestigeful, competent, powerful, and to have 
high status. Schwitzgebel (1964) suggested 
using only those who are somewhat older 
than the delinquents and who are employed 
in some trade feasible for the delinquent 
to enter and thus provide a role model and 
a source of new patterns of behavior. Bandura
(personal communication, April 1968)
further suggested that delinquents themselves
might be used as models. Delinquents
who have nearly completed the program in
an institution, and who are relatively near
the time of release, may be effective models
for more newly admitted inmates. Such
“successful” inmates may well appear competent,
prestigeful, and to have high status.
The use of peer models is further supported
by those many discussions (e.g., Longstreth,
1968) of the important influence of peers in
the socialization process.


METHOD 
Subjects 


The 40 subjects in this study were inmates of 
the Federal Correctional Institution for Youthful 
Offenders in Tallahassee, Florida. This is a medium 
security institution with a total of approximately 
500 inmates, who reside in four dormitories and 
range in age from 18 to 26. The subjects were
(a) newly admitted inmates, (b) from 18 to 20 years
of age, (c) Caucasian males, and (d) serving a
sentence of at least 3 months’ duration. With the
use of the choice list, the 40 subjects were selected
from groups of newly admitted inmates as preferring
immediate gratification (more than 70% immediate
reward choices). Of the 40 subjects, 20 were exposed
to a model, and 20 were assigned to a control group
with no exposure to a model.


Models 


In the light of the model characteristics and sug- 
gestions discussed earlier, two older, somewhat 
prestigious inmates were selected as models for this 
study. They were confederates and were told what
behavior to display for subjects to observe. In each
case, relative to the subject, the model was older
(ages 21 and 23), had been at the institution longer
(several months), was relatively near the time of
discharge, and had a prestigious work assignment
(institutional photographer and X-ray technician).
New inmates were assigned initially to a low-prestige
detail such as custodian or food-service worker.
Each model had one “professional” contact with each 
subject, either taking his identification photograph 
or chest X ray on admission to the institution. To 
guard against contamination of the experiment by 
extensive contact of models and subjects, no model 
was paired with a subject who resided in the same 
dormitory as the model. Also, subjects were exposed 
to each model at a rate of no more than one subject 
per week. Since there were two models, each served 
as model for 10 of the 20 model-exposed subjects. 


Choice List 


The four choice lists (A, B, C, and D) were 
developed for this particular population. Money is 
available to these inmates only in their institutional 
accounts. Inmates are permitted to spend this money 
in the commissary on certain hours during the week. 
Examples of items available are cigarettes, cigars,
various candy and food snacks, magazines, and personal
grooming needs. Since most of these items
are sought after by inmates, they represent a ready
list of realistic rewards for this population. Money
may be used as a reward by adding it to the
inmates’ account. A pool of choices between immediate
or delayed but more valuable rewards was developed.
As in previous research, half of the choices were
monetary (e.g., 35¢ now or 50¢ in 1 week) and
half were between small articles (e.g., small candy
bar now or large one in 1 week). The lists
used by Mischel (personal communication, May
21, 1968) were used as a model as far as the ratio of
quantity of reward for given delay period and the
length of delay periods. For each of four lists, seven
monetary and seven small-article choices were selected
randomly from the pool of 100 choices. The resulting
four choice lists, consisting of 14 choice items each,
were administered in a random order to 30 newly
admitted inmates as a preliminary study of the scales
themselves. Cross-correlations of percent delay
choices on the four choice lists, ranging from .87 to
.94, were found to be highly statistically significant
(p < .001). For each subject, List A, B, or C served
as the initial measure of delay orientation or
rate. List D was used for the model’s choices,
which 20 of the subjects observed. List A, B, or C
was used to measure the subject’s delay orientation
immediately after exposure to the model. Finally,
List A, B, or C was used as the follow-up or
temporal generalization measure 1 month later.
Approximately one-sixth of the subjects in each
group received Lists A, B, and C in the following
orders across the three phases of the study: ABC,
ACB, BAC, BCA, CAB, and CBA.
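
The counterbalancing of Lists A, B, and C across the three phases can be sketched as follows. This is an illustrative reconstruction only: the round-robin assignment rule and the subject identifiers are assumptions, since the text states only that roughly one-sixth of each group received each order.

```python
from itertools import permutations


def counterbalanced_orders(lists=("A", "B", "C")):
    """Return every order in which the lists can be given across the phases."""
    return ["".join(order) for order in permutations(lists)]


def assign_orders(subject_ids, orders):
    """Cycle through the orders so each is used about equally often.

    The round-robin rule is an assumption made for illustration.
    """
    return {sid: orders[i % len(orders)] for i, sid in enumerate(subject_ids)}


orders = counterbalanced_orders()
# Hypothetical subject ids 1..20 for one group of 20 subjects.
assignment = assign_orders(range(1, 21), orders)

for order in orders:
    n = sum(1 for given in assignment.values() if given == order)
    print(order, n)
```

Because 20 subjects cannot be divided evenly among six orders, each order is used three or four times per group, which is consistent with "approximately one-sixth."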


Money-Saving Measure

The Rosenquist and Megargee (1969) measure of
delay orientation, simply asking subjects what they
would do with certain amounts of money, was
developed into a forced-choice measure rather than
an open-ended measure. By providing subjects with
five choices—(a) spend all of it, (b) spend most
(75%) of it, (c) spend half and save half of it,
(d) save most (75%) of it, and (e) save all of it—
a numerical index of delay orientation could quickly
be determined and be available for later statistical
analysis. Since inmates could spend no more than
$20 a month, the $200 item used by Rosenquist and
Megargee seemed unrealistic for this population and
was not included. Therefore, three questions made
up the money-saving measure for this study: (a) If
25¢ were added to your account, what would you
do with it? (b) If $2 were added to your account,
what would you do with it? and (c) If $20 were
added to your account, what would you do with it?³
Thus, a choice list and the money-saving measure
were given at each of the three phases of the study
to all 40 subjects.
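
The forced-choice format yields a numerical index directly. The sketch below shows one way the five alternatives could be converted to a percent-saved score; the function name and the response dictionary are hypothetical, while the 0% to 100% values follow from the wording of the alternatives.

```python
# Percent of the money saved under each forced-choice alternative:
# (a) spend all, (b) spend most (75%), (c) half and half,
# (d) save most (75%), (e) save all.
SAVE_PERCENT = {"a": 0, "b": 25, "c": 50, "d": 75, "e": 100}


def score_money_saving(responses):
    """Map each item's chosen alternative to the percent the subject would save.

    `responses` maps an item label (e.g. "25c", "$2", "$20") to a letter a-e.
    """
    return {item: SAVE_PERCENT[choice] for item, choice in responses.items()}


# Hypothetical responses from one subject at one phase.
example = {"25c": "a", "$2": "c", "$20": "d"}
scores = score_money_saving(example)
print(scores)
```

Scoring each amount separately keeps the three items available as distinct measures of response generalization.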


Reinforcement 


To make choices on the choice lists realistic, the
subjects were actually given one of their 14 choices
each time a list was administered to them (after
Bandura & Mischel, 1965). Since they received
either an immediate or delayed reward, this could
be considered a differential reinforcement of that
particular type of responding. To determine any
effect of this reinforcement, half (10) of the
model-exposed and half (10) of the no-model group
received only delayed rewards. The other half of each
group received only immediate rewards. Each time,
of course, the subjects did receive one of the items
that they had actually chosen. Since each subject
responded to three choice lists, each received a total
of three rewards.

3 Copies of the four choice lists and the money-saving
measure may be obtained from the author.


Procedure 


The procedure closely followed that of Bandura
and Mischel (1965) with the notable exceptions that
(a) only live models were used, (b) all subjects
were inmates who had immediate gratification
orientations, (c) another measure of delay orientation
was added to examine response generalization,
and (d) the same experimenter conducted all three
phases of the study.

The first phase of the experiment (premeasure) 
consisted of the selection of subjects. One of the 
choice lists was administered to newly admitted, 
18- to 20-year-old Caucasian inmates. These inmates 
were called in individually, and the experimenter read 
instructions similar to those of Bandura and Mischel: 


I am interested in finding out how people choose 
when they are offered different kinds of choices. 
I will be asking you to make some choices: the 
choices will be between two things both of which 
you want, but if you choose one you can’t have 
the other. Answer each question to show what 
you would really take. This is not a test, there are 
no right or wrong answers. For example, the 
choice might be 50 cents now or 75 cents next
week. If you took the 50 cents now you could
not have the 75 cents next week and if you wait 
for the 75 cents next week you could not have the 
50 cents now. I will offer you 14 such choices. 
Choose very carefully and realistically because in 
one of the choices you will really get what you 
choose. You won’t know which one of your 14 
choices you’ll actually get until the very end, so 
choose very carefully each time. 


The 14 items of a choice list were then displayed on 
a desk for the subject, and his choices were recorded. 
In an earlier study, Stumphauzer (1970b) found that
social reinforcement (“good” or “mm-hmm”) could
control the percentage of delay choices in this
population. Therefore, care was taken in the present study
to simply record the data matter-of-factly. Next,
the money-saving measure was given. If the subject
was indeed to receive an immediate reward, he was
given one of his immediate choices at that time.
If he was to receive a delayed reward, he was given
one of his delay choices after the specific time period.
All subjects received a reward within 4 weeks, with
4 weeks being the longest delay period. The number of
delayed reward choices for each subject was tabulated,
and only those subjects choosing less than 30%
delay choices (four choices or less) continued as
subjects in the study. Forty subjects were selected
in this manner; 20 were assigned to the modeling and
20 to the control group. Subjects in the two groups
were matched with regard to number of delay choices
on this first measure.
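
The selection rule can be checked with a short calculation. The sketch below is illustrative; the function names are hypothetical, and only the 14-item list length and the 30% cutoff come from the text.

```python
def percent_delay(n_delay, n_items=14):
    """Percentage of delayed-reward choices on one choice list."""
    return 100.0 * n_delay / n_items


def continues_in_study(n_delay, n_items=14, cutoff=30.0):
    """True if the subject chose fewer than `cutoff` percent delayed rewards."""
    return percent_delay(n_delay, n_items) < cutoff


# Which counts of delay choices (0 through 14) satisfy the criterion?
eligible_counts = [n for n in range(15) if continues_in_study(n)]
print(eligible_counts)
```

Four delay choices out of 14 is about 28.6%, which falls under the cutoff, while five (about 35.7%) does not, so "less than 30%" and "four choices or less" describe the same rule.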

Approximately 4 weeks after the premeasure, con- 
trol subjects were administered another choice list 
and money-saving measure, and again they received 
one of their choices within the next 4 weeks. Sub- 
jects in the modeling condition individually observed 
a model who showed the opposite delay orientation; 
that is, the model consistently made delay of 
self-reward choices to Choice List D. The subject 
was led to believe that the model was also a subject. 
Both model and subject were called in at the same 
time. In order to make the difference between model 
and subject explicit to the subject, a short joint 
interview was conducted by asking such questions 
as “How old are you? How long have you been at
F.C.I. [Federal Correctional Institute], how many
months? What is your work detail here?” Next, the
experimenter read the instructions and turned to the 
model saying, “Let’s start with you.” The choices of 
Choice List D were displayed for the model (while 
the subject was observing) and the model’s responses 
recorded. Rather than summarize his delayed orienta- 
tion rationale, as in the Bandura-Mischel (1965) 
study, it seemed more natural in this setting to have 
the models make two side comments while they 
were making their choices. For one monetary item 
they said, “That’s pretty good interest, I’ll take the
_______ in _______ weeks.” For one of the other
choices, the model said, “I can wait for that.” Next,
the model was given the money-saving measure on
a clipboard and in such a way that the subject could
not observe the model’s responses to this measure.
The model was then excused under the pretext, “You
can go now, you probably have things to do.” The
items of another choice list were then displayed for
the subject and his choices recorded. Again, he
responded to the money-saving measure, and he
received one of his choices, depending on whether he
was in the immediate or delayed reinforcement half
of his group, within 4 weeks. The results of this
choice list provided the postexposure to model
measure of delay orientation and indicated any immediate
modeling effect. The response to this second
money-saving measure indicated any immediate response
generalization effect.

Four weeks later, for the final (follow-up) phase 
of the study, each of the 40 subjects was administered
a third choice list and the money-saving measure.
For the 20 experimental subjects, the results
of this choice list determined the stability of any 
modeling effect, over this period of time. If there 
was a modeling effect, the money-saving measure 
provided an index of response generalization to a 
second measure of delay behavior not observed 
directly in the model. For the control group, results
of the three choice lists and the three money-saving
measures indicated any change in delay behavior just
as a function of differential reinforcements and time
and not as a function of exposure to high-delay
peer models.

Fig. 1. Mean percentage of delayed reward choices
as a function of modeling condition, reinforcement
condition, and phases of the experiment.


RESULTS 


A repeated-measures analysis of variance
was computed on the percentage of delay
choices across the three phases of the experiment
(see Figure 1), and the modeling effect
was found to be highly statistically significant
(F = 108.29, df = 1/36, p < .001). Further
comparisons of the pairs of means across
experimental phases show that both groups
that were exposed to a high-delay model
significantly increased their delay of gratification
regardless of whether they actually
received delayed (t = 8.21, df = 9, p < .001)
or immediate (t = 5.13, df = 9, p < .001)
reinforcements. Furthermore, this modeling 
effect was maintained to a significant degree 
4 weeks after exposure to the high-delay 
peer models for both the delayed (t = 6.72, 
df =9, p< .001) and the immediate (t = 
3.46, df =9, p < .01) reinforcement groups. 
No significant change in percentage of delay 
choices was found in the groups which were 
not exposed to a model. 

As a measure of response generalization,
that is, generalization to a second measure
of delay of gratification which was not directly
observed in the model, a series of three
hypothetical money-saving measures was administered
to all subjects at all three phases of
the study. Three amounts of money were
used: 25¢, $2, and $20. The percentage that
subjects would save was tabulated at 0%,
25%, 50%, 75%, or 100% due to the forced-choice
nature of the measure. Figure 2 shows
the mean percentage the subjects would save
of 25¢ as a function of experimental phases.
A repeated-measures analysis of variance was
computed and, as in the delay choice measure,
the modeling effect was found to be
highly significant (F = 31.71, df = 1/36,
p < .001). No other source of variance was
found to reach the .05 level of significance.


Further comparisons by 


reinforcement group (t = 3.75, df = 9, p < .01).
The modeling effect was maintained 4
weeks later in both the delayed (t = 2.94,
df = 9, p < .05) and immediate (t = 2.28,
df = 9, p < .05) reinforcement groups. Again,
no significant changes were found in the
no-model groups.

Fig. 2. Mean percentage that subjects would save
of 25¢ as a function of modeling condition,
reinforcement condition, and phases of the experiment.

Mean percentage that subjects would save
of $2 as a function of experimental phases is
presented in Figure 3. What appeared to be
substantial differences between groups at the 
premeasure phase were noted. However, a 
simple analysis of variance was computed on 
these values, and the F ratio did not reach a 
statistically significant level. A repeated- 
measures analysis of variance was computed 
and revealed a significant Modeling × Phases
interaction effect (F = 3.91, df = 2/72, p <
.05). No other F value reached the .05 level
of significance in this analysis. In an attempt
to further identify the sources of variance,
two sets of t tests between means were computed:
between pairs of means across phases
and between the groups at each of the three
phases. Results revealed that no t value
reached significance. Thus, modeling,
reinforcement, and phases were found to have no
statistically significant effect by themselves. In
combination, however, modeling and phases 
did have a significant interaction effect on 
this $2 measure. 
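
The degrees of freedom reported for these analyses follow from the design (40 subjects, a 2 × 2 between-subjects layout of modeling by reinforcement, and three repeated phases). The sketch below recomputes them; the function names are hypothetical, and the formulas are the standard ones for a mixed design with one repeated factor, offered here as a check rather than as the author's own computation.

```python
def mixed_design_df(n_subjects, n_between_cells, n_phases):
    """Degrees of freedom (effect, error) for a mixed design in which
    subjects are nested in between-subjects cells and measured at each phase."""
    error_between = n_subjects - n_between_cells
    error_within = (n_phases - 1) * error_between
    return {
        "modeling": (1, error_between),              # two-level between factor
        "phases": (n_phases - 1, error_within),      # repeated factor
        "modeling_x_phases": ((2 - 1) * (n_phases - 1), error_within),
    }


def paired_t_df(n_per_group):
    """Degrees of freedom for a t test on paired scores within one group."""
    return n_per_group - 1


df = mixed_design_df(n_subjects=40, n_between_cells=4, n_phases=3)
print(df["modeling"], df["modeling_x_phases"], paired_t_df(10))
```

These values agree with the df = 1/36 reported for the modeling effect, the df = 2/72 reported for the Modeling × Phases interaction, and the df = 9 reported for the within-group t tests.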

Finally, Figure 4 shows the mean percentage
that subjects would save of $20 as a
function of experimental phases. In no case
did an F value reach the .05 level of statistical
significance in a repeated-measures
analysis of variance. Thus, no modeling,
reinforcement, phases, or interaction effects were
found for the $20 measure of response
generalization.

Fig. 3. Mean percentage that subjects would save
of $2 as a function of modeling condition,
reinforcement condition, and phases of the experiment.

Fig. 4. Mean percentage that subjects would save
of $20 as a function of modeling condition,
reinforcement condition, and phases of the experiment.


DISCUSSION


In this study it was found that not only 
did exposure to high-delay peer models im- 
mediately increase percentage of delay choices, 
but that this effect was maintained 1 month 
later as well. Further, this change in delay 
of gratification through imitation did gen- 
eralize to two hypothetical money-saving 
measures (25¢ and $2). No effect on the 
$20 measure was found. The $20 measure 
may have been unrealistic in the population 
since, as noted earlier, inmates can spend a 
maximum of only $20 each month. Actually,
very few have this large an amount in their
accounts. Rosenquist and Megargee (1969)
also found the larger amounts of money
unrealistic in their delinquent populations. Since
models displayed delay responses only to
choices between items ranging in values less
than $1, there would be less reason to expect
generalization to a measure so different in
value as $20. Indeed, with regard to
generalization, the data do order themselves on a
similarity dimension, the effects becoming
increasingly weaker as similarity decreased. The
value of the objects chosen by models varied 
from about 25¢ to $1. A strong modeling 
effect was found for similar choices of objects. 
Of the generalization measures, the “save 25¢” 
was the most similar in value, and a strong 
effect was found. For the “save $2” measure 
a weaker effect was found, and no effect on 
the least similar measure, “save $20,” was 
found. 

Since the same experimenter conducted all 
three phases of the study, it might be ques- 
tioned what effect his repeated association 
with the phases might have had. A stronger 
case for a modeling effect may have been
made if different experimenters had been
used for the modeling and follow-up phases
of the study. Bandura and Mischel (1965)
did use such controls and did find a strong
modeling effect. Care was taken in the present
study to avoid any social reinforcement effect
(Stumphauzer, 1970b). One “message”
definitely given by the experimenter, which might
be taken as a cue as to what he wanted, was 
his differential reinforcement of either delayed 
or immediate reinforcement choices. The data 
suggest no such reinforcement effect. More 
precisely, when initially low-delay subjects 
(see Figure 1) were on the one hand rein- 
forced for immediate reward choices but on 
the other hand exposed to a high-delay model, 
only an increase in delay choices was found, 
suggesting only a strong modeling effect. 

It is the psychoanalytic view (e.g., Freud, 
1946; Singer, 1955) that the beginnings of 
the delaying mechanism signal the emergence 
of the reality principle, and further that deter- 
minants of the “mechanism” are sought in 
hypothetical internal events—ego organiza- 
tions and energy systems. In contrast, both 
the Bandura-Mischel findings and the present 
study support the social learning view that 
self-controlling behavior is determined by 
external, social stimulus events. Another alter- 
native theory of the development of self- 
control is offered by Bijou and Baer (1961). 
Their view is that self-control is learned as 
are all other operant responses, through rein- 
forcement. The findings of the present study 
do not support such a view since differential 
reinforcement was found to have no effect on 
delay behavior. This is not to imply that 
delay behavior cannot be controlled by manipulation 
of reinforcement variables. Indeed, 
Stumphauzer (1970) was able to systematically 
change the percentage of delay choices 
through the use of social reinforcement. The 
point is that in the present study extrinsic 
reinforcement was not a necessary condition 
to achieve the change in delay orientation. 
As Bandura (1962) suggested in his formula- 
tion of imitative learning, a behavior may be 
acquired simply through observation. Reinforcement 
variables may then play a role 
in subsequent performance of the learned 
responses. 

Gewirtz and Stingle (1968) have made a 
strong case for the learning of generalized 
imitation as the basis for identification. Baer 
and Sherman (1964) see the key to understanding 
imitative learning in the closer 
examination of generalized imitation. The 
present study did find generalization of imitation 
to a second series of responses within 
the larger response class, delay of gratification. 
Low-delay inmates who were exposed to 
high-delay models not only increased their 
percentage of delay choices, but also their 
hypothetical money-saving activity on a measure 
not directly observed in the model. This 
generalization of imitation effect, especially 
when considered in its extreme (identifica- 
tion), is very important when looking forward 
to the use of imitation in behavior therapy 
programs. If the imitation were only of the 
specific response displayed by the model, and 
there were no hope of generalization to other 
somewhat similar responses, then imitative 
learning would probably have little future use 
as a behavior modification regime. The 
present study did find the generalization, and 
the effect was maintained 1 month later. It 
is suggested, within the social-learning view 
supported here, that generalized delay of 
gratification is achieved in normal personal- 
ity development through repeated modeling 
of delaying behavior by several models 
(parents and peers) and in many different 
situations. Further, it is likely that imitation 
of delay behavior then meets with social 
approval and extrinsic reinforcement. In cer- 
tain delinquents and adult criminal offenders, 
these particular socialization agents may not 
have been present or, indeed, the opposite 
may have been the case—there may have been 
modeling and reinforcement of impulsive, 
hedonistic, and antisocial behavior. In keeping 
with the above discussion, it is suggested that 
a behavior modification program aimed at an 
increased delay of gratification in delinquents 
and youthful offenders would require re- 
peated modeling of high-delay behavior by 
several models in diverse situations. Furthermore, 
reinforcement variables would have 
to be manipulated to achieve more generalized 
delay of gratification (see Stumphauzer, in 
press). 
MARITAL DYSFUNCTIONALITY AND TWO-PERSON 
NON-ZERO-SUM GAME BEHAVIOR: 
CUMULATIVE MONADIC MEASURES 


DAVID C. SPEER 1 
Amherst H. Wilder Foundation, St. Paul, Minnesota 


The purpose of this study was to investigate the relationships between marital 
communication and satisfaction and monadic Prisoner’s Dilemma game measures 
among married couples. An attempt was made to deal with verisimilitude 
problems of the standard 2 × 2 game by incorporating two new response 
choices in three- and four-choice games. Sixty clinic and 60 PTA (Parent- 
Teacher Association) married couples were assigned randomly to one of four 
games (a standard and three experimental) of the Prisoner’s Dilemma. Analysis 
indicated a total absence of reliable clinic-PTA and sex differences among the 
four response variables and the four forms of the game. Reliable time effects 
were found, however, that are consistent with previous findings. Correlational 
analysis, although providing limited evidence for a relationship between adequacy 
of communication and game behavior for one form of the game, suggested 
a stronger relationship between education and game choices. It is concluded 
that attempting to deal with the verisimilitude problems of the standard 
Prisoner’s Dilemma format by increasing matrix complexity leads to a stronger 
relationship between monadic game measures and comprehension than between 
game measures and personality or marital variables. 


Recent reviews of the experimental game 
literature (Gallo & McClintock, 1965; 
Guyer, 1967; Vinacke, 1969) indicate that 
although the Prisoner’s Dilemma has been 
investigated extensively, a vast majority of 
studies have focused on game characteristics 
(payoff magnitudes and contingencies, 
matrix format, etc.) rather than 
subject variables or characteristics. Vinacke, 
in his excellent review, has labeled the 
former task and situational variables and 
the latter personality variables. Because of 
these reviews, the rather meager literature on 
subject or personality variables is not re- 
viewed here. Suffice it to say that there is 
almost no consistently replicated documenta- 
tion of relationships between personal char- 
acteristics and Prisoner’s Dilemma choice 
measures, with the possible exception of sex 
differences. Another aspect of Prisoner’s Dilemma 
research, about which Pruitt (1967) 
has raised question, is the verisimilitude of 
the standard Prisoner’s Dilemma format. 
Pruitt believes that adherence to the 2 × 2 
matrix and the one-choice-one-payoff procedure 
results in a lack of isomorphism between 
the game and many real-life, mixed-motive 
situations. Miller (1967) and Pruitt 
(1967) have attempted to deal with this 
problem. 

1 Requests for reprints should be sent to David C. 
Speer, who is now at the Suicide and Crisis Service, 
560 Main Street, Buffalo, New York 14202. 

Another curious phenomenon, as Vinacke 
(1969) pointed out, is that investigators have 
usually avoided studying the game behavior 
of subjects who have a natural relationship 
history. Thus, when subject pairs (rather 
than a subject and a programmed surrogate) 
are asked to play the game, research designs 
have usually demanded that the participants 
be strangers to one another. The only two 
studies of which this author is aware that 
investigated subject-relationship status as an 
independent variable (Oskamp & Perlman, 
1966; Swingle & Gillis, 1968) provided in- 
conclusive findings. 

Although Deutsch (1960) and Solomon 
(1960) had suggested previously that the 
standard response choices tap the dimension 
of interpersonal trust, Watzlawick, Beavin, 
and Jackson (1967) have suggested that the 
standard C choice can be likened to trusting 
behavior in a marital relationship, and that 
the D response can be thought of as a kind of 
“playing safe” or distrust response. 

The purpose of this study was to investigate 
the relationships between the cumulative 
Prisoner’s Dilemma game behavior of indi- 
viduals (monads) and marital adjustment 
among married couples, that is, among sub- 
ject dyads with an existing relationship. For 
purposes of this study, marital adjustment 
was defined operationally in terms of (a) in- 
volvement versus noninvolvement in psychi- 
atric or psychological counseling, (b) ade- 
quacy of marital communication, and (c) 
marital satisfaction. Furthermore, the effect 
of two additional response alternatives in the 
game matrix is explored in an effort to get at 
the verisimilitude problem. The first of these, 
suggested by Miller’s (1967) “no-play” response 
option, is a type of withdrawal-from-interaction 
response in which one player’s 
choice overrules the other’s and results in a 
loss of points for both. The second experi- 
mental response alternative is an attack, or 
put-down, response which results only in a 
loss of points to the partner. 

Two hypotheses were tested: 

Hypothesis 1: Married couples being seen 
at a psychiatric clinic emit fewer trust or 
cooperative responses, and more competitive, 
withdrawal, and attack responses than couples 
who are not involved in psychiatric treatment. 

Hypothesis 2: The frequency of trust, com- 
petitive, withdrawal, and attack choices cor- 
relates reliably with measures of marital com- 
munication and marital satisfaction, and to 
a higher degree than with age, education, and 
occupational level. 


METHOD 
Subjects 


The subjects consisted of 60 married couples 
drawn from a child guidance clinic population (CGC 
sample) and 60 married couples recruited from local 
Parent-Teacher Associations (PTA sample). Clinic 
couples were included in the study only if diagnostic 
evaluation indicated that their child’s difficulty was 
due to significant emotional or family problems. 
Forty percent of the child patients received diagnoses 
of transient personality disorder; 35%, behavior dis- 
order; and the remaining 25% were distributed 
among the neurosis, special symptom, and person- 
ality disorder categories. Fifty percent of the child 
patients arrived with the main presenting complaint 
of some type of conduct or management problem; 
25%, with an achievement or learning problem; 18% 
with anxiety or withdrawal symptoms; and the 
remainder with psychophysiological, cognitive development, 
or substance abuse symptoms. Clinic couples 
were recruited by mail and telephone on a totally 
voluntary basis. 

The PTA sample was matched with the clinic 
sample on the basis of sex and grade placement of 
a child within the family, and on urban-suburban 
locus of residence. Prospective couples were con- 
tacted by mail and telephone, and were paid $8 per 
couple for their participation. Couples were ex- 
cluded from the study if any member of their pri- 
mary family was receiving any form of psychological 
counseling at the time of the study. 

Sixty percent of the qualifying CGC couples par- 
ticipated, and 38% of the PTA couples who were 
asked to participate actually did so. The couples 
were then assigned randomly to one of four game 
matrix conditions. Thus, 15 clinic and 15 PTA cou- 
ples (a total of 30 husbands and 30 wives) played 
each of four experimental matrix games. 
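
The assignment described above (15 clinic and 15 PTA couples per game) can be sketched as a random assignment carried out separately within each sample. This is only an illustration: the function name, the couple identifiers, and the seeds are hypothetical, and the source does not say how the randomization was actually performed.

```python
import random


def assign_couples(couple_ids, n_games=4, seed=None):
    """Randomly split one sample of couples evenly across the game matrices.

    Hypothetical helper: the source states only that assignment was random
    and that each game received 15 couples from each sample.
    """
    rng = random.Random(seed)
    shuffled = list(couple_ids)
    rng.shuffle(shuffled)
    per_game = len(shuffled) // n_games
    return {
        game: shuffled[(game - 1) * per_game: game * per_game]
        for game in range(1, n_games + 1)
    }


# Made-up identifiers for the 60 clinic (CGC) and 60 PTA couples.
clinic = [f"CGC-{i:02d}" for i in range(1, 61)]
pta = [f"PTA-{i:02d}" for i in range(1, 61)]

clinic_groups = assign_couples(clinic, seed=1)
pta_groups = assign_couples(pta, seed=2)
```

Assigning within each sample separately is what guarantees the equal clinic and PTA cell sizes reported for every game.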


Game Matrices 


The four matrices are presented in Figure 1. 
Game 1 is a standard Prisoner’s Dilemma matrix in 
which the C response is the usual cooperation or 
trust response. The D response is the usual defection, 
competitive, or exploitative response. The point pay- 
offs for the various permutations of C-D responses 
by the two players are constant over the four games. 
Game 2 consists of the same standard C and D re- 
sponses plus a put-down or attack response. The 
player making a P response gains no points for himself, 
and generally causes a loss of points for his 
partner. Game 3 consists of the same C and D 
responses and payoffs plus a withdrawal or no-play 
response. As with Miller’s (1967) no-play choice, a 
player making a W response neutralizes or overrides 
the other player’s C or D responses. Game 4 is simply 
an inclusion of all four, C, D, P, and W, responses 
in a single game matrix. 


Procedure 


Subjects were informed that they were being asked 
to perform an experimental two-person task. Sub- 
jects played the game to which they were assigned 
randomly only with their own spouse, and they knew 
that it was their spouse with whom they were playing. 
The subjects were seated back to back in the 
same room, and the experimenter sat at a 90-degree 
angle to them in such a position as to be able to see 
the matrix game boards held by each subject. 

The following instructions were read to the sub- 
jects: 


The two of you are going to participate in a task 
in which you may receive various numbers of 
points. The number of points you each receive will 
depend on your spouse’s choice as well as your 
own choice. Each of you has a task board before 
you. Notice the red letters on the bottom of your 
board. These are the choices you can make. The 
black letters on the left side of the board are the 
choices that your spouse can make. In each round 
you will each make only one choice by pointing 
to one of the red letters, A or B, at the bottom of 
your board. Notice that there are two numbers in 
each smaller square on your board. The number in 
red is the number of points you may receive, and 
the number in black is the number of points your 
spouse may receive. The number of points you 
each receive will depend on the letter you choose 
and the letter your spouse chooses. In each round 
you will each make a choice by pointing to one of 
the red letters at the bottom of your board. After 
I have written down your choices, I will tell you 
how many points you have each gained (or lost). 
For example, [left] if you choose A and [right] 
you choose B, [left] you will receive 0 points and 
[right] you will receive 5 points. If you both 
choose A, you will each receive 3 points. Notice 
that after I have told you the number of points 
that each of you has received, it is possible for 
you to figure out which letter your spouse chose. 
You can do this by finding the square with those 
particular points in it and then looking to the 
left to see which black letter is opposite that row. 
This task will consist of a large number of rounds. 
When I have told you the number of points that 
each of you has received, the round will be over 
and you should decide what your next choice will 
be. After we begin, it is very important that you 
do not communicate with each other in any way. I 
will let you know how many points you have 
when we have finished. Do you have any ques- 
tions? Let’s try a couple. Would each of you point 
to a red letter? [two practice rounds, with the 
experimenter inquiring about whether each spouse 
could determine the other’s choice]. Remember, 
after we begin do not speak to each other. Simply 
point to the red letter of your choice. I will tell 
you how many points you have received and we 
will then go on to the next round. Do you have 
any questions? All right, let’s begin. 


Each game consisted of 75 iterative and simul- 
taneous choices by each pair of subjects. Because of 
the intent to have the couples’ relationship be the 
main determinant of their game behavior, uncon- 
taminated by monetary influences, payoffs consisted 
only of points received; the points awarded had no 
monetary value. 
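
The instructions note that a player can work out the spouse's choice from the announced points. A minimal sketch of that lookup for the standard two-choice game follows. The payoffs for A-A (3, 3) and A-B (0, 5) are taken from the instructions; the B-B payoff is not stated in the text and is an assumed placeholder, and all names are hypothetical.

```python
# (own choice, spouse's choice) -> (own points, spouse's points)
PAYOFFS = {
    ("A", "A"): (3, 3),  # stated in the instructions
    ("A", "B"): (0, 5),  # stated in the instructions
    ("B", "A"): (5, 0),  # mirror image of the A-B cell
    ("B", "B"): (1, 1),  # ASSUMED value; not given in the text
}


def infer_spouse_choice(own_choice, own_points, spouse_points):
    """Find the cell matching the announced points and return the spouse's letter.

    Returns None if no cell in the matrix matches.
    """
    for (own, spouse), points in PAYOFFS.items():
        if own == own_choice and points == (own_points, spouse_points):
            return spouse
    return None


# A player who chose A and is told "0 points for you, 5 for your spouse"
# can conclude that the spouse chose B.
spouse_letter = infer_spouse_choice("A", 0, 5)
```

The lookup works because every cell in a given row of the matrix carries a distinct pair of points.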

At the end of the 75 trials, or choices, the subjects 
completed brief questionnaires: the Primary Com- 
munication Inventory (Navran, 1967), which as- 
sesses the quality of communication between husband 
and wife 2; and the short Marital Adjustment Test 
(Locke & Wallace, 1959), which measures degree of 
marital satisfaction or happiness. 


RESULTS 


Preliminary to examining the game data, 
analysis of the questionnaire data indicated 
reliable CGC-PTA sample differences on 
both the Primary Communication Inventory 
(F = 29.98, df = 1/224, p = .01) and the 
Marital Adjustment Test (F = 46.27, df = 
1/224, p = .01) and an absence of reliable 
sex differences on both questionnaires. Thus, 
clinic couples reported poorer communication 
and less marital satisfaction than did the PTA 
couples. 

2 The Primary Communication Inventory (Navran, 
1967) has been demonstrated to reliably differentiate 
“happily married couples” and couples seeking 
marital counseling. It generally assesses the composite 
openness, frequency, completeness, and clarity 
of the couple’s communication. 


Hypothesis 1 


The mean percentages of responses averaged 
over the 75 trials, by sample, sex, and 
game matrix, are reported in Table 1. Statistical 
analysis of the data consisted of a split-plot-type 
analysis of variance of a two-way 
crossed design (CGC-PTA × Sex) with repeated 
measures. Time effects were investigated 
by breaking the 75 trials into five 
blocks or intervals of 15 successive choices 
each. The data were analyzed separately by 
game and by response parameter. 
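
The blocking of the 75 trials into five intervals of 15 choices can be sketched as follows. The function name and the example choice sequence are hypothetical, made up purely to show the computation; they are not data from the study.

```python
def block_percentages(choices, response="D", block_size=15):
    """Percentage of a given response within each successive block of trials."""
    if len(choices) % block_size != 0:
        raise ValueError("number of trials must be a multiple of block_size")
    percentages = []
    for start in range(0, len(choices), block_size):
        block = choices[start:start + block_size]
        percentages.append(100.0 * block.count(response) / block_size)
    return percentages


# Made-up sequence of 75 choices for one subject (not real data).
example_choices = (
    ["D"] * 3 + ["C"] * 12
    + ["D"] * 6 + ["C"] * 9
    + ["D"] * 9 + ["C"] * 6
    + ["D"] * 12 + ["C"] * 3
    + ["D"] * 6 + ["C"] * 9
)
d_by_block = block_percentages(example_choices, response="D")
```

Each subject then contributes five repeated measurements per response type, which is what the time factor in the analysis of variance operates on.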

To make a very long story short, there 
were no reliable CGC-PTA or sex differences 
and no reliable CGC-PTA × Sex interaction 
effects on any of the response parameters in 
any of the four games. However, there were 
reliable main effects due to time on three of 
the four games, with a time effect on the 
fourth game approaching reliability. These 
findings are reported in Table 2 and graphi- 
cally in Figure 2. Because of the general ab- 
sence of CGC-PTA, sex, and interaction ef- 
fects, the samples and sexes are combined in 
Figure 2. 

Table 2 indicates that only the D-choice 
time effects achieved reliability in Games 1, 
2, and 3, and approached reliability in Game 4. 

Inspection of Figure 2 indicates that this 
D-choice time effect is due to a consistent in- 
crease in rate of D responses over the first 45 
trials across all four games. After the third 
interval, the D response rate of Game 1 de- 
parts from the D rates in the other games by 
continuing to increase during the fourth in- 
terval and then suddenly decreasing in the 
fifth block. The D rates in Games 2, 3, and 4 
either level off or decrease slightly in the 
fourth block, and then increase again during 
the fifth interval. 

Thus, the hypothesis that clinic and PTA 
husbands and wives would differ in the fre- 
quency with which they made the various 
Prisoner’s Dilemma choices was not substantiated. 
TABLE 1 
MEAN RESPONSE PERCENTAGES AVERAGED OVER TIME 

Response   Sample   Males   Females   M 

Game 1 
C%         CGC      47.46   42.75     45.11 
           PTA      41.06   45.15     43.13 
           M        44.21   43.95 

Game 2 
C%         CGC      38.40   37.06     37.10 
           PTA      33.15   34.57     33.86 
           M        35.78   35.82 
D%         CGC      39.37   38.48     38.93 
           PTA      44.80   39.91     42.36 
           M        42.09   39.20 

Game 3 
C%         CGC      41.60   43.11     42.36 
           PTA      42.13   43.73     42.93 
           M        41.87   43.42 
D%         CGC      41.86   39.11     40.49 
           PTA      45.24   39.73     42.49 
           M        43.55   39.42 

Game 4 
C%         CGC      32.71   29.51     31.11 
           PTA      39.06   35.73     36.89 
           M        33.39   32.62 
D%         CGC      32.44   35.11     33.78 
           PTA      30.22   33.42     31.86 
           M        31.33   34.27 
P%         CGC      20.08   20.00     20.04 
           PTA      19.73   19.20     19.47 
           M        19.91   19.60 

Note. Because of the ipsative relationships between responses 
within any game, only R-1 response data are presented. 


Hypothesis 2 


Product-moment correlations were com- 
puted between the R-1 choice frequencies for 
each game and the Primary Communication 
Inventory, the Marital Adjustment Test, age, 
education, and husbands’ occupational level. 
The clinic and PTA samples were combined, 
but correlations were computed separately for 
husbands and wives. These coefficients are reported 
in Table 3. Apropos of the second hypothesis, 
only the percentage of C choices correlates 
positively and reliably with adequacy 
of marital communication for both husbands 
and wives, but only in Game 3. The significance 
of this is reduced considerably by the 
more frequent reliable correlations between C 
and P percentages, and educational and occupational 
levels in Games 3 and 4. Thus, Hypothesis 
2 must also be rejected. 
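
For readers unfamiliar with the statistic, the product-moment (Pearson) correlation used above can be sketched in a few lines. The function name is hypothetical, and the sketch shows only the textbook formula, not the software or procedure the author used.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)
```

In the study's terms, `x` would hold one game-choice percentage per husband (or per wife) and `y` the corresponding questionnaire or demographic score.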


DISCUSSION 


These findings do not support the idea that 
monadic measures of Prisoner’s Dilemma 
game behavior reflect differences in dysfunc- 
tional communication and satisfaction among 
married couples. Neither do they differentiate 
couples who are participating and not partici- 
pating in psychiatric or psychological counsel- 
ing. 

On the other hand, the reliable time effects 
and their specific form, because of their con- 
sistency with the findings of Rapoport and 
Chammah (1965a), are significant. The curvi- 
linear relationship between time and the fre- 
quency of D choices in Game 1, with D choices 
occurring at a moderate rate early in the game, 
increasing during middle time intervals, and 
[...] is identical to that reported 
by Rapoport and Chammah. 

[...] divergence in rate of D choices over 
time between couples who played Game 1, and 
[...] Games 2, 3, and 4, is also 
[...]tion data findings. Reliable relationships (p 
< .05) between adequacy of marital communication 
and C responding were found in 
only one of the four games. 

However, the fact that the C-D response-education 
correlations were higher than the 
communication correlations for males in Game 
3 and for males and females in Game 4 suggests 
that game behavior may be more [...]nently 
related to ability or comprehension 
[...] time in Games 2, 3, and 4. That is, efforts of 
[...] to comprehend the resolution potential of 
[...] cooperative strategy. This raises the possibility 
that efforts to increase verisimilitude 
by increasing the number of choice alternatives 
and the complexity of the game situation 
[...] be confounded by the subjects’ [...] 
PERSONALITY CORRELATES OF DIFFERENTIAL 
RESPONSIVENESS TO DIRECT AND 
VICARIOUS THREAT: 
A FAILURE TO REPLICATE PREVIOUS FINDINGS 1 


JAMES R. AVERILL, ERHARD OLBRICH, 3 AND RICHARD S. LAZARUS 


University of California, Berkeley 


Psychophysiological responses were recorded as subjects anticipated an electric 
shock to the self and to a model. Attempts to cross-validate previously reported 
findings regarding the personality characteristics of subjects who respond more 
to the direct than vicarious threat, and vice versa, were unsuccessful. Psychophysiological 
responses differed as a function of whether the vicarious threat 
was experienced first or second. Heart rate and self-reported anxiety were most 
sensitive to differences in the type of threat, whereas skin conductance was 
relatively nondiscriminative. Some implications of these results for experimental 
research in personality are discussed. 


Differential reactivity to direct and vicari- 
ous threat bears on a number of theoretical 
and methodological issues in the areas of 
stress and social adjustment, for example, 
regarding empathy, modeling behavior, and 
desensitization therapy. In our own research 
on the psychodynamics of stress (Lazarus, 
Averill, & Opton, 1970), we have frequently 
employed motion picture films as stimuli. An 
important assumption in using this research 
paradigm is that reactions elicited vicariously 
in the subject viewer are similar to those 
where the harm is directly confronted. In a 
dissertation by Alfert (1966), a comparison 
was made between responses to the vicarious 
threat of viewing a film accident and the 
direct threat of receiving an electric shock. 
Although she found generally significant cor- 
relations between reactions in the two situa- 
tions, considerable variance remained un- 
accounted for. Subsequent analyses (Alfert, 
1967; Opton & Lazarus, 1967) indicated that 
some of this variance was a function of indi- 
vidual differences in relative sensitivity to the 


i This research was supported, in part, by National 
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3 Now at the Bundeswehramt (Gruppe Wehrpsy- 
chologie), Bonn, West Germany. 
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two types of threat. Subjects who showed 
greater responsiveness to the vicarious than 
to the direct threat tended to be more intro- 
verted, submissive, suggestible, insecure, pas- 
sive, anxious, lacking in impulse expression, 
and not caring about friends. 

In their analysis of the Alfert data, Opton 
and Lazarus (1967) were particularly inter- 
ested in the relative merits of ipsative (within- 
subject) and normative (between-subject) 
research designs, and not just the specific 
issue of direct versus vicarious threat. The 
personality data they presented were used to 
illustrate the greater sensitivity of the ipsative 
approach. These data consisted of a set of 
items, drawn from standard personality inven- 
tories, which distinguished between subjects 
showing differential sensitivity to direct and 
vicarious threat. The number of these items 
substantially exceeded what would be ex- 
pected by chance. Moreover, they showed a 
high degree of internal consistency and ap- 
plied to psychophysiological reactions as well 
as to self-reported affective disturbance. This 
suggested that the observed personality-stress 
relationships would prove stable in subsequent 
research. The experiment reported here was an 
attempt to cross-validate and further clarify 
the empirical basis of these relationships. 


METHOD 
The experiment employed anticipated electric shock 
to the subject and to a model (ostensibly a fellow 
subject, but really a confederate of the experimenter). 
Two groups of subjects experienced both the direct 
and vicarious threat, but in counterbalanced order. 
Thus, there was a total of four conditions: S1, subject 
anticipated the shock first; M1, subject observed 
the model first; S2, subject anticipated the shock 
second; M2, subject observed the model second. 


Subjects 


Eighty male subjects completed the experiment. 
All but five were paid volunteers who responded to 
an advertisement in the student newspaper; the 
remaining five were drawn from introductory psychology 
courses and received class credit for participation. 
Seven subjects withdrew from the experiment 
when they learned that it involved electric shock, 
and they were replaced. Subjects were randomly 
divided into two groups, depending upon whether 
they experienced the direct threat first and the vicari- 
ous threat second (direct-vicarious group), or vice 
versa (vicarious-direct group). 


Dependent Variables 


Physiological. Heart rate and skin resistance were 
measured continuously on a Beckman Type R dynograph. 
The electrocardiograph was converted to 
beat-by-beat heart rate by means of a cardiotachom- 
eter. Skin resistance was obtained from a modified 
Wheatstone bridge which impressed a relatively con- 
stant current of 10 microamps across silver-silver 
chloride electrodes, 1-centimeter diameter, placed on 
the thenar and hypothenar surfaces of the nondomi- 
nant hand. Skin resistance was changed to log skin 
conductance before analysis. 
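
The conversion from skin resistance to log skin conductance mentioned above can be sketched as follows. The function names are hypothetical, and two details are assumptions the text does not state: conductance is expressed in microsiemens (micromhos), and the logarithm is base 10. The first helper simply applies Ohm's law to the constant-current arrangement described.

```python
import math


def resistance_from_voltage(voltage_volts, current_amps=10e-6):
    """Ohm's law (R = V / I) for a constant-current bridge; 10 microamps per the text."""
    return voltage_volts / current_amps


def log_conductance(resistance_ohms):
    """Convert skin resistance in ohms to log10 conductance in microsiemens.

    ASSUMPTIONS: microsiemens as the unit and base-10 logarithm; the source
    says only that resistance was changed to log skin conductance.
    """
    if resistance_ohms <= 0:
        raise ValueError("resistance must be positive")
    conductance_microsiemens = 1e6 / resistance_ohms
    return math.log10(conductance_microsiemens)


# A skin resistance of 100,000 ohms corresponds to 10 microsiemens.
example_value = log_conductance(100_000.0)
```

Conductance is the reciprocal of resistance, so a fall in resistance during arousal appears as a rise in the transformed score.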

Self-reported affect. At the beginning of the experi- 
ment and immediately after each threat condition, 
the subject and the model completed a slightly 
modified version of the Nowlis (1965) Mood Adjec- 
tive Check List (MACL), which was scored for the 
following 11 dimensions: Anxiety, Aggression, Social 
Affection, Elation, Vigor, Surgency, Fatigue, Noncha- 
lance, Egotism, Concentration, and Skepticism. At 
the end of the experiment, the subject and the 
model were taken to separate rooms where ratings 
were obtained of the subject's perception of the 
model in terms of the three semantic differential 
dimensions of evaluation, potency, and activity. 
Upon analysis, these ratings of the model produced 
no significant findings and will not be discussed 
further. A structured interview also was conducted 
with the subject to determine any suspicions on 
his part, his degree of stress, etc. These data did not 
yield results beyond those to be reported for the 
MACL and hence will not be considered further. 

Personality. Following the postexperimental inter- 
view, each subject was given a 506-item personality 
inventory which he completed and returned to the 
laboratory within 10 days. The inventory contained 
all of the items reported by Opton and Lazarus 
(1967) as distinguishing between persons who re- 
sponded more to vicarious than direct threat, as well 
as a variety of other scales which were employed 
for exploratory purposes. The latter included the 
Marlowe-Crowne Social Desirability Scale; Welsh’s 
A Scale from the Minnesota Multiphasic Personality 
Inventory (MMPI); the Femininity scale from the 
California Psychological Inventory (CPI); the complete 
Eysenck Personality Inventory; Haan’s Empathy 
and Projection Scales; [...] 
Feeling, Extraversion, and [...] scales; 
David’s Alienation Scale; Rokeach’s Dogmatism 
scale; and Stein’s Motoric Ideational Activity 
Preference Schedule. 

Procedure 

After the subject and the model reported to the 
laboratory, they were seated in two easy chairs separated 
by a small table on which the terminal box 
for the electrodes was placed. It was explained that 
the experiment was designed to investigate reactions 
to various stimuli, the exact nature of which would 
be explained later. Heart rate and skin conductance 
electrodes were attached to both the subject and the 
model, with a brief explanation of the function of 
each device. The subject and the model were then 
asked to relax quietly while the recording equipment 
was being calibrated; after an 11 rest period, 
they completed the MACL for the first time. 

Up to this point, the treatment of both vicarious-direct 
and direct-vicarious groups was the same. In 
the direct-vicarious condition, plate electrodes were 
next attached to the forearms of the subject and the 
model, and they were informed that first the subject 
and then the model would receive a brief 90-volt 
electric shock at the end of a 35-second anticipation 
period (indicated by a large clock in front of them). 
Before the subject’s shock electrodes were connected, 
however, he was asked to help in a check of the 
equipment. A voltmeter was attached to the apparatus, 
and the subject reported to the experimenter, 
who was in the adjacent room, if the needle jumped 
to 90 volts at the same time that a raucous buzzer 
sounded. With the equipment working 
properly, the voltmeter was disconnected, and the 
subject’s shock electrodes were attached to the terminals. 
After a 2-minute base-line [...], the 
beginning of the anticipation period [...] 
over an intercom, and the clock was started. Actually, 
no shock was administered at the end of the 35 
seconds, although the buzzer did sound. (This procedure 
was necessary in order to keep the vicarious 
threat plausible without administering a severe shock 
to the subject.) The experimenter then entered the 
room and instructed the subject and the model to 
indicate on the MACL how they felt during the 
last second before the shock. When the subject 
reported that he did not receive a shock, the experimenter 
acted incredulous and asked him to complete 
the MACL anyway, reminding him that he was to 
rate his feelings immediately before the expected 
shock. Meanwhile, the experimenter pretended to 
check the equipment in the adjacent room. After a 
few minutes, he returned with the voltmeter and, 
with the subject's help, again checked the shock apparatus. 
The output terminals registered the appropriate 
90 volts; however, further testing revealed a broken 
lead to the shock electrode. It was replaced, and the 
regular procedure was then begun for the model, 
“in order to at least get his reactions completely.” 
Following a 5-minute rest interval, the clock was 
started, and at the end of 35 seconds the model 
received his “shock.” He violently jerked his arm, 
breathed deeply, and was quiet again. The subject 
and the model again indicated on the MACL how 
they felt in the last second before the shock. 

In the vicarious-direct condition, the order of 
presentation was reversed, while everything else was 
kept as constant as possible. At the beginning of the 
experiment, the apparatus was checked with a volt- 
meter which indicated 90 volts. The model then 
received his shock and displayed the appropriate pain 
response. When the subject reported that he was not 
shocked when his turn came, the apparatus was 
checked as before, the problem discovered, etc. This 
procedure was apparently effective in allaying sus- 
picions, as indicated by postsession interviews and 
by the fact that there were no significant differences 
as a function of order of presentation on the 
“Skepticism” dimension included in the MACL. 


RESULTS AND DISCUSSION 


We shall consider first the psychophysio- 
logical responses to the direct and vicarious 
threats and then the personality correlates of 
differential responding. 


Response Patterning 


The maximal values of skin conductance 
and heart rate were read in successive 10- 
second intervals during the 2-minute base 
periods and in 5-second intervals during 
anticipation. For each condition, the mean of 
the base readings was subtracted from the 
reading immediately prior to the expected 
shock, which generally was the peak anticipa- 
tory response. This provided a measure of 
the emotional impact of that anticipated 
shock. A two-way analysis of variance⁴ was
performed on these data, the first factor
(within-subject) being direct versus vicarious
threat and the second factor (between-subject)
being order of presentation. Tests
were also made of the simple main effects of
the threat conditions (S1 versus S2 and M1
versus M2) to further analyze the influence
of position on direct and vicarious threat.
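
The difference score described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `reactivity` and the sample readings are hypothetical and are not data from the study. The sketch simply subtracts the mean of the base readings from the reading taken immediately before the expected shock.

```python
from statistics import mean


def reactivity(base_readings, pre_shock_reading):
    """Return the pre-shock reading minus the mean of the base readings."""
    return pre_shock_reading - mean(base_readings)


# Hypothetical heart-rate values in beats per minute (not data from the study).
# Twelve readings stand in for a 2-minute base period sampled every 10 seconds.
base = [70, 72, 71, 69, 70, 72, 71, 70, 69, 71, 70, 71]
pre_shock = 92  # hypothetical reading immediately before the expected shock

print(reactivity(base, pre_shock))
```

One such score per subject and condition would then serve as the entry in the analysis of variance.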

There were no significant differences in skin 
conductance reactivity as a function of varia- 
tions in experimental treatments; that is, the 


4 Although the experiment has the form of a 
Latin square design, a Latin square analysis is in- 
appropriate due to the interaction of treatment with 
order effects (Benjamin, 1965). 


subjects in all four conditions showed a nearly
equal increase in skin conductance. On the
other hand, heart rate responses differed considerably
depending upon the threat conditions.
The mean increase in beats per minute
was 20.3 during S1, 23.1 during S2, 12.3
during M1, and 4.23 during M2. The main effect
for the comparison of direct versus vicarious
threat was highly significant (p < .0005).
The order of presentation also had a significant
effect (p < .05), with the vicarious-direct
group showing greater cardiac reactivity than
the direct-vicarious group. Finally, when M1
was compared with M2, the results were significant
at the .005 level of confidence; the
heart rate response was considerably less
when the vicarious threat was experienced
second than when it was experienced first.

Analyses of variance similar to those de- 
scribed above were also performed on the 
difference scores between base and threat 
ratings for each dimension of the MACL. In 
comparison to the vicarious threat, subjects 
in the direct threat conditions reported sig- 
nificantly greater anxiety (p < .0005), concen- 
tration (p < .0005), and vigor (p < .0005), 
but less fatigue (p < .05), nonchalance (p < 
.001), and social affection (p < .0005). There 
were no significant differences in self-reported 
affect due to order of presentation, although 
two variables (anxiety and egotism) showed 
highly significant interactions between se- 
quence and type of threat (p < .0005). Tests 
of the simple main effects of threat conditions 
yielded only one significant result; subjects 
reported little anxiety when the vicarious 
threat was experienced second rather than 
first (p < .001). This latter is the same pattern
of response as was observed in the case
of heart rate.

These results are in general agreement 
with the psychophysiological responses to 
direct and vicarious cold-pressor stimulation 
described by Craig (1968; Craig & Wood, 
1969). They also underscore Berger’s (1962) 
distinction between vicarious and pseudo- 
vicarious experiences; the latter partakes of 
the clear expectation that one is shortly to 
suffer the same harm as the other, while the 
former is composed mainly of an empathic 
response. It may be questioned whether these 
two experiences, in which the significance of 
what is happening to the model is so different,
should be treated as the same. The failure
of skin conductance to differentiate between
the M1 and M2 conditions (or between these
and direct threat) would indicate that the
vicarious experience (M2) was, in a certain
sense, as stimulating as the others. However,
the lack of heart rate activity and self-reported
anxiety in the M2 condition implies
that it was not especially threatening. Skin
conductance, it should be noted, is a very 
general response measure. It indicates pri- 
marily that the subject has been stimulated, 
but provides little information regarding the 
quality of the response state (Averill & Opton, 
1968). 

In summary, truly vicarious or empathic 
responses differ in important respects from 
reactions to both direct and pseudovicarious
threats. It cannot be assumed without qualification
that conclusions drawn from research
employing vicarious techniques (e.g., motion 
pictures) are generalizable to real-life situa- 
tions where the threat of harm is more direct. 


Personality Correlates 


The personality correlates of differential 
responding to direct versus vicarious threat 
were investigated as follows: The reactivity 
scores described above for heart rate, skin 
conductance, and self-reported anxiety were 
standardized across subjects within each 
threat condition (S1, S2, M1, M2). Subjects
were then divided into groups depending upon
whether their standardized score on a particular
variable was greater to direct than
to vicarious threat. Four groups of equal size
were formed in this manner (S1 > M2,
S2 > M1, M1 > S2, and M2 > S1). Two-way
analyses of variance were then performed
using the personality scales as dependent variables;
one factor consisted of relative responsiveness
(subject > model versus model >
subject), and the other factor was order of
presentation (direct-vicarious versus vicarious-direct).
Stated another way, each of the four
groups described above represented a cell in
a 2 × 2 analysis of variance table.
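
The standardize-then-compare step described above might be sketched as follows. This is a rough illustration rather than the authors' procedure: the function names and the sample scores are hypothetical, the population standard deviation is an assumption, and the source does not say how groups of equal size were obtained, so the sketch uses a simple comparison of the two standardized scores.

```python
from statistics import mean, pstdev


def standardize(scores):
    """Convert raw reactivity scores to z scores within one threat condition."""
    m = mean(scores)
    s = pstdev(scores)  # assumption: population SD; the source does not specify
    return [(x - m) / s for x in scores]


def more_responsive_to_direct(z_direct, z_vicarious):
    """For each subject, True if the standardized direct-threat score is larger."""
    return [d > v for d, v in zip(z_direct, z_vicarious)]


# Hypothetical reactivity scores for four subjects in the direct-vicarious order.
s1_scores = [18, 22, 25, 15]  # direct threat experienced first (S1)
m2_scores = [6, 2, 5, 3]      # vicarious threat experienced second (M2)

flags = more_responsive_to_direct(standardize(s1_scores), standardize(m2_scores))
print(flags)
```

Standardizing within each condition removes the overall difference between direct and vicarious threat, so the comparison reflects only a subject's standing relative to the other subjects.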

There is no need to go into the details of 
these analyses, for the results were essentially 
negative. The smattering of significant F 
ratios which did emerge could have occurred 
by chance, and there were no consistent
trends in the data which could be interpreted
as supporting the personality-stress relationships
reported by Opton and Lazarus (1967).
Other analyses were performed involving specific
comparisons, but these also failed to
support earlier inferences.
Actually, this is the second such failure.
An earlier unpublished study⁵ attempted to
examine differential responsiveness to direct
and vicarious threat as a function of whether
the harmful event was primarily physical
(electric shock) or psychological (failure on
a form-board task). Briefly, the subjects experienced
the accident film used by Alfert
(1966), the anticipation of an electric shock
to the self and to a model, and failure by
self and a model on a form-board task. As in
the present study, self-report and physiological
responses were standardized within
conditions, and comparisons were made among
various combinations of direct and vicarious
threat. For example, one comparison contrasted
subjects who responded relatively
more to the direct threat of electric shock
with those who responded more to the vicarious
film threat, the stimuli on which the
Opton and Lazarus (1967) analyses were
based. Only a chance number of the personality
items reported by Opton and Lazarus
discriminated between these groups, and an
approximately equal number of items appeared
in the nonpredicted as in the predicted
direction. Comparisons involving direct and
vicarious threats using the same type of
stimulus, that is, electric shock or ego
failure, actually yielded results contradictory
to expectations. In these instances, persons
who responded relatively more to direct threat
showed greater dispositional anxiety, exactly
opposite to the earlier findings. However, the
complexity of this unpublished study precluded
counterbalancing of conditions, among
other things, which made it difficult to interpret
the results strictly in terms of direct
versus vicarious threat.

5 We wish to express appreciation to Michael D. Evans for his assistance in conducting this
experiment.

To summarize briefly, one original and two
follow-up studies have been conducted, none
of which has supported the findings of the
others. Hence, we cannot conclude, as originally
supposed, that there are stable relationships
between broad personality dimensions
and differential responding to direct and vi- 
carious threat. Certainly, the set of items 
reported by Opton and Lazarus (1967) should 
not be treated as a personality scale in subse- 
quent research. This, however, does not 
detract from the logic of their argument 
concerning the relative merits of ipsative and 
normative designs. What it does do is raise 
a serious question regarding the potentially 
ephemeral nature of findings purporting to 
demonstrate personality—stress relationships. 
There are, of course, many possible reasons 
for failures to replicate. In the present case, 
the two follow-up studies described here were 
not actual duplicates of the original, but at- 
tempts to test inferences derived from it. 
Hence, the three experiments differed in 
many details of procedure, and perhaps in the 
subject population sampled. That is not the 
important point, however; such failures to 
replicate have significance beyond the mere 
details of any particular set of experiments. 
Again and again, investigators have obtained 
what appear to be stable correlations between 
stress reactions and personality variables, 
only to have them disappear when tested in 
a slightly different setting. But because repli- 
cation experiments are rarely undertaken 
unless one is attempting to create a personal- 
ity test of some kind, and because experi- 
mental failures are usually abandoned and 
not published, it is possible that the majority 
of reported personality correlates are will-o’- 
the-wisps. In our view, one reason for this 
is that any given personality variable ac- 
counts for only a small fraction of the vari- 
ance in behavior. Other potential sources of 
variance include types of stimuli, the context 
in which they are embedded, the nature of 
the responses measured, as well as the inter- 
action of these variables with each other and 
with specific attitudes, beliefs, etc., on the 
part of the subject. Not all of these sources 
of variance are actualized in any given ex- 
periment, for example, because the stimulus is 
held constant or only a few responses are 
measured. Nevertheless, when the situation is 
changed somewhat, their influence is mani- 
fested in failures to achieve cross-validation. 


This is a discouraging state of affairs for 
those of us interested in the experimental 
study of personality, but it is by no means 
an insurmountable obstacle. Primarily, it indi- 
cates the need for a change in strategy—per- 
haps along the lines suggested by Endler and 
Hunt (1968), Mischel (1968), and others— 
which places less emphasis on transsituational 
personality dispositions and more emphasis on 
isolating the specific sources of variance 
operating in any given situation. 


EFFECT OF FRUSTRATION UPON IMITATION OF AGGRESSION


MARGARET A. HANRATTY, EDGAR O’NEAL,¹ AND JEFFERSON L. SULZER


Tulane University 


Of 30 first-grade boys, 10 were frustrated and allowed to attack their frustrator, 
10 were frustrated and allowed to attack another person, and the remainder 
were not frustrated. Half of the subjects had previously been shown an aggres- 
sive modeling film. According to prediction, of the subjects who had seen the 
film, those who were frustrated displayed more imitative aggression. Among 
frustrated subjects, there was no significant effect produced by the nature of 
the target. The results are discussed in terms of the influence of frustration on 
performance variables in observational learning. 


A number of studies published during the 
past decade (e.g., Bandura, 1965; Bandura, 
Ross, & Ross, 1963; Hanratty, Liebert, 
Morris, & Fernandez, 1969) have demon- 
strated that following brief exposure to 
models who are not punished for their aggres- 
sive behavior, young children readily engage 
in imitative attacks upon live or inanimate 
victims despite societal prohibitions against 
such behavior. In these studies the attacks 
usually involve relatively novel behaviors, the 
performance of which is attributed to the 
acquisition of the specific responses displayed 
by the model. 

Concurrently, another series of studies, 
represented chiefly in work by Berkowitz 
and his students (Berkowitz & Geen, 1967; 
Berkowitz & LePage, 1967; Geen, 1968), has 
focused upon frustration as an antecedent of 
aggression. In these studies, adult subjects 
who are frustrated (either by attack or failure 
at a task) and are then exposed to stimuli 
associated with aggression deliver more elec- 
trical shock to a victim than subjects who 
have not been frustrated. To deliver shock, 
the subject presses buttons on a panel. 
Berkowitz (1969) has interpreted his 
results in terms of a revised frustration- 
aggression hypothesis by proposing that prior 
attack or frustration, by virtue of its gen- 
eralized drive properties, increases the prob- 
ability of aggressive responses to the cues 
provided by the stimuli associated with 
aggression. 

1 Requests for reprints should be sent to Edgar O'Neal, Department of Psychology, Tulane University,
New Orleans, Louisiana 70118.

A considerable body of evidence indicates
that frustration is accompanied by physio- 
logical arousal (e.g., Baker & Schaie, 1969; 
Gambaro & Rabin, 1969; Hokanson & 
Burgess, 1962), and recent results reported 
by Geen and O’Neal (1969) suggest that 
physiological arousal may be a mediator in 
energizing the frustrated subjects’ delivery 
of shocks to a victim. While the role of
frustration-produced arousal as an energizer
of aggressive responses has been given considerable
attention (e.g., Berkowitz, 1969),
its possible role as a cue has been largely ignored.
Presumably, individuals are aware of
their arousal state (e.g., Schachter, 1964), 
and it can provide cues for behavior (O’Neal, 
1971). It seems reasonable that arousal may 
also serve as a discriminative cue to an indi- 
vidual that aggressive responses would be 
instrumental to goal attainment. The cue 
value of such arousal stimuli is accounted for 
by their presence in the past when aggressive 
responses were rewarded by removal of 
barriers to a goal. 

This line of reasoning suggests that frustra- 
tion may increase the likelihood that aggres- 
sive behaviors will be displayed once such 
responses have been acquired through ob- 
servation (cf. Bandura, 1970, p. 381). Also, 
it would seem to follow that the cue properties 
of frustration-produced arousal would facili- 
tate aggressive responses primarily toward the 
frustrator, since in the subject’s prior experi- 
ence, attacks against agents of frustration 
would most likely have resulted in removal 
of goal barriers. If, however, the frustrator is 
not an available target, then the arousal cues 
would, through target generalization, direct 
aggressive behaviors toward a more accessible
target. 

A number of predictions are possible from 
the preceding rationale. Bandura (1965) 
argued that at least one function of exposure 
to unpunished aggressive models is to mitigate 
the observer’s societally imposed inhibition 
against aggression, especially in regard to the 
specific aggressive behaviors displayed by the 
model. If an individual acquires a set of novel 
aggressive responses by witnessing a model’s 
aggression, and his inhibitions toward such 
behavior are simultaneously weakened, then 
frustration would further facilitate perform- 
ance of these responses. Frustration effects 
would be expected to be most pronounced 
when the individual is afforded the opportunity 
to attack his frustrator. 

This experiment was designed to test these 
predictions regarding the effect of frustration 
upon the imitation of aggression. Frustration 
was induced by telling children that they 
would not be given the valuable prize they 
had expected to receive, because their partner 
in a game performed poorly. This manipula- 
tion was designed to conform as closely as 
possible to a definition of frustration as a 
withdrawal of an anticipated reinforcer and 
to eliminate any chance that the frustration 
would be construed by the subject as punishment,
which might lead to inhibition of imitative
aggression (cf. Kuhn, Madsen, & Becker,
1967). Half of the children had previously
been shown a film in which a model displayed
novel aggressive behavior. Then, each child
was allowed to aggress either toward his frustrator
partner or toward another person, and
his performance of aggressive responses was 
observed and recorded. 


METHOD²


The subjects were thirty 6- and 7-year-old boys 
in the first grade at two parochial schools in New 


2 Thanks for providing subjects and research space 
are due to Sister Mary Padraic, Superintendent of 
Student Personnel, Catholic Archdiocese of New 
Orleans; Sister Mary Venard, principal of St. 
Alphonsus School; Sister Audrey Ruth, principal of 
Incarnate Word School; and Sister Judith, Adminis- 
trative Assistant in the Office of the Archdiocesan 
Superintendent of Schools. The assistance of Paula 
Ladd, Gary Liebeskind, Dorothy Wright, David 
Chernikoff, Edward Royal, Leon Keys, James Mc- 


Orleans. Three adult males alternated as experimenters,
and two adult females, as the target of
aggression.


Modeling Manipulation 


One-half of the subjects were assigned to the 
modeling condition, and one-half, to the control con- 
dition (no model). Those in the modeling condition 
were brought by the experimenter individually from 
their classrooms to another room and shown the film.

The 2½-minute color film with synchronized tape
recording to provide sound depicted an adult male 
model aggressing against an adult female costumed 
as a clown. The clown costume consisted of a full- 
length clown suit, face mask, and wig. The film was 
similar to that described by Hanratty et al. (1969). 
The adult male model performed four distinct types 
of aggressive behavior toward the motionless clown 
during the film: fist shaking, hostile comments, 
shooting a toy machine gun toward the clown, and 
hitting the clown with a plastic hammer. 


Frustration Manipulation 


In order to induce frustration, the experimenter 
showed the subject his “toy cabinet” which con- 
tained an array of attractive toys ranging in value 
from 10¢ to $7. After indicating which toy he liked 
best, the child was informed that it would be pos- 
sible for him to receive the toy as a prize if both 
he and a partner performed well in a game. The 
game involved a very simple task—arranging six 
plastic milk bottles in a carton according to the 
color of their caps to match another carton of bottles 
which had previously been arranged by the experi- 
menter. Then the experimenter instructed the child to 
begin work while he went to see how the partner 
had done, who supposedly was working on the same 
task. Upon returning, the experimenter remarked:
“You’ve done very well in the game, but I’m afraid 
your partner didn’t do well at all. Because he per- 
formed so poorly, I won’t be able to give either of 
you a prize.” 

One-third of the subjects in each modeling condi- 
tion was led to believe that the partner was “our 
clown” (frustrator target group); one-third, that 
their partner was another child (nonfrustrator 
target group), and the remaining subjects were not 
frustrated (no-frustration group). The subjects in 
the no-frustration group were not shown the toys 
but were asked to arrange the milk bottles, and 
after they had finished they were told that they 
had done very well. 


Dependent Measures 


Then each subject was escorted to a room in which 
the clown was standing. The mallet and toy gun had 
been placed on the floor near the clown. Upon 
entering the room, the experimenter motioned toward 
the clown and remarked “That’s our clown, Dodo. 


Daniel, James A. Thomas, and J. Higgins Hensley 
is also gratefully acknowledged. 


TABLE 1
MEAN IMITATIVE AGGRESSION SCORES

Column headings: Frustration (Frustrator target, Nonfrustrator target); No frustration

Film 0, 4.81, 8.0,
No film ae ‘6a i


Note. Five subjects per cell. Cells having common subscripts
are not significantly different at the .05 level by the
Duncan multiple-range test.


Dodo doesn’t talk very much.” The experimenter 
then mentioned to the child that he had some very 
important work to do and that the child could 
play in the room, doing whatever he wished, while 
the experimenter completed his paper work. Then 
the experimenter seated himself in the corner of the 
room with his back to the clown and avoided inter- 
action with the child. The clown remained motion- 
less and never spoke. All subjects remained in the 
room for 5 minutes. Finally, each child was given 
a small package of balloons as a gift, thanked for 
participating, and taken back to his classroom. 
Each subject’s aggressive behavior while in the 
presence of the clown was observed and classified 
according to two response categories. The first cate- 
gory, imitative aggression, included those responses 
previously performed by the adult male model (fist 
shaking, shooting the clown with the toy machine 
gun, and hitting the clown with the hammer). The 
other category included any aggressive acts not 
exhibited by the adult male model in the film. 
Aggressive behavior was recorded during the first
10 seconds of each of the ten ½-minute periods comprising
the 5 minutes in which the child was with
the clown. Observers were located outside the door
of the room and monitored each child’s behavior
through a window in the door which was covered 
with cheesecloth. The cheesecloth screen served to 
conceal the observers from the view of the child. 
Six of the subjects were observed by two independent
observers, and the scoring procedure was found
to be highly reliable (r = .97).
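
An interobserver reliability coefficient of this kind is ordinarily a product-moment correlation between the two observers' scores. The sketch below shows one way to compute it; the function name `pearson_r` and the six pairs of scores are hypothetical and are not the study's data, and the source does not state which correlation formula was used.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical imitative aggression scores for six subjects, one list per observer.
observer_a = [0, 4, 8, 2, 6, 5]
observer_b = [0, 5, 8, 2, 6, 4]

print(round(pearson_r(observer_a, observer_b), 2))
```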


RESULTS 


The mean imitative aggression scores for
subjects during the time-sampled periods are
presented in Table 1. An analysis of variance
performed on these scores yielded the
predicted main effects for the modeling
(F = 16.7, df = 1/24, p < .01) and for the
frustration (F = 5.9, df = 2/24, p < .01)
factors. Dunnett’s t tests for dependent comparisons
revealed that, as anticipated, subjects
both in the frustrator target and in
the nonfrustrator target conditions (t = 3.33,
p < .005; t = 2.16, p < .05, respectively)
delivered more imitative aggression than did
those in the no-frustration group. However,
subjects in the frustrator target and nonfrustrator
target groups did not differ from
each other in mean imitative aggression scores.
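
The degrees of freedom reported above are what one would expect from a 2 (film) × 3 (frustration) between-subjects design with five subjects per cell. The short sketch below works through that arithmetic; the function name `anova_df` is hypothetical, and the formulas are the standard ones for a balanced two-way design.

```python
def anova_df(levels_a, levels_b, n_per_cell):
    """Degrees of freedom for a balanced two-way between-subjects design."""
    return {
        "A": levels_a - 1,
        "B": levels_b - 1,
        "AxB": (levels_a - 1) * (levels_b - 1),
        "error": levels_a * levels_b * (n_per_cell - 1),
    }


# Film (2 levels) by frustration (3 levels), five subjects per cell, 30 in all.
df = anova_df(2, 3, 5)
print(df)
```

With 30 subjects in six cells, the error term has 24 degrees of freedom, which agrees with the 1/24 and 2/24 values given in the text.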

The analysis of variance on imitative aggression
scores also revealed a significant
Film × Frustration interaction (F = [illegible],
df = 2/24, p < .01). This interaction was
apparently accounted for by the fact that
there were significant (by Duncan’s multiple-range
test) film-no-film differences in the
frustration groups, but not in the no-frustration
groups. That is, viewing the film
increased the imitative aggression expressed
by the frustrated subjects but had no effect
on the imitative aggression of the other
subjects.

The nonimitative category of aggressive
responses proved unsatisfactory. Since novel
attacks toward the clown rarely occurred,
behaviors scored in this category consisted
mostly of random gun shooting not directed
toward the victim. Such behaviors were exhibited,
for the most part, by subjects in the
no-modeling conditions. However, subjects in
the modeling conditions engaged primarily in
imitative attacks toward the clown (which
were scored as imitative aggression) and thus
did not often obtain scores in the nonimitative
category. Since interpersonal aggression was
the primary interest of this study, nonimitative
scores were excluded from the analysis
due to the reciprocity between imitative
and nonimitative scores (r = −.38, p < .05).


DISCUSSION


These results indicate that frustration can
be an important variable affecting the performance
of imitative aggression. Indeed, subjects
who witnessed an aggressive model
imitated his actions only if they had been
frustrated. These findings appear to conflict
with a previous finding that for kindergarten
children, mere exposure to a model was sufficient
to produce imitative aggression toward
a human target (Hanratty et al., 1969). This
discrepancy could be accounted for by the
age difference between subjects in the earlier
study and those in the present experiment.
Since aggression toward others is punished
by parents and teachers, older children presumably
have more inhibition against aggression
than younger children. For a very young
child, gross similarities between the testing 
situation and that depicted in the modeling 
film seem to be sufficient to elicit imitative 
aggression. Because of his greater inhibition, 
however, before an older child will imitate 
the unpunished aggressive behavior of a 
model toward a human victim, it may be 
necessary that additional disinhibitory cues 
be provided. Thus, the combination of aggres- 
sive modeling and frustration appears to serve 
as a potent elicitor of subsequent aggressive 
behavior. 

This interpretation may appear to be discrepant
from results reported by Walters and
Llewellyn Thomas (1963). These investigators
observed an increase in number of shocks
given to a stooge following exposure to an 
aggressive film, even though their adoles- 
cent and adult subjects had not been frus- 
trated. However, the difference between the 
Walters-Llewellyn Thomas experimental situ- 
ation and the present one is so great as to 
make comparison extremely difficult. Pre- 
sumably, since subjects in the former study 
believed that delivery of shocks to the con- 
federate was an integral part of the experi- 
ment in which they were participating, initial 
inhibition against such behaviors should have 
been decreased. In contrast, the children in 
the present study who were not frustrated 
were given no explicit justification for at- 
tacking the clown, and thus frustration may 
have played a crucial role as a disinhibitor. 

The pattern of results obtained provides
support for the arousal-as-cue position pre- 
sented in the introduction. Since subjects 
who had been exposed to the modeling film 
did not aggress unless they had been frus- 
trated, it does not seem likely that frustration 
acted as a general energizer by increasing 
the intensity of a dominant response. It would 
seem that the cue effects—as opposed to the 
energizing effects—of frustration had more 
opportunity to operate in the present experi- 
ment than in the Berkowitz studies for at 
least two reasons. First, in the present study, 
frustration-produced arousal acted more 
clearly as a cue to aggression because the 
responses of the subjects were more similar 


to those specific responses which in the “real 
world” had been paired with arousal cues than 
were those employed in the Berkowitz studies. 
Also, being more complex (more difficult to 
successfully execute) than Berkowitz’s button 
pressing, the responses available to the children
were less subject to enhancement by
the energizing effects of general arousal or
general drive (cf. Geen & O’Neal, 1969). 

The fact that the target of aggression in 
the film was the same as the subjects’ target 
might account for the failure to obtain 
the predicted frustrator-target versus nonfrustrator-target
differences. This confounding of film
target with subjects’ target³ might well have
increased the aggression of subjects in the
nonfrustrator target film group above that
which would otherwise be directed toward an
object not their frustrator. Also, having previously
seen their frustrator harmed might
have reduced the aggression directed by sub- 
jects in the frustrator target film group 
against their frustrator (cf. Bramel, Taub, 
& Blum, 1968). 

Finally, these results offer evidence that 
frustration may increase the likelihood that 
responses acquired through observation of an
aggressive model, such as those on television,
will be performed.


3 For obvious ethical reasons, the experimenters
were constrained against providing either the model 
or the subjects with a child as target for aggression. 


PERCEPTION OF RESPONSE ALTERNATIVES IN A
PRISONER’S DILEMMA GAME¹


THOMAS A. BARANOWSKI² AND DAVID A. SUMMERS


University of Kansas 


Sixty-four undergraduate subjects participated in a two-choice Prisoner's Dilemma
(PD) game in which the other's status, potential power, and strategy were systematically
varied. At the conclusion of the game, the subjects rated each response alternative
on several bipolar scales, including cooperative-noncooperative, reward-punishment,
coercive-noncoercive, and conflict-compliant. It was found that (a) a substantial
proportion of the subjects perceived the alternatives in a manner that
differed from their usual interpretation; (b) the subjects’ perceptions of the alternatives
were influenced by conditions under which the game was played; and (c)
perceptions of the response alternatives were related to subjects’ performance.


Questions concerning the subjective mean- 
ing of subjects’ responses in the Prisoner’s 
Dilemma (PD) game have been raised with 
increasing frequency (e.g., Komorita, 1965; 
Swinth, 1967; Vinacke, 1969). For example, 
Komorita found that subjects who alternated 
cooperative and noncooperative responses (and 
thereby received payoffs of +5 and —1) 
thought that they were cooperating. Komorita 
noted: 


The implication of this result is that conclusions 
about the degree to which individuals cooperate in the 
Prisoner’s Dilemma game, or any game, is a function of 
how cooperation is defined. Most investigators, in- 
cluding the present investigator, have defined coopera- 
tion as a response in which joint expected value is 
maximized. However, if a subjective criterion of co- 
operative behavior were to be employed, it is quite 
plausible that cooperation may not be such an elusive 
phenomenon as implied by previous investigators [pp. 
744-745]. 


Thus, the low frequency of mutually rewarding 
responses (a typical finding) might not neces- 
sarily reflect lack of cooperation in the PD 
game. Cooperation—from the subject’s per- 
spective—might be more prevalent than the 
existing measures of this behavior imply. 
However, at this point, a legitimate question
might be raised regarding this argument: To
what extent are the subject’s perceptions
systematically related to his interaction be-
havior in the game situation? If the subject’s
perceptions of the response alternatives are
not related to his behavior, then questions
about subjective perception of the game
become less relevant.

1 This research was conducted while the senior author
was a United States Public Health Service trainee, and
constituted partial fulfillment of the requirements for
the MA degree. The authors would like to thank
Robert Baranowski for his assistance in collecting the
data.

2 Requests for reprints should be sent to Thomas A.
Baranowski, Kansas Regional Medical Program, 3909
Eaton Street, Kansas City, Kansas 66103.

In order to investigate the questions raised 
above, the subject’s perceptions of the response
alternatives in the PD game were assessed in 
several different interaction situations. These 
situations were generated by systematically 
varying three situational variables that have 
been found to be theoretically important— 
namely, the status, potential power, and 
strategy of the other (e.g., Blau, 1964; Rapo-
port & Chammah, 1965). With regard to the
relationship between perception of the response 
alternatives and the subject’s game behavior, 
particular emphasis is placed here on (a) overall 
level of cooperation (the most frequently re- 
ported measure of game behavior); (b) recipro- 
cation of the other’s cooperative choices; and 
(c) reciprocation of the other’s noncooperative 
choices. 


METHOD 
Design and Subjects 


The design of this experiment was a 2 (Strategy) 
X 2 (Power) X 2 (Status) factorial, with eight subjects 
per cell. Sixty-four male undergraduates at the Univer- 
sity of Kansas, none of whom had prior knowledge of the 
PD game, served as subjects. 
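
As a quick arithmetic check on the design just described, the following minimal sketch enumerates the cells of the 2 X 2 X 2 between-subjects factorial. The factor level names and variable names are illustrative assumptions; only the factor names, the cell size of eight, and the total of 64 subjects come from the text. The between-subjects error degrees of freedom it computes (subjects minus cells) agree with the 56 reported in the footnote to the Results.

```python
from itertools import product

# Between-subjects factors named in the text; level labels are illustrative.
factors = {
    "strategy": ("contingent", "random"),
    "power": ("high", "low"),
    "status": ("high", "low"),
}

# Every combination of factor levels is one cell of the design.
cells = list(product(*factors.values()))

subjects_per_cell = 8                      # stated in the text
n_subjects = len(cells) * subjects_per_cell

# Error degrees of freedom for between-subjects effects: N minus number of cells.
error_df = n_subjects - len(cells)

print(len(cells), n_subjects, error_df)
```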


Strategy of Other 


In all conditions, the subjects interacted with a 
simulated other. The strategy of the simulated other 
assumed two forms: contingent and random. In the
contingent strategy condition, the simulated other re-
sponded to the subject’s response on Trial N with the
same response that the subject gave on Trial N − 1.
This represents perfect reciprocity by the other; that is,
the other returned the subject’s response on the follow-
ing trial. (On the first trial, the other’s response was
determined by chance.) In the random strategy condi-
tion, the responses of the other were presented in a
random order. In order to assure that the proportion
of “cooperative” responses by the other was equivalent
in the two strategy conditions, a modified yoked control
design was used. That is, each subject in the random
condition received the same proportion of Number 1
button responses by the other as did a matched subject
in the contingency condition. Furthermore, the re-
sponses of the subject from the contingency condition
were randomized over blocks of 20 trials for the matched
subject in the random condition.


                              YOU
                   BUTTON 1          BUTTON 2
OTHER  BUTTON 1    You receive 5     You receive 6
                   He receives 5     He loses 4
       BUTTON 2    You lose 4        [not legible]
                   He receives 6     [not legible]

Fig. 1. Payoff matrix displayed to subjects.
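
The two strategies of the simulated other can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' apparatus logic: the function names are hypothetical, choices are coded 1 and 2 for the two buttons, and the yoked random sequence is read as the matched contingent-condition subject's own choices shuffled within consecutive blocks of 20 trials, which is one plausible reading of the description above.

```python
import random

def contingent_other(subject_choices, rng):
    """Other's choice on trial n repeats the subject's choice on trial n-1.

    The first trial has no predecessor, so it is drawn by chance.
    """
    first = rng.choice([1, 2])
    return [first] + list(subject_choices[:-1])

def yoked_random_other(matched_subject_choices, rng, block_size=20):
    """Shuffle a matched subject's choices within consecutive blocks.

    Shuffling inside each block keeps the proportion of Button 1
    responses per block unchanged while removing any contingency.
    """
    shuffled = []
    for start in range(0, len(matched_subject_choices), block_size):
        block = list(matched_subject_choices[start:start + block_size])
        rng.shuffle(block)
        shuffled.extend(block)
    return shuffled

rng = random.Random(0)
subject = [1, 2, 2, 1, 1] * 20            # an invented 100-trial record
other_contingent = contingent_other(subject, rng)
other_random = yoked_random_other(subject, rng)
```

In the experiment itself the contingent sequence would be produced trial by trial as the subject responds; the function above reconstructs the same sequence from a completed record.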


Status of the Other 


The two levels of status (high versus low) were 
manipulated by identifying (in the instructions) the 
fictitious other to the subject. In the low-status con- 
dition, the subject was told he was playing against 
another undergraduate who was also earning research 
credits. In the high-status condition, the subject was
told that he was playing against a member of the
faculty.


Potential Power of Other 


Potential power was also manipulated by preexperi- 
mental instructions. In the high-power condition, the 
subject was instructed that the points accumulated in 
the game could be increased (by a factor of five) by the 
other—at the other’s discretion. In the low-power condi-
tion, this possibility was not indicated to the subject.


Apparatus


Subjects were placed in separate soundproof rooms 
each of which contained two response buttons, a signal 
light, and a small outcome panel (in the form of the 
game matrix). The response buttons and outcome 
panels were electrically connected to a control panel 
operated by the experimenter in a separate room. The 
game matrix was the same as that shown in Figure 1. 
Procedure


Subjects were initially given written instructions ex-
plaining the game matrix and its function in the experi-
ment. Although the instructions made it clear that the
subject was participating in an experimental game, no
indication was given as to what constituted cooperative
or noncooperative responses. Included in the instruc-
tions was information designed to manipulate the
status and potential power variables described earlier.
Subjects were also instructed that the three individuals
who accumulated the highest number of points would
each receive $5 at the conclusion of the study.

After the experimenter was satisfied that the subject
understood the instructions, the subject completed
100 trials of the game. Each trial was initiated by a
signal light which notified the subject to press one of the
two response buttons. After the subject had made his
response, a light was flashed for 5 seconds on the out-
come panel, informing the subject of his and the other’s
payoff for that trial. These steps were repeated for each
trial. At the end of the experiment, the subject was
asked to complete a posttask questionnaire.


Posttask Questionnaire 


The subject’s perceptions of the two response alterna-
tives (Button 1 versus Button 2) were assessed by
asking the subject to rate each alternative on 13 bipolar
adjective scales. The 7-point scales used were adopted
from the standard form semantic differential question-
naire; that is, most scales were taken from Osgood, Suci,
and Tannenbaum (1957). Specifically, the scales used
were cooperative-noncooperative, noncoercive-coer-
cive, reward-punishment, good-bad, important-unim-
portant, tenacious-yielding, complex-simple, inten-
tional-unintentional, stable-changeable, cautious-rash,
temperate-intemperate, orthodox-heretical. The stan-
dard semantic differential instructions accompanied the
scales (see Osgood et al., 1957).


RESULTS 


Of the 13 scales included in the postexperi- 
mental questionnaire, 4 were judged as more 
relevant to game theoretical assumptions than 
the others. In particular, the cooperative- 
noncooperative distinction is used in virtually 
all PD game experiments. In addition, the 
coercive-noncoercive distinction has been used
in conjunction with game behavior (cf. McClintock,
Gallo, & Harrison, 1965), as has
reward-punishment (McClintock & McNeel,
1966; Vinacke, 1969) and compliant-conflict
(Gahagan, Tedeschi, Foley, & Lindskold,
1970). The present analysis, therefore, focuses
upon these four scales. In each instance, the
subject’s ratings of each response alternative
are treated according to a 2 (Strategy of the
Other) X 2 (Potential Power of the Other)
X 2 (Status of Other) X 2 (Response Alterna-
tive) analysis of variance, with repeated meas-
ures over the last factor.


TABLE 1
MEAN RATINGS OF RESPONSE ALTERNATIVES ACCORDING TO CONDITION

                                               Condition
Scale                 RA     Overall M   Contingent    Random   High-status   Low-status
                                         other         other    other         other
Cooperative (1)       RA 1   2.06        1.84          2.28     2.63          1.50
Noncooperative (7)    RA 2   4.84        [illegible]   3.94     4.00          5.69
Noncoercive (1)       RA 1   3.73        3.13          4.34     3.78          3.69
Coercive (7)          RA 2   4.52        5.09          3.39     4.13          4.91
Reward (1)            RA 1   2.39        2.22          2.56     2.91          1.88
Punishment (7)        RA 2   [illegible] 4.66          2.88     3.16          4.38
Conflict (1)          RA 1   4.77        4.84          4.69     4.03          5.50
Compliant (7)         RA 2   2.78        2.41          3.16     3.34          2.22

Note.—RA = response alternative. Numbers in parentheses
refer to the rating on the 7-point scales.


Perception of Response Alternatives

Cooperative-noncooperative scale. The analysis
of variance described above yielded signifi-
cant main effects for response alternatives
(F = 50.02, p < .001) and other’s strategy
(F = 6.81, p < .025).³ Thus, it was found that
Response Alternative 1 was rated as more
“cooperative” than Response Alternative 2 (as
expected), and that both alternatives were
rated as more “cooperative” in the random
other than in the contingent other condition.
Of particular interest, however, were the
significant interactions between response alter-
native and strategy (F = 8.18, p < .01)
and between response alternative and status
(F = 12.79, p < .001).

With regard to Response Alternative
X Strategy, it was found that the mean ratings
for the two alternatives differed significantly
(p < .01) in the contingent other condition,
but not in the random strategy condition. As
can be seen in Table 1, the subjects’ perceptions
of each response alternative in the contingency
condition are close to what is typically assumed
in most PD studies. However, in the random
condition, the mean ratings of both alterna-
tives are on the cooperative side of the mid-
point. With regard to the Response Alternative
X Status interaction, it was found that the
mean ratings for the two alternatives were
quite diverse in the low-status condition
(p < .01), but did not differ significantly in
the high-status condition. Again, subjects’
perceptions in the low-status condition are
similar to what might have been expected.
In the high-status condition, however, the
mean rating for Response Alternative 2 was
considerably less noncooperative than might
have been expected. (Indeed, the mean rating
was neither cooperative nor noncooperative.)

3 All F ratios reported here have 1 and 56 degrees of
freedom. In addition, all individual comparisons re-
ported here are based upon the Tukey (a) technique
(Winer, 1962, pp. 87-88).
Coercive-noncoercive scale. Here, a significant 
main effect was found for response alternative 
(F = 4.94, p< .05), as was a significant 
Response Alternative X Strategy interaction 
(F = 11.42, p< .005). With regard to the 
main effect, it was found that Response 
Alternative 2 was seen as more coercive than 
Response Alternative 1—as anticipated. The 
significant interaction, however, indicates that 
the subjects’ ratings varied according to the 
strategy of the other. In the contingency con-
dition, the subject’s perceptions are consistent
with what is typically assumed; that is, Re-
sponse Alternative 1 is seen clearly as non-
coercive, Response Alternative 2 is seen clearly
as coercive (p < .05). In contrast, mean rat-
ings of the two alternatives in the random
strategy condition did not differ significantly 
(Table 1). 

TABLE 2
GAME BEHAVIOR AND PERCEPTIONS OF RESPONSE ALTERNATIVES

Behavior measure                  Group A     Group B     t
                                  (n = 41)    (n = 23)
Frequency of #1 responses         72.5        36.4        7.30**
Reciprocation of #1 responses     65.4        22.4        8.76**
Reciprocation of #2 responses     12.7        20.9        −1.88*

Note.—Subjects in Group A perceived the response alterna-
tives in the manner typically assumed; that is, Response
Alternative 1 cooperative, Response Alternative 2 noncoopera-
tive. Subjects in Group B perceived at least one alternative
differently.

* p < .05, one-tailed test.

** p < .01, two-tailed test.


Reward-punishment scale. The analysis of
variance results for this scale yielded signifi-
cant strategy (F = 6.73, p < .025) and re-
sponse alternative (F = 15.89, p < .001) main
effects, as well as Response Alternative
X Strategy (F = 9.49, p < .005) and Re-
sponse Alternative X Status (F = 10.64, p
< .005) interactions. As indicated in Table 1,
Response Alternative 1 was on the whole per- 
ceived as more rewarding than Response 
Alternative 2. Moreover, subjects in the ran- 
dom strategy condition perceived both alterna- 
tives as more rewarding than did subjects in 
the contingency condition. 

The Response Alternative X Strategy and
Response Alternative X Status interactions,
however, indicate that the subject’s percep- 
tions of the two alternatives varied according 
to condition. Specifically, when the other 
utilized a contingency strategy, the subject’s 
ratings of the alternatives were consistent with 
game theoretical assumptions; that is, Re- 
sponse Alternative 1 was seen as highly 
rewarding, while Response Alternative 2 was 
seen as highly punishing (p < .05). On the 
other hand, when the other utilized a random 
strategy, the ratings of the two alternatives did 
not differ significantly. (As can be seen in 
Table 1, neither alternative was perceived as 
punishing.) 

Conflict-compliant scale. The findings for 
this scale differed only slightly from those 
obtained for the preceding three scales. Spe- 
cifically, this analysis yielded a main effect for 
response alternative (F = 29.83, p < .001), as 
well as a Response Alternative X Status inter- 
action (F = 12.74, p < .001). Not surprisingly, 
Response Alternative 1 was perceived (on the 
whole) as a more compliant response than was 
Response Alternative 2. With regard to the
interaction, it was again found that when the
subject was paired with a low-status other, the
ratings of the two alternatives differed signifi-
cantly (p < .01); that is, Response Alternative
1 was clearly seen as a compliant response,
while Response Alternative 2 was clearly seen
as denoting conflict. When the subject was
paired with a high-status other, such was not
the case (see Table 1).


Perceptions of Response Alternatives and Game 
Behavior 


In order to examine the relationship between
the subject’s perception of each response
alternative and game behavior, the sample was
divided into two groups: (a) subjects whose
cooperative-noncooperative ratings were con-
gruent with the usual assumption about the
meanings of response alternatives in the PD
game and (b) subjects whose ratings [illegible
in source]. Specifically, subjects who rated Response
Alternative 1 on the cooperative side of the
scale midpoint and rated Response Alternative
2 on the noncooperative side of the midpoint
were assigned to the first group (n = 41). The
remaining subjects (n = 23) were assigned to
the second group.
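
The grouping rule can be stated compactly. The sketch below is an illustration only: the function name is hypothetical, the midpoint of 4 follows from the 7-point scale with 1 as the cooperative pole, and treating a rating exactly at the midpoint as falling outside the first group is an assumption based on the wording "side of the midpoint."

```python
MIDPOINT = 4  # center of a 7-point scale (1 = cooperative, 7 = noncooperative)

def perception_group(ra1_rating, ra2_rating, midpoint=MIDPOINT):
    """Assign a subject to Group A or Group B from two ratings.

    Group A: Response Alternative 1 rated on the cooperative side and
    Response Alternative 2 rated on the noncooperative side.
    Group B: every other pattern, including ratings at the midpoint
    (an assumption; the text does not say how ties were handled).
    """
    if ra1_rating < midpoint and ra2_rating > midpoint:
        return "A"
    return "B"

# Invented ratings for four hypothetical subjects.
examples = [(2, 6), (4, 6), (2, 3), (5, 2)]
groups = [perception_group(r1, r2) for r1, r2 in examples]
print(groups)
```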

These groups were then compared on the
following three behavioral indexes: (a) overall
level of cooperation (Response Alternative 1);
(b) proportion of cooperative responses by the
other which were reciprocated; and (c) pro-
portion of noncooperative responses by the
other which were reciprocated.⁴
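
The three indexes can be computed from a trial-by-trial record as in the sketch below. It rests on an assumption the text does not spell out: because both players choose simultaneously within a trial, "reciprocated" is taken here to mean that the subject's choice on the following trial matches the other's choice on the current trial. The function name and the example sequences are hypothetical.

```python
def behavior_indexes(subject, other):
    """Return (cooperation level, reciprocation of 1s, reciprocation of 2s).

    subject, other: equal-length lists of choices coded 1 or 2.
    Reciprocation of a choice is the proportion of trials on which the
    other made that choice and the subject made it on the next trial.
    """
    n = len(subject)
    cooperation = sum(1 for c in subject if c == 1) / n

    def reciprocation(choice):
        # Subject's next-trial responses after the other made `choice`;
        # the last trial is excluded because nothing follows it.
        followups = [subject[i + 1] for i in range(n - 1) if other[i] == choice]
        if not followups:
            return None
        return sum(1 for c in followups if c == choice) / len(followups)

    return cooperation, reciprocation(1), reciprocation(2)

# Invented four-trial record for illustration.
result = behavior_indexes([1, 2, 1, 1], [1, 1, 2, 2])
print(result)
```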

Significant differences between the two
groups were found for all three indexes.
Specifically, subjects who perceived each re-
sponse alternative in the expected manner
(Response Alternative 1 as cooperative, Re-
sponse Alternative 2 as noncooperative) were
characterized by a higher level of cooperation
and a greater tendency to reciprocate the
other’s cooperation than were subjects who per-
ceived the alternatives in an unexpected man-
ner. Less pronounced was the finding that
subjects in the latter group tended to recip-
rocate other’s noncooperative choices more
frequently than did the other subjects. These
results are shown in Table 2.

4 Inasmuch as cooperation-noncooperation is the
most frequently used description of game behavior, an
analysis based upon the subjects’ ratings on this scale is
reported here. Similar results are obtained in connection
with the other three rating scales.


DISCUSSION 


Of particular interest is the finding that a
substantial proportion of the subjects (slightly 
more than one-third) did not perceive the 
response alternatives as they are ordinarily 
interpreted in most PD studies. In this respect, 
the results lend additional support to previous 
suggestions that a subject’s perceptions in 
experimental games be further investigated 
(e.g., Vinacke, 1969). Equally important,
however, were findings that (a) the subject’s 
perceptions of the alternatives varied signifi- 
cantly according to experimental conditions 
(most notably according to status and strategy 
of the other), and (b) perceptions of the re- 
sponse alternatives were significantly related to 
the subject’s game behavior. Each finding is 
discussed in turn. 


Perceptions of Response Alternatives 


As reported here, the subject’s perceptions of 
the response alternatives departed most notice- 
ably from the usual interpretation of these 
alternatives when the other was presented to 
the subject as having high status, or when the 
other’s response strategy was random. When 
the other was presented as low status, or when 
the other engaged in a contingent response 
strategy, subjects tended to perceive the 
alternatives in the expected manner; that is, 
Response Alternative 1 as cooperative, Re- 
sponse Alternative 2 as noncooperative. 

That the subject’s perceptions of the alterna- 
tives varied with the other’s status is, in 
certain respects, not surprising. If assumptions 
can be made about culturally prescribed ex- 
pectations about inappropriate behavior by 
students toward others of higher status (in this 
case, faculty members), the influence of the 
other’s status upon perceptions in this situation 
can be accounted for within the framework of 
dissonance theory (Festinger, 1957). Specifi- 
cally, it is possible that noncooperative, 
punishing, or coercive behavior by students 
toward a faculty member is typically seen as 
inappropriate. (Although recent events in
university settings cast doubt on the univer- 
sality of such norms, the isolation and con- 
straints that typically accompany laboratory 
studies may serve to inhibit competing re-
sponse tendencies.) In contrast, when inter-
actions do not involve high- versus low-status
participants, the prescriptions against non- 
cooperation, coercion, etc., may not be as 
strong, nor as clear. 

Although admittedly speculative, the ex- 
pectations suggested above provide a basis for 
explaining the ratings of the response alterna- 
tives obtained here. Specifically, if a subject 
who was paired with a high-status other chose 
Response Alternative 2 on at least one occasion
(as did 92% of the subjects), he is then faced 
with either (a) recognizing that he has behaved 
in a culturally disapproved manner toward a 
high-status other (noncooperative, coercive, 
etc.), or (b) not perceiving Response Alternative 
2 as noncooperative, coercive, etc. If it is 
assumed that violation of cultural norms 
(which one shares) results in cognitive dis- 
sonance, Outcome b above can be predicted 
as a means of dissonance reduction. Support 
for this interpretation is afforded by the results. 
As shown in Table 1, perceptions of Response 
Alternative 2 (ordinarily assumed to be non- 
cooperative) were most noticeably influenced 
by the other’s status; the subject’s perception 
of Response Alternative 1 was little affected 
by this manipulation. 

While the effects of the other’s status upon 
the subject’s perceptions of the response alter- 
natives can be accounted for in terms of 
dissonance reduction, the dissonance theory 
framework seems less useful in explaining the 
effects of the other’s strategy. On the contrary,
a quite different explanation—involving the
veridicality of the subject’s perceptions— 
seems most plausible. Specifically, when the 
other utilizes a random strategy, and is there- 
fore totally unresponsive to the subject’s own 
strategy, the meanings of noncooperation, 
coercion, and punishment in the interaction are 
ambiguous. That is, when the subject’s choices 
have no effect upon the behavior of the other— 
which is necessarily the case in the random 
other condition—for the subject then to rate 
Response Alternative 2 as noncooperative, 
coercive, or punishing would involve mis-
perception of the situation. The opposite would
be true when the other is totally responsive to
the subject’s strategy; that is, as in the con-
tingent other condition. In short, if the sub-
ject’s perceptions of the situation are influenced
by the effects of his own behavior upon the
other, one might expect that the subject would 
perceive less difference between the two alterna- 
tives in the random other condition than in the 
contingent other condition. 


Perceptions and Game Behavior 


As noted at the outset, if perceptions of the 
situation are unrelated to the subject’s be- 
havior in the interaction, questions about the 
subject’s perceptions may be of limited impor- 
tance. The results obtained here, however, re- 
vealed a strong relationship between percep- 
tions and game behavior. In particular, it was 
found that when subjects perceived one alterna- 
tive as cooperative and the other as noncooper- 
ative, the overall level of cooperation as well 
as the reciprocation of cooperation was high, 
while reciprocation of noncooperation was low. 
Quite different levels of responding charac- 
terized subjects whose perceptions departed 
from the usual interpretations of the response 
alternatives. 

While the empirical relationship between 
perceptions and behavior is clear-cut, the 
direction of causality is not. On the one hand, 
it is possible that when subjects fail to perceive 
Response Alternative 2 as noncooperative (as 
did 96% of the subjects in the “atypical” 
perceptions group), whatever unfavorable con- 
notations might be associated with this re- 
sponse become less salient for the subject than 
would otherwise be the case. Hence, the like- 
lihood that Response Alternative 2 will be 
chosen would increase—as was observed. 

On the other hand, an equally plausible 
interpretation is possible if one conceives of 
the subject’s game behavior as shaping, or 
influencing, subsequent perceptions of the 
game. Thus, those subjects who showed a 
strong preference for the noncooperative re- 
sponse (Response Alternative 2) may have 
subsequently rated that response in a some- 
what favorable manner; that is, as more cooper- 
ative than not. Such an explanation, of course, 
assumes that the self-image of the subjects who 
participated in this experiment is one of 
cooperation, rather than noncooperation. If 
this assumption is justified, this latter inter-
pretation of the perception-behavior relation-
ship would be consistent with cognitive dis-
sonance theory. However, inasmuch as the
subject’s perceptions of the response alterna-
tives were assessed at the conclusion of the
experiment, it is unclear which interpretation is
more plausible.


Despite the problems of interpreting the
perception-behavior relationship, the broad
implications of the findings seem clear. The
manner in which the subject perceives the
response alternatives bears a strong relation-
ship to the subject’s behavior in the PD game,
and what may appear to be noncooperation by
the experimenter may not be perceived as
noncooperation by the subject—at least in
retrospect. An adequate explanation of conflict
behavior in the PD game, therefore, would
seem to require that the subject’s perceptions
be taken into account, and that the conditions
which influence these perceptions be more fully
explored.


COGNITION OF AROUSAL AND ACTUAL AROUSAL
AS DETERMINANTS OF EMOTION¹


DAVID GOLDSTEIN, DAVID FINK, AND DAVID R. METTEE²


Yale University 


Valins’ false heart rate procedure was replicated in the first part of the present
study. Results showed that increases in false heart rate feedback (cognition
of physiological arousal) led to increases in reported emotionality. (Nudes for
which “heart rate” increased were rated higher than nudes for which heart
rate remained constant.) In addition, while subjects were being exposed to
the nudes, their actual heart rates were determined. It was found that cogni-
tion of physiological arousal led to changes in actual physiological arousal
(change in heart rate), but reported emotionality was not consistently related
to those changes. In the second part of the study, an attempt was made to
experimentally vary the level of emotion, in particular, the feeling of being
offended, by exposing some subjects to male nudes (offensive condition). These
subjects reported that they were more offended by the experiment than did
subjects exposed only to female nudes. However, in the offensive condition,
speedups in false heart rate feedback did not produce any systematic effects
either on actual physiological arousal or on ratings of emotionality (in this
case, offensiveness), but there was a significant relationship between actual
physiological arousal (heart rate change) and reported emotionality. The
implication of these results is that physiological arousal apparently functions
as more than a mere cognitive cue to be interpreted by the individual along
with many other cognitive cues. The present data suggest that actual physio-
logical arousal was a crucial mediator of emotion, since, regardless of a
subject’s artificially provided cognition of physiological arousal, he seemingly
did not experience much emotion unless there was actual physiological arousal.


James recognized as long ago as 1884 that 
physiological changes accompany emotion. He 
hypothesized that “bodily changes follow di- 
rectly the perception of the exciting fact 
and that our feeling of these same changes as 
they occur is ‘emotion’ [p. 189].” Cannon 
(1929) later found that the physiological 
states associated with widely different emo- 
tions were essentially identical. This result 
forced theorists to consider the importance of 
cognitive factors in determining emotion. 

Schachter and Singer (1962), among others,
have emphasized the effects of cognitive fac-
tors and their interaction with physiological
arousal: Emotion is “a function of a state
of physiological arousal and of a cognition
appropriate to this state of arousal [p. 380].”

1 This study was supported in part by National
Science Foundation Grant GS 2749 to David R.
Mettee. This paper is based on research supervised
by Mettee, for which Goldstein and Fink received
the 1970 Angier prize, an annual award for the
outstanding Yale undergraduate research project in
psychology.

2 Requests for reprints should be sent to David R.
Mettee, Department of Psychology, Yale University,
333 Cedar Street, New Haven, Connecticut 06510.

In most natural situations, of course, the 
two factors appear simultaneously. In experi- 
ments, they are separable. Maranon (1929) 
injected subjects with adrenalin, thereby in- 
creasing their physiological arousal, and asked 
them to introspect. The situation provided no 
cognitive cues for emotion, and subjects did 
not experience emotion. Maranon concluded 
from his results that the physiological com- 
ponent alone was not sufficient to cause emo- 
tion. 

Schachter and Singer (1962), in turn, 
found that an explanation of emotion in terms 
of cognitions alone also was insufficient. They 
compared the emotions of the two sets of 
subjects who were provided with identical 
cognitive cues for an emotion. One set of sub- 
jects had been injected with epinephrine and 
was physiologically aroused, as evidenced by 
an increase in heart rate, while the other set 
had been injected with a placebo and was not 
aroused. Schachter and Singer found that the 
aroused subjects experienced significantly 
higher levels of emotion than the nonaroused 
subjects. Since cognitive cues about external
events were identical for both groups, the dif- 
ferences in their emotional level must have 
been due solely to differences in their state 
of physiological arousal. 

Schachter and Singer (1962) offered three 
propositions to explain these results. First, 
“Given a state of physiological arousal for 
which an individual has no immediate expla- 
nation, he will ‘label’ this state and describe 
his feelings in terms of the cognitions avail- 
able to him [p. 381]”; second, “Given a state 
of physiological arousal for which an indi- 
vidual has a completely appropriate explana- 
tion . . . no evaluative needs will arise and the 
individual is unlikely to label his feelings in 
terms of the alternative cognitions available 
[p. 382]”; and third, “Given the same cogni- 
tive circumstances, the individual will react 
emotionally or describe his feelings as emo- 
tions only to the extent that he experiences a 
state of physiological arousal [p. 382].” 

Valins (1966) takes the analysis one step 
further: “Once it is granted that internal 
events can function as cues or stimuli then
these events can now be considered as a
source of cognitive information [p. 400].” 
According to Valins, physiological arousal does 
not function in any intrinsically special way, 
but rather cognition of arousal is one of many 
cognitions reaching the neocortex that are eval- 
uated together and constructed by the indi- 
vidual as emotion. Thus, Valins is saying that 
cognitions about external events provide ex- 
planations for states of physiological arousal, 
but that the importance of actual arousal re- 
sides not in its physiological effects, but in its 
being an internal event about which there are 
cognitions. It is the cognition of this internal 
event combined with cognitions about ex- 
ternal circumstances that produces an emo- 
tion, according to Valins. 

Valins (1966) tested his hypothesis by ask- 
ing if the specific cognitive cues associated 
with actual physiological arousal are acted 
upon by the individual in the same way that 
he acts upon other cognitive cues. In particu- 
lar, he investigated the effects on emotion of 
nonveridical representations of physiological 
changes. Valins (1966) argued: “If cognitive
representations of internal events are impor-
tant for emotional behavior, then these non-
veridical representations of physiological
changes should have the same effects as
veridical ones [p. 401].”

In his experiment, Vali: 
slides of Playboy nudes, 
at them, they heard clicks w 
led to believe, corresponded to their heart- 
beats. Actually the clicks were tape recorded, 7 
The feedback indicated to the subjects that” 
when certain of the nudes appeared, their” 
heart rate increased, Lat hen they werey 
asked to rate the nudes on < ile of attrac- 
tiveness, they did indeed rate higher those 
girls for which they heard “heart rates” ™ 
increase. Valins conclude hat “Internal 
events are a source of cognitive information | 
and . . , individuals will wart to evaluate and ~ 
understand this kind of information, When an 
emotional explanation is prepotent, they will” 
label their reactions accordingly |p. 407]? 
In short, Valins concluded that it is the cog- 
nitive information about internal events, not. 
the internal event itself, that contributes to 
emotion. s 

This conclusion, however, should be re-
garded as tentative, since Valins did not con-
sider that false heart rate feedback might
have had a direct effect on actual heart rate,
that is, a mimic effect, whereby actual in-
ternal changes occurred. If such an effect
existed, the subjects in Valins’ experiment
might not have been responding to nonveridi-
cal cognitive cues, but to actual changes in
heart rate (i.e., an actual internal event).
Valins never measured the actual heart rate
of his subjects. Rather, he claims that actual
heart rate did not increase based upon the
reports of subjects that they did not feel their
heartbeats or palpitations of any kind during
the experiment.

The question addressed in the present study
was this: Which was the crucial component of
emotion in Valins’ investigation, actual change
in heart rate or cognition of heart rate
change? Valins asserts that cognition of
physiological change is sufficient. But is it?
If cognition of physiological arousal does not
produce, through a “mimic effect,” changes in
actual physiological arousal, does level of
emotion change? The two alternative views
are diagrammed below:


A.  Cognition of physiological arousal  --->  Emotion
    (False heartbeat)                         (Subject’s ratings)

B.  Cognition of physiological arousal  --->  Actual physiological arousal  --->  Emotion
    (False heartbeat)                         (Actual heartbeat)                  (Subject’s ratings)


The direct effect of cognition of physio- 
logical arousal on emotionality (as measured 
by subject ratings) is shown in “A.” Here, 
actual arousal is not necessary for the pro- 
duction of emotion and so can be bypassed. 
The mechanism diagrammed in “B” involves 
the indirect effect of cognition of physiological 
arousal. In B, a cognition of physiological 
arousal produces, via the mimic effect, a 
change in actual physiological arousal, and it 
is the change in actual physiological arousal 
that is the real source of the physiological 
component of an emotion. In B, a cognition 
of physiological arousal will not lead to a 
change in emotionality unless there is also a 
change in actual physiological arousal. 

In order to distinguish between these two 
possibilities, Valins’ experiment was replicated, 
but with the added feature of monitoring the 
subject’s actual heart rate while presenting 
him with nudes and false heartbeat feedback. 
In addition, after reasoning that viewing 
Playboy nudes may not be overly arousing 
for typical males from a college population, 
and so might produce only minimal emotion, 
we attempted in the second part of the experi- 
ment to produce a stronger emotion in the 
subjects by presenting them with male nudes. 
The effect of this procedure upon actual heart 
rate was observed, and the interaction be- 
tween reported emotion, false heart rate feed- 
back, and actual heart rate was determined. 


METHOD 
Overview 


Male introductory psychology students were re- 
cruited for an experiment labeled “Physiological 
Correlates of Visual Sexual Stimuli,” a study os- 
tensibly being run by the medical school in con- 
junction with the psychology department of Yale 
University. The subjects were shown pictures of 16 
nudes projected by an opaque projector onto a wall 
of the experimental room. While they viewed the 
nudes, their heart rates were recorded using a poly-
graph, with the subjects in the experimental condi-
tions hearing clicks that they believed corresponded
to their heartbeats. For 2 of the first 8 nudes, their
“heart rates” increased enormously. Control subjects 
were not provided with fake heart rate feedback. 
Subjects were then presented with the same 8 nudes, 
under the same heart rate feedback conditions, and 
rated them on a scale of attractiveness. The ratings 
were written down and collected, and then a second 
series of nudes was presented. Half of the experi-
mental subjects also heard their heart rates increase
for the last 4 nudes in this second series, while half 
heard a constant heart rate. Each of these groups 
was cross-cut so that half saw 4 Playboy nudes for 
the last four pictures, and half saw 4 male nudes. 
As before, control subjects did not hear heart rate 
feedback. After the subjects had left the experimental 
room, they were asked to rate how offended they 
were by the experiment, and following this they 
were debriefed. 


Procedure 


Each subject was brought to a room in the medi- 
cal school by an “undergraduate hired by the medi- 
cal school.” This undergraduate was one of the two 
experimenters (“Experimenter 1”). Experimenter 1
was supposedly coordinating a number of studies 
involving the physiology of sex, and he told the 
subject that he had been assigned to one of four 
conditions: heart rate, galvanic skin response, respi- 
ration, or sweat. The subject was always assigned to 
the heart rate condition. This meant that while the
“visual sexual stimuli, in other words, pictures of 
nudes,” were presented to the subject, his heart rate 
would simultaneously be recorded. Experimenter 1 
then introduced the subject to Experimenter 2, who 
was running the heart rate measurement. Before the 
subject left with Experimenter 2, Experimenter 1 
reminded the subject to return to him after the 
experiment was over. 

Once inside the experimental room, Experimenter 
2 explained to the subject: 


This is one of a bunch of studies being run by 
the medical school concerning physiological cor- 
relates of visual sexual stimuli. I’ve got a kind of 
bursary job at the medical school, and I’m run- 
ning this particular test, which is heart rate. What 
I’m going to do is project some pictures of nudes 
onto the wall, and then while you’re looking at 
them, your heart rate will be recorded by that 
polygraph over there. 


Then, while he was strapping the electrodes from 
the polygraph onto the subject, Experimenter 2 
added to experimental subjects only: 


The way this thing works is that the electrical 
impulses which accompany each heartbeat are 
picked up by these electrodes. The impulses go
through a series of amplifiers [one bank of the 
polygraph was clearly labeled as, and indeed was, a 
“drive amplifier”]. This is a rather old and beaten-
up polygraph, as you can see, and works by an
old method using conversion of the electrical im- 
pulses into sound and amplification of the sound, 
so that it is actually the amplified sound which is 
running the polygraph pens. More modern EKG 
machines are a lot easier to use, and they don’t 
have the sound intermediate, but this machine is 
the only one we could get a hold of. It is sensitive,
though, and so once you’re all hooked up, I’m 
going to ask you to assume a comfortable position 
and try to hold it. Oh, by the way, because this 
polygraph has that amplified sound intermediate, 
you'll probably be able to hear blips or clicks 
corresponding to your heartbeat. Although it may
be interesting to you, it’s irrelevant to us—just an 
unfortunate artifact of using this old machine— 
so if you hear it, try to ignore it. OK, now I'll be 
showing you pictures of nudes—about 15 seconds 
apiece—and you just sort of look at them. 


In fact, the “blips or clicks” were prerecorded and 
played on a tape recorder hidden behind the poly-
graph that was used to measure the actual heart rate.
The 42 experimental subjects heard these sounds 
while they viewed the nudes; 18 control subjects 
were hooked up without the explanation about 
sound amplification, and they heard no false heart 
rate feedback.

Heart rate was used as a measure of actual physio- 
logical arousal not because of a belief that heart 
rate has some special relation to emotion that dis- 
tinguishes it from other physiological parameters, but 
rather because Valins used false heart rate feedback
as a “cue” about subjects’ internal states and be-
cause he assumed that the actual heart rates of his
subjects remained constant. We wished to test the
validity of that assumption and did so by monitor-
ing the heart rates of our subjects. Other autonomic
measures were not taken because we were interested 
only in heart rate, and since autonomic measures 
typically show low intercorrelations with each other 
(Lacey, Kagan, Lacey, & Moss, 1963), obtaining 
other such measures would not have substantially 
strengthened the heart rate measure. Admittedly, it 
would have been desirable to take other measures as 
independent assessments of arousal state, but this 
was not done for other methodological reasons. Thus,
of course, generalizations we propose in this study 
about the relation between physiological arousal and 
emotion must be limited to the particular physio- 
logical parameter measured (i.e., heart rate). 

Tape recordings. The tape recordings were made 
using a Hewlett-Packard low-frequency beat gen-
erator, which was plugged into the tape recorder.
The recording began with beats varying between 66 
and 72 beats per minute. When the time came for 
increases in false heart rate, the rate of the beats was 
increased to 90 beats per minute, then varied between 
78, 84, and 90 beats per minute, until, when it was 
again time for nonspeeded false heart rate, the rate 
of the beats was decreased slowly until it reached 
the 66-72 beats per minute base line.

Heart Rate Measures


Heart rate was recorded for each subject through-
out the experiment using a Grass polygraph with
electrocardiograms. Each of the first eight nudes
was presented for a 15-second interval; actual heart
rate was determined by counting the number of
heartbeats that occurred in the middle 10 units of
the 15 units of polygraph paper used during the 15-
second interval. Heart rates across intervals were
compared by counting the number of beats per 10
units of polygraph paper making up the middle 10
seconds of the interval. In this manner, we hoped to
minimize the influence on our results of transient
heart rate changes either at the beginning or near
the end of each presentation.
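
The counting rule just described can be sketched in a few lines. This is only an illustration, assuming that one unit of polygraph paper corresponds to roughly 1 second and that beats are available as timestamps; the function name and the sample data are hypothetical and do not come from the study.

```python
def beats_in_middle_window(beat_times, interval_start,
                           interval_len=15.0, window_len=10.0):
    """Count beats in the central `window_len` seconds of one presentation.

    beat_times: beat timestamps in seconds (hypothetical input format).
    The first and last (interval_len - window_len) / 2 seconds are skipped
    so that transient changes at either edge are not counted.
    """
    margin = (interval_len - window_len) / 2.0
    lo = interval_start + margin
    hi = lo + window_len
    return sum(1 for t in beat_times if lo <= t < hi)


# Hypothetical subject: one beat every 0.8 s over a 15-second presentation.
beat_times = [i * 0.8 for i in range(19)]
count = beats_in_middle_window(beat_times, interval_start=0.0)
print(count)  # beats per 10 seconds in the middle of the interval
```

Because every interval is reduced to the same 10-second window, counts from different presentations are directly comparable.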

In the second part of the experiment, a similar
method of recording heart rate was employed, except
that heard heartbeat was either constant for two
sets of four nudes presented consecutively for 15
seconds apiece (a total period of 2 minutes) or was
constant for the first four consecutively presented
nudes (a 60-second time period) and then speeded
up during the presentation of the last four nudes.
Actual heart rate during the first 1-minute period,
when heard heart rate was always constant, was
measured in beats per 30 units of polygraph paper
(approximately 30 seconds) during the middle por-
tion of this 1-minute period. Then this rate was
compared with the heartbeat per 30 seconds during
the middle portion of the second 1-minute period,
where heard heartbeat speeded up for some subjects
and remained constant for others. The 1-minute
measurement intervals allowed a greater amount of
time for heart rate to recover from the “surprise” 
component of a male nude presentation and yet pro- 
vided for ample recording time. All of these criteria 
for counting heartbeats were established prior to 
data collection. Changes in heart rate per se rather 
than increases in heart rate were used as the depen- 
dent measure of physiological arousal. In the first 
part of the experiment, absolute deviations in heart 
rate were obtained by taking a subject’s actual 
heart rate for each of the six “non-speeded-up” 
nudes, averaging these six figures, and finding the 
absolute difference between that average and the 
subject’s actual heart rate when viewing each of the 
eight nudes. In the second part of the experiment, 
absolute deviations were obtained by taking the 
heart rate for the last four nudes (females for half 
of the subjects, males for the other half) as a 
deviation from the mean heart rate for the first four 
nudes (females in all cases). 
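
The two deviation measures described above can be written out as follows. This is a minimal sketch with made-up heart rates; the function names are hypothetical, and the positions of the speeded-up nudes (4 and 7) are taken from the procedure.

```python
from statistics import mean


def part_one_deviations(rates, critical_positions=(4, 7)):
    """Absolute deviation of each nude's heart rate from the base line.

    rates: heart rates in beats per 10 seconds, rates[0] being Nude 1.
    The base line is the mean over the six nudes for which heard heart
    rate was constant (all positions except the critical ones).
    """
    baseline = mean(r for pos, r in enumerate(rates, start=1)
                    if pos not in critical_positions)
    return baseline, [abs(r - baseline) for r in rates]


def part_two_deviation(first_four, last_four):
    """Absolute deviation of the last four nudes from the first four."""
    return abs(mean(last_four) - mean(first_four))


# Hypothetical subject: heart rate rises on Nude 4 and falls on Nude 7.
rates = [12, 12, 12, 14, 12, 12, 10, 12]
baseline, deviations = part_one_deviations(rates)
print(baseline, deviations)
```

In this made-up case the rise on Nude 4 and the fall on Nude 7 both yield the same absolute deviation, which is the property the measure is meant to have.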

The reasons for using absolute changes rather than 
increases in heart rate as an index of physiological 
arousal were twofold. First, it has been found that 
different people in the same situation may become 
aroused (as determined by self-reports and other 
physiological indexes of arousal), but yet heart rate 
will decrease for some and increase for others (Lacey,
1967; Lacey et al., 1963). For example, Lacey found
that for some persons heart rate decreases in an 
“arousing” situation and palmar conductance in-
creases (this phenomenon has been labeled “direc-
tional fractionation”), while for others both indexes
increase. In all cases, however, the aroused person’s 
heart rate changed from resting level, which sug- 
gested that absolute change in heart rate would be 
an appropriate index of arousal. Second, investi- 
gators (Goodman & Gilman, 1970) have found that 
heart rate increase is not a precise index of physio- 
logical arousal even when the person has been in- 
jected with epinephrine. Even though an injection 
of epinephrine generally produces a sizable increase 
in heart rate, at times it is primarily the variability 
rather than the level of the heart rate that increases. 
This happens because epinephrine not only produces 
tachycardia (rapid speedup in heartbeat), but also 
produces pronounced arrhythmia (irregularity in 
heartbeat rhythm). If this irregularity is severe
enough, it not only introduces variability within 
the total of the period, but depending on the 
size of the measurement interval it can lead to a 
recorded decrease in the aroused individual’s heart 
rate (see Plutchik & Ax, 1967). In addition to these 
findings, it has been shown that with more sensitive 
techniques (Berkhout, Walter, & Adey, 1969), heart 
rate change occurs in response to emotionally sig- 
nificant verbal stimuli (i.e., personal questions about 
sexual and ethical topics), and these changes are 
characterized by high variability. Given the above 
findings, it was thought that just looking for in- 
creases in heart rate might mask actual physiological 
arousal occurring in response to the heard heart- 
beat—arousal that would be reflected by increases in 
heart rate in most cases but by decreases in other 
cases. In both instances, however, actual heart rate 
would have changed; thus, a measure of absolute
change would be a more refined index of arousal. 
This in fact turned out to be the case. When only 
increases in actual heart rate were considered, a 
speedup in heard heart rate tended to produce a 
corresponding speedup in actual heart rate, but this 
effect was only directional and not significant. But 
when change per se was measured, a speedup in 
heard heart rate produced significant changes, dis- 
regarding direction, in actual heart rate. Similarly, 
in the second part of the experiment, which was 
clearly an “arousing” situation, the significant find- 
ings held only if actual heart rate was considered 
in terms of change per se rather than in terms of 
increase in heart rate. It should also be noted that 
in the present study, heart rate variability was a 
between-subject phenomenon and did not occur 
within subjects. That is, inspection of heart rate 
over the entire recording interval showed that while 
some subjects’ heart rates increased and others’ de- 
creased when they heard their heartbeat speed up, 
the changes resulted from a very regular and stable 
deceleration or acceleration over the entire recording 
period. Thus, our subjects either experienced an 
increase or a decrease in heart rate at the onset of 
heard heart rate increase and maintained this change 
over the entire recording interval. (A further con- 
sideration here is that although absolute deviation 
in heart rate was used as the dependent measure, 
the manipulation of heard heart rate involved only
increases. This was done for the simple reason that
we thought a speedup in heard heart rate would
provide the subject with a clear-cut cognition of
physiological arousal, while a heard slowdown would
create cognitive confusion.)
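
The masking argument made above can be illustrated with a small numerical sketch. The change scores below are invented for illustration only and are not data from the study.

```python
from statistics import mean

# Hypothetical heart rate changes (beats per 10 seconds) for six subjects
# who heard a speedup: some accelerate, others decelerate.
changes = [2, -2, 1, -1, 2, -2]

mean_signed_change = mean(changes)                     # directions cancel
mean_absolute_change = mean(abs(c) for c in changes)   # direction ignored

print(mean_signed_change, mean_absolute_change)
```

With these invented values the signed mean is zero even though every subject's heart rate changed, while the absolute mean reflects the change.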

Pictures. The first eight pictures that the subjects 
viewed were all pinups from Playboy magazine. Each 
of these nudes was presented for 15 seconds, with 
15-second intervals between presentations. The 
recorded heartbeat increased during the presentation 
of the fourth and seventh pictures. The assignment 
of the nudes to the critical positions was randomly 
determined, so that each of the critical positions was 
occupied with about equal frequency by all of the 
first eight nudes.

Heard heart rate was increased on only the fourth 
and seventh nudes, rather than on a randomly 
determined pair of positions, for two reasons. First, 
in order for the subjects to detect that their heart 
rate had changed upon presentation of a particular 
nude, it was necessary to establish a base line against 
which to compare a change. If heard rate was high 
for initial nudes, the subjects would only realize this 
in retrospect when they heard a lower heart rate for 
subsequent nudes. This would not be the case when 
heard heart rate increased for a nude at a later po- 
sition, and hence a “speedup” for an early nude 
would not carry the same information as a speedup 
for a later nude. To equate the two speedup nudes 
in terms of the background against which increased 
heart rate was compared (as was the case in Valins’, 
1966, study), Positions 4 and 7 were selected. In the 
initial presentation, heard heart rate was considered 
to be dominated by extraneous factors; hence, both 
Nudes 4 and 7 were preceded by two presenta- 
tions in which heard heart rate was constant and 
normal. A second reason for not varying position 
was that before each session the nudes were ran- 
domly assigned to the speedup positions, and this 
was thought to sufficiently equate for extraneous 
determinants of ratings at a given position.

After the first eight pictures were shown, Experi- 
menter 2 interrupted the presentation, explaining: 


Now what I’ve shown you isn’t the real presenta- 
tion yet. I'll show you that in a minute. The 
purpose of this presentation was to sort of warm 
you up. In the presentation I will show you, there 
are currently 8 nudes, but we’ve decided that we 
need 10, because we’re getting into all sorts of 
statistical hassles using only 8. So eventually we're 
going to take 2 more nudes from the 8 I just 
showed you and put them into the real presenta- 
tion. We want a little subject feedback on this, 
because obviously we want the most exciting nudes 
to get into the presentation, so we can get better 
results. As a result, I’m going to show you the 
8 nudes you just saw again, and as each girl 
goes by, I’d like you to rate her on a scale going 
from 1 to 100, where 1 means you didn’t like her 
at all, she turned you off, she was ugly, not 
attractive at all, and 100 means, gee, she was 
really great, really turned you on, beautiful, really
attractive to you; and 50, of course, is a perfectly
neutral score, OK?


Experimenter 2 then presented each picture again 
for 5 seconds apiece, with 5 seconds between presen-
tations. The order was the same as it was previously
for the subject, and once again, the experimental 
subject's heart rate increased for the fourth and 
seventh nudes. When the subject was through, 
Experimenter 2 collected the ratings that the subject 
had written down and continued with the experiment. 

The offensive part. The second part of the experi- 
ment involved an attempt to produce a stronger 
emotion in the subject—in particular, the feeling 
of being angered or offended by the experiment. 
To do this, two of the experimental groups were 
shown pictures of male nudes: the “real presentation” 
consisted of four Playboy nudes followed by four
male nudes. Each nude was presented for a 15-second 
period, with the periods occurring consecutively 
without an interval between presentations. Thus, in 
the offensive condition, a subject viewed female 
nudes for the first minute of a 2-minute time period 
and viewed male nudes during the second minute. 
Subjects in one of these male nude groups heard 
their heart rate go up when they saw the male 
nudes. The other group heard their heart rate, but 
it remained constant. In addition, one of the control 
groups was assigned to the offensive condition. As 
in the first part of the experiment, this group did 
not hear any false heart rate feedback. An 
“inoffensive” condition involved a “real presenta- 
tion” that paralleled the offensive condition, except 
that it consisted of eight Playboy nudes.

The six groups involved in the real presentation 


TABLE 1

MEAN RATINGS OF THE EIGHT NUDES BY SUBJECTS
WHO RECEIVED FALSE HEART RATE FEEDBACK AND
ABSOLUTE MEAN DEVIATION OF ACTUAL HEART
RATE (BEATS PER 10 SEC.) WHILE VIEWING A
PARTICULAR NUDE FROM AVERAGE ACTUAL
HEART RATE FOR THE SIX NUDES FOR WHICH
HEARD HEART RATE WAS CONSTANT

Nude    Mean ratings (a)    Absolute mean deviation (b)
1       72.15               .94
2       69.95               .58
3       70.69               .62
4       80.65               .98
5       66.20               .63
6       68.55               .59
7       81.95               .89
8       71.17               .60

Note.—Italicized means (Nudes 4 and 7) indicate critical
positions where heard heart rate increased.

(a) Ratings of Nudes 4 and 7 combined compared against rat-
ings of the remaining six nudes combined produced a dependent
t = 7.96 (p < .001). Analysis of variance was not used here
because although there were eight nudes, there were only two
viewing conditions—heard speedup in heart rate and heard con-
stant heart rate.

(b) Nudes 4 and 7 combined, compared with Nudes 2, 3, 5, 6,
and 8 combined, produced a dependent t = 3.70 (p < .001).
part of the experiment constituted a 2 × 3 design,
with 2 levels of offensiveness and 3 levels of feed-
back. That is, the offensive and inoffensive condi-
tions included three groups: speedup in false heart
rate (“speedup” condition); no speedup in false 
heart rate (“nonspeeded” condition); and control 
(no-feedback condition). Note that the four groups 
that heard false heart rate feedback were the experi- 
mental subjects of the first part of the experiment. 
The two control groups were comprised of the 
control subjects from the first part of the experiment, 
and in both instances they did not receive false 
heart rate feedback. 
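
As a quick check on the design just described, the six cells can be enumerated by crossing the two factors. The labels are paraphrased from the text; the variable names are hypothetical.

```python
from itertools import product

offensiveness = ["offensive (male nudes)", "inoffensive (female nudes only)"]
feedback = ["speedup", "nonspeeded", "control (no feedback)"]

# Cross the two factors to list every cell of the 2 x 3 design.
cells = list(product(offensiveness, feedback))
for stimulus, heard in cells:
    print(f"{stimulus:32s} x {heard}")
```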

Posttesting and debriefing. After the subject had 
viewed the real presentation, Experimenter 2 un- 
hooked the electrodes. While he did so, he said: 


That’s it. Let me just unhook you, and you can 
get out of here. By the way, there’s a guy from 
the medical school in the main office you're sup- 
posed to go to. The medical school is a little 
wary about hiring undergraduates to do this sort 
ething—so he'll 

şut how I ran 

it in general, 

ibject back to 

ly spoke to 


the subject.

When Experimenter 2 had left, Experimenter 1, 
who was blind to the subject’s offensive condition, 
explained to the subject: 


Dr. Mettee, who’s in charge of this whole study, 
is somewhat hesitant about having undergraduates
run his experiment for him. Since he isn’t here at
night to see how the experiment is going, he gave
me this form for you to fill out in evaluating 
how your experimenter conducted the experiment. 
You see, the experimenter has applied for a grant 
to continue this sort of research, and Dr. Mettee 
needs some way of evaluating him. The experi- 
menter knows about this rating, but he will never 
see your answers. 


The questionnaire contained eight questions. The 
subject was instructed to write down the evaluation 
that he thought was appropriate on a scale from 
1 to 21, where 1 meant “not at all,” 21 meant “very 
much,” and 11 meant “neutral.” The crucial question 
was the fifth one, which asked the subject to report 
how offensive he found the experiment. 

After he filled out the questionnaire, the subject 
was questioned obliquely about his suspicions about 
the experiment. First, Experimenter 1 asked if he 
had any questions, then, how he felt hearing his
own heartbeat, and finally, Experimenter 1 stated
that before any subjects had been run, he had him-
self been hooked up to the machine and heard his
heartbeat, and that it was quite an experience.
If the subject showed no suspicions that what he heard
was not his own heartbeat, he was considered “without
suspicions.” None showed any suspicions, including
the controls who did not receive false feedback.
Experimenter 1 then told the subject that the
debriefing was, for now, over, but that some infor- 
mation had been left out. He gave the subject the 
phone number of one of the experimenters and told 
the subject that in a few days he could call the 
experimenter to find out about the deception.


RESULTS AND DISCUSSION 


The second column in Table 1 lists the 
mean ratings given by the experimental sub- 
jects for the first eight nudes. The results 
confirm Valins’ (1966) findings. The nudes 
presented fourth and seventh were rated sig- 
nificantly higher than the other six nudes by 
an average of 12 points on a 100-point scale. 
The control groups showed no significant dif- 
ferences among the means for the eight nudes. 
The subjects who heard their heart rate speed 
up for certain nudes did indeed rate those 
nudes higher than the nudes for which they 
heard no speedup in their heart rates. 
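
The size of this difference can be recomputed from the mean ratings listed in Table 1. The sketch below uses only those eight means; the variable names are hypothetical.

```python
from statistics import mean

# Mean ratings from Table 1, keyed by presentation position.
ratings = {1: 72.15, 2: 69.95, 3: 70.69, 4: 80.65,
           5: 66.20, 6: 68.55, 7: 81.95, 8: 71.17}
critical = (4, 7)  # positions where heard heart rate increased

critical_mean = mean(ratings[p] for p in critical)
other_mean = mean(r for p, r in ratings.items() if p not in critical)
difference = critical_mean - other_mean

print(round(critical_mean, 2), round(other_mean, 2), round(difference, 2))
```

The difference between the two means comes to about 11.5, which rounds to the 12 points reported in the text.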

The third column in Table 1 lists the means 
for the absolute difference between a subject’s 
actual heart rate for a specific nude and his 
average actual heart rate for the six nudes 
where heart rate was constant (i.e., the base- 
line level). It is clear from this column that 
the absolute deviation in the subjects’ heart 
rates was significantly greater for the fourth 
and seventh nudes than for the other nudes 
(t = 3.70).³ This significantly greater departure

³ This t test does not include Nude 1. At the start
of the experiment, it was expected that the subjects’
heart rates would be abnormally high as a function
of entering a novel situation, being introduced to a
strange task, etc., and hence would deviate markedly
from normal levels. Therefore, heart rates on the
first presentation were a priori excluded from analy-
sis. Heart rates were in fact abnormally high for the
first nude, but they slowed quickly on subsequent
nudes. It is interesting to note that additional evi-
dence indicates that heart rate while viewing Nude 1
was primarily determined by the initial novelty of
the situation. On Nude 1, the subjects heard a con-
stant heart rate, but nonetheless their actual heart
rate deviated a great deal from the base-line level.
But this actual arousal did not result in a higher
rating for Nude 1. This finding could be interpreted
as indicating that the nonveridical cognitive cue that
the subjects were not aroused took precedence over
their actual state of arousal in determining their emo-
tionality ratings (i.e., how much they liked the
nude). However, a more plausible interpretation is
that the subjects’ actual arousal state in fact took
precedence over the constant false heart rate feed-
back, but this did not produce higher ratings of
Nude 1 because the prepotent cognitive explanation

from the base-line level indicates that
relative to the random changes in heart rate 
that occurred from one nonspeedup nude to 
another, there was a significant change in 
heart rate upon switching from a non- 
speedup to a speedup nude. It is also clear 
from this column that habituation of heart 
rate between Nudes 2 and 8 was negligible—
actual heart rate deviations for the constant
heard heart rate nudes were the same at the
beginning of the sequence (Nudes 2 and 3) 
as it was at the end of the sequence (Nudes 
5, 6, and 8). It seems that habituation effects 
occurred only during the beginning of the 
first presentation. Apparently, then, the heart 
rate results were free of any artifact due to 
habituation, and hence it is clear there was a 
“mimic effect.” When the subjects heard their 
heart rate speed up, their actual heart rate 
deviated significantly more from the base line 
than it did when they heard no speedup. 
However, the question still remains as to 
whether there was a relationship between the 
nude rating and the mimic effect (i.e., actual 
changes in heart rate). So far it has been 
established that a speedup in heard heart rate 
leads to a change in real heart rate and also 
leads to higher nude ratings, but it has not 
been shown that change and higher nude 
ratings are themselves related. The data indi- 
cated that these two variables were only 
weakly related. Chi-square analyses using 
various criteria for large/small changes in 
heart rate and high/low nude ratings showed 
that there was a consistent but nonsignificant 
relationship. Subjects who experienced a 
small change in heart rate were split 50/50 
in terms of high/low ratings of the nude, 
while for large changes in heart rate, the 
number of subjects rating the nude high was 
double the number rating it low (i.e., a
2/3-1/3 split). Moreover, linear correlations 
between actual heart rate change and nude 
ratings tended to be small (rs around .15) 


for this arousal state was the entering of a strange
and novel situation, not the nude they were viewing.
Hence, these subjects probably knew they were
aroused and knew why (the experience of coming
into something strange) and knew that it was not
because of the nude they were seeing. Given this, we
would not expect the nude rating to correspond to
the subjects’ actual state of arousal (see Schachter &
Singer’s, 1962, Proposition 2, p. 382).
TABLE 2

MEAN RATINGS OF OFFENSIVENESS AS A FUNCTION OF
SUBJECTS’ EXPERIENCE DURING THE SECOND
PART OF THE EXPERIMENT

                                              Heard heart rate
Offensiveness of stimuli            Speedup on    No speedup    Control (no
                                    last four     on last four  heart rate
                                    nudes         nudes         feedback)

Offensive (exposed to male nudes)   8.15          7.80          7.30
Inoffensive (exposed to female
  nudes only)                       1.70          1.60          2.00


and nonsignificant. Hence, it appears that the 
powerful effect observed in the first part of 
the experiment, demonstrating the strong in- 
fluence of heard false heart rate upon reports 
of emotionality (nude ratings), could not 
have been produced solely through the 
intermediary of actual physiological arousal; 
the relationship between actual physiological 
arousal and nude ratings was just not that 
pronounced. 
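
To give a sense of how weak a correlation of this size is, the squared correlation (the proportion of variance shared by two linearly related variables) can be computed. The value .15 is the approximate figure given in the text; the rest is standard arithmetic.

```python
r = 0.15                  # approximate correlation reported in the text
shared_variance = r ** 2  # proportion of variance in common

print(f"r = {r:.2f}, r squared = {shared_variance:.4f} "
      f"({shared_variance:.1%} of the variance)")
```

A correlation of about .15 corresponds to roughly 2% shared variance between heart rate change and nude ratings.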

This, then, leaves us with two possibilities. 
Changes in heart rate may have only been 
by-products of false heart rate feedback, 
tangentially related to emotionality,⁴ or these
changes may, even though they were only
weakly related to nude ratings, play a crucial
mediating role. The results from the second 
part of the experiment shed more light on 
this question. 

Remember that in the second part of the 
experiment there were six groups formed by 
two levels of offensiveness—offensive (male 
nudes) and inoffensive (Playboy nudes)—and 
three levels of heart rate feedback—speedup, 
no speedup, and control (no feedback). 
Table 2 summarizes the subjects’ answers to 
Question 5 of the posttest questionnaire 
about how offended they were by the experi- 
ment. The results show that subjects shown 


⁴ Even if there were a significant relationship be-
tween emotionality (nude ratings) and actual heart
rate, the relationship could just as well be a parallel 
as a dependent one. That is, a strong relationship be- 
tween ratings and actual heart rate would not neces- 
sarily mean that ratings depended upon (i.e., were a 
function of) actual heart rate. Obviously, some un- 
specified third variable could cause a simultaneous 
variation in both heart rate and reported emotion-
ality (i.e., a parallel relationship).
male nudes were indeed more offended than
were subjects shown only female nudes
(F = 47.10, p < .001).

It also can be seen from Table 2 that in
the offensive condition there was only a
negligible difference in the amount of emotion
felt by the subjects who heard their heart
rate speed up and by subjects who heard no
change in their heart rate. Table 3 shows
this effect dramatically. There was no
observed effect of a speedup in heard heart rate
on the ratings of offensiveness by subjects
in the offensive condition, with the controls,
of course, not being considered here.
(Subjects reporting under 11 on the 1-21 scale
were considered as not offended, while those
reporting 11 or above were considered as
offended; this also represented the empirical
median.)
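
The dichotomy described in the parenthetical above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the cutoff is read as 11 (the midpoint of the 1-21 scale, consistent with "under 11" counting as not offended), and the function names and the sample ratings are hypothetical, not taken from the study.

```python
# Assumed cutoff: ratings of 11 or above on the 1-21 scale count as "offended".
OFFENDED_CUTOFF = 11


def classify_offended(rating: int) -> str:
    """Dichotomize a single 1-21 offensiveness rating at the cutoff."""
    if not 1 <= rating <= 21:
        raise ValueError(f"rating {rating} is outside the 1-21 scale")
    return "offended" if rating >= OFFENDED_CUTOFF else "not offended"


def tally(ratings):
    """Count how many ratings fall on each side of the cutoff."""
    counts = {"not offended": 0, "offended": 0}
    for rating in ratings:
        counts[classify_offended(rating)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical ratings, for illustration only.
    print(tally([3, 10, 11, 17, 21]))
```

Counts produced this way are what fill the two columns of a table such as Table 3, one row per feedback condition.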

It is apparent, then, that “heard” heart
rate differences had no effect on offensiveness
ratings, but what about the relationship
between actual heart rate and ratings of
offensiveness? Again, only those groups were
considered who saw male nudes and heard
their heart rates. In Table 4, we observe
that there was a significant relationship
between the extent of change in actual heart
rate and ratings of offensiveness. Here
the criterion for a change in heart rate
was a deviation from the mean heart rate
for the first four nudes (female) of more
than one beat in 30 units of polygraph paper
(the median deviation), while the criterion
for being offended was the same as it was
above. It is clear that there was a substantial
relationship between changes in actual heart
rate and ratings of offensiveness.

But what about the fact that for the
subjects in the offensive condition there was no
significant relationship observed between their


TABLE 3

OFFENSIVE-CONDITION SUBJECTS WHO FELT OFFENDED
AS A FUNCTION OF HEARD HEART RATE

                              Reported offensiveness
Condition                     Not offended    Offended
No speed up                        5              5
Heard heart rate speed up          7              6

Note.—χ² = 1.00, df = 1.
heard heart rate and their ratings of
offensiveness (Tables 2 and 3)? Such a
relationship was observed between heard heart rate
and nude ratings in the first part of the
experiment (Table 1). Why doesn’t the effect
work here?

To answer this question, we looked at the 
mimic effect in the second part of the experi- 
ment. Table 5 shows the mean absolute dif- 
ference between actual heart rate during the 
presentation of the first four nudes and actual 
heart rate during the presentation of the 
second four nudes (all males in the offensive 
condition, all females in the inoffensive condi- 
tion) in the speedup and no-speedup false 
heart rate conditions. The latter condition 
provides the appropriate heart rate change 
base line necessary to assess the mimic effect. 

Table 5 shows clearly that there was no 
significant difference in actual heart rate be- 
tween subjects in the speedup and no- 
speedup conditions who saw male nudes
(offensive condition). The right two cells of
Table 5 represent subjects in the speedup 
and no-speedup conditions who saw only fe- 
male nudes (inoffensive condition) and indi- 
cate results analogous to those observed in 
the first part of the experiment where only 
female nudes were viewed (see Table 1). 
Here, the mimic effect is at least detectable. 
Finally, Table 6 indicates that actual heart 
rate was independent of heard “heart rate” 
and moreover, as the splits within cells in 
Table 4 show, the effect of actual heart rate 
on rated offensiveness was independent of 
whether the subjects were in the speedup 
(higher heard heart rate) or in the no- 


TABLE 4

OFFENSIVE-CONDITION SUBJECTS WHO FELT OFFENDED
AS A FUNCTION OF ACTUAL HEART RATE
(I.E., ACTUAL PHYSIOLOGICAL AROUSAL)

                            Reported offensiveness(a)
Actual heart rate (PAa)     Not offended(b)      Offended
Constant                    [illegible]          2 speed up
                            [illegible]          1 no speed up
Changed                     2 speed up           4 speed up
                            0 no speed up        4 no speed up

Note.—The splits within cells indicate whether a subject’s
heard heart rate speeded up or was constant.
a χ² = 3.94 (corrected for continuity), df = 1, p < .05.
b Actual heart rate data were not available for two of these
subjects.


TABLE 5

MEAN ABSOLUTE DIFFERENCE BETWEEN ACTUAL
HEART RATE (BEATS PER 30 SECONDS) FOR FIRST
FOUR NUDES AND ACTUAL HEART RATE FOR
SECOND FOUR NUDES FOR SUBJECTS WHO
RECEIVED FALSE HEART RATE FEEDBACK

                               Offensive        Inoffensive(a)
Heart rate                     (male nudes)     (female nudes)
No speed up                        2.3               1.2
Heard “heart rate” speed up        3.0               2.7

a t = 1.50, p < .20.


speedup (no change in heard heart rate)
condition.

To summarize our results from the second
part of the experiment: In the inoffensive
condition, there was a weak mimic effect but
no relationship between heard heart rate and
emotionality (bottom row of Table 2)⁵; in the
offensive condition, there was no mimic effect
and no relationship between heard heart
rate and emotionality; however, in the offensive
condition there was a significant relationship
between actual physiological arousal
and ratings of emotionality, while in the
inoffensive condition there was no such relationship,
primarily because no emotionality
(offensiveness ratings) was reported.


5 Although there was a weak mimic effect in the
inoffensive condition of the second part of the
experiment, no concomitant increase in reported
emotionality was demonstrated. This was undoubtedly
due to the particular measure used in the second part
of the experiment. Subjects experiencing a mimic
arousal while viewing female nudes would not be
expected to give higher ratings on an offensiveness
scale. On the contrary, these subjects may have felt
“turned on” rather than “turned off” or offended by
the nudes. However, the offensiveness question did
not give these mimic-arousal subjects the opportunity
to express their greater degree of turned on
emotionality. Instead, they were asked if they were
“turned off,” and, given their mimic arousal and
female nudes as an external cognitive referent to
explain their arousal, they knew they were not.


TABLE 6

ACTUAL HEART RATE OF SUBJECTS WHO WERE
EXPOSED TO MALE NUDES AS A FUNCTION
OF HEARD “HEART RATE”

                               Actual heart rate (PAa)
Heart rate                     Changed      Constant
No speed up                       5            4
Heard “heart rate” speed up       6        [illegible]

Note.—χ² = 1.00, df = 1.

The results from the two parts of the 
experiment enable us to compare situations 
where there is a mimic effect and where there
is not. When there was a strong mimic effect, 
as in the first part of the experiment, there 
was a pronounced relationship between heard 
heart rate and emotionality (nude ratings) — 
a “Valins” effect. When there was no mimic 
effect, as in the offensive condition in the 
second part of the experiment, there was no 
relationship between heard heart rate and 
emotionality, although there was a strong 
relationship between actual heart rate change 
and emotionality. 

We can explain our results consistently if 
we make an assumption about the types of 
situations that should have a marked emo- 
tional impact on today’s college male subject 
population. For such a subject population, 
viewing Playboy nudes just may not be an 
emotionally provocative experience. Some of 
our subjects recognized the girls in the pic- 
tures and could accurately guess what issues 
they appeared in, even though the pictures
were from Playboy issues up to 5 years old.
One subject had dated one of the playmates
he saw.


Viewing male nudes, on the other hand, 
probably is an emotionally provocative experi- 
ence for such a subject population. A number 
of the subjects reported that they were 
“grossed out” by the male nudes, and, overall, 
subjects who had viewed male nudes in the 
experiment reported being offended by the 
experiment significantly more than subjects 
who had viewed only female nudes. 

The difference in results between the first 
and second parts of the experiment may
be due to this difference in the
emotion-provoking potential of the two situations. In
the first part of the experiment, the subjects
may not have experienced much emotion. In
the second part, they did.⁶


6 Our experimental design did not allow for
actual quantitative comparison of how emotional our 
subjects were in the two parts of the experiment, 
since entirely different dependent measures were 
taken (nude ratings in the first part, offensiveness 
ratings in the second part; nude ratings were not ob- 
If one accepts this assumption, the results
become clear-cut. In situations not
emotionally provocative, a Valins effect may occur
with the heart rate change mimic being an
incidental by-product. This is apparently
what happened in the first part of the
experiment. In such a cognitive, low-arousal
situation, a subject may rationally evaluate
cognitive information to reach a cognitive
conclusion (see Footnote 5), but this process
may have nothing to do with experienced
emotion. However, in situations that are
emotionally provocative, no mimic effect is
observed, no Valins effect is observed, and
a direct relationship between actual heart rate
and emotionality is observed.

Some experimental validation for these
notions exists. Valins and Ray (1967)
desensitized snake-phobic subjects to live boa
constrictors by having the subjects hear no
change in their heart rate while viewing the
snakes. In this clearly emotionally provocative
situation, the subjects based their actions on
cognitive input. But they did not experience
changes in emotion based on cognitive input.
On the contrary, those subjects who heard no
change in their heart rate did not differ from
controls on their subsequent self-reported
fear of snakes. Valins and Ray could not
explain this result. It was a case where there
was a strong emotion and no correspondence
between heard heart rate and reported
emotionality (i.e., no Valins effect).
Frankenhaeuser (1967), on the other hand, 
has investigated what appears to us to be a 
non-emotion-provoking situation. She injected 
subjects with either a “stimulant” or a 
“depressant,” explaining what the symptoms 
would be and stressing that the injections
would be harmless. She really injected all of
her subjects with a placebo. Those subjects
who had been told that they had been injected 
with a stimulant showed an increase in physi- 
ological arousal (including heart rate) and 


tained in the second part because the same subjects 
were used in both parts, and the rationale for ob- 
taining the original nude ratings did not hold in the 
second part). However, most of the subjects seemed 
objective and unemotional regarding the first part 
of the experiment, while in the offensive condition of 
the second part they often reported having been
shocked, uncomfortable, or “grossed out.”
increases in a number of behavioral measures
of arousal and emotion. Those subjects who
thought they had been injected with a
depressant showed exactly the opposite
physiological and behavioral effects. Frankenhaeuser’s
experiment, then, showed that in a non- 
emotion-provoking situation, a mimic effect— 
a change in actual physiological arousal result- 
ing from a change in false cognitions about 
arousal—can co-exist with a Valins effect, a 
change in rated emotionality that corresponds 
with a change in false cognitions about arousal. 

On the basis of our results, we submit that 
in the case of very provocative emotional 
situations, physiological arousal does not serve 
just as a cognitive cue, but also functions as 
an essentially unconscious mechanism intrinsic 
to emotional experience. Of course, an indi- 
vidual does learn to associate his perception 
of arousal with his emotionality, and so in a 
situation that is not emotional, if he is given 
a false perception of arousal, that ma- 
nipulation may lead him cognitively to make 
discriminative judgments that would be 
operationally defined as relative ratings of 
“emotion.” However, in emotionally provoca- 
tive situations, as in the second part of our 
experiment, if actual physiological arousal 
and the nonveridical cue give contradictory 
messages, the actual level of physiological 
arousal determines the extent of reported emo- 
tionality, while the purely cognitive cue about 
physiological arousal becomes largely
irrelevant to whether or not emotion is experienced.

In summary, it is proposed that the fol- 
lowing schema more adequately explains what 
happened in the present experiment than does 
Valins’ hypothesis. In a nonemotional situa- 
tion, the subject may use information about 
his physiological arousal as a purely cognitive 
cue in formulating discriminative judgments 
about objects he has observed. In such a 
situation, a mimic effect may be observed. If 
so, some correlation between actual physio- 
logical arousal and reported emotionality may 
be observed, but these reports may not be 
related in any causal way to actual physio- 
logical arousal. In an emotional situation, 
physiological arousal serves as much more 
than a cognitive cue, since, as we have 
shown, false cognitive cues contradicting one’s
state of actual physiological arousal are
overridden in determining one’s reported
emotional state. In highly emotional situations,
there is no physiological mimic of nonveridical
cognitive cues (i.e., false heart rate
feedback), and these false cues have little effect
on reported emotion, but rather actual
physiological arousal determines largely the level of
reported emotion.

Thus, in the case of potent emotional
situations, it is not the cognition of physio- 
logical arousal that is crucial in mediating 
emotionality; rather, it is actual physiological 
arousal. 


OPINION CHANGE AS A FUNCTION OF WHEN INFORMATION
ABOUT THE COMMUNICATOR IS RECEIVED AND
WHETHER HE IS ATTRACTIVE OR EXPERT¹


JUDSON MILLS² AND JOHN HARVEY


University of Missouri 


An experiment tested the hypothesis that whether the audience receives information 
about the communicator before or after the communication will have less effect on 
its persuasiveness when the communicator is attractive than when he is expert.
College women read the same communication under four conditions; information 
indicating that the communicator was either attractive or expert was presented 
either at the beginning or at the end. In confirmation of the hypothesis, it was found 
that agreement with the expert communicator was lower when the information 
about him was given after the communication than when it was given before, but 
agreement with the attractive communicator was just as great when the
information about him was given after as when it was given before.


Kelman (1961) has made a distinction be- 
tween opinion change based on identification 
and on internalization. According to Kelman, 


Identification can be said to occur when an individual 
adopts behavior derived from another person or a group 
because this behavior is associated with a satisfying self- 
defining relationship to this person or group. ... 
Identification is similar to compliance in that the
individual does not adopt the induced behavior because its
content per se is intrinsically satisfying. Identification
differs from compliance, however, in that the individual
actually believes in the opinions and actions he adopts.
. . . Finally, internalization can be said to occur when
an individual accepts influence because the induced be- 
havior is congruent with his value system. It is the con- 
tent of the induced behavior that is intrinsically re- 
warding here [pp. 63-65]. 


Kelman (1961) proposed that: 


To the extent that the agent’s power is based on his 
attractiveness, influence will tend to take the form of 
identification. . . . To the extent that the agent's 
power is based on his credibility, influence will tend to 
take the form of internalization. An agent possesses 
credibility if his statements are considered truthful and 
valid, and hence worthy of serious consideration [p. 68]. 


From these conceptions it might be expected 
that if the communicator is very attractive, 
when the audience receives information about 
him may not make as much difference as it 
would if the communicator were considered an 


1 This study was supported by a grant from the Na- 
tional Science Foundation. It was conducted while the 
second author held a National Defense Education Act 
predoctoral fellowship. 

2 Requests for reprints should be sent to Judson Mills,
who is now at the Department of Psychology,
University of Maryland, College Park, Maryland 20742.
expert on the topic. If agreement with the
attractive communicator is based on identification,
then it may be relatively unimportant
whether or not the audience has accepted the
specific arguments of the communication.
However, if agreement with an expert communicator
is based on internalization, the reduction
in acceptance of the content of the
communication that may occur if the audience is not
informed about the communicator beforehand
may materially decrease his effectiveness.

An experiment was designed to test the
hypothesis that whether the audience receives
information about the communicator before
or after receiving the communication will have
less effect on the communication’s persuasiveness
when the communicator is attractive than
when he is expert.


METHOD 


Female college students read a speech favoring general
education under four experimental conditions: attrac- 
tive before, attractive after, expert before, and expert 
after. In the before conditions, a description and photo- 
graph of the communicator were presented at the be- 
ginning of the speech. In the after conditions, the de- 
scription and photograph were presented at the end of 
the speech. In the attractive conditions, the communi- 


photograph showed him as rather unattractive.
Agreement with the communicator’s position was measured
after the subjects had rated his personality
characteristics and completed a test of memory for what they had
read. 

The subjects were 72 college women who participated 
in order to earn extra credit toward their course grade 
in general psychology. They were run during three 
sessions. Within each session, the subjects were ran- 
domly assigned to the experimental conditions with as 
near an equal number as possible in each condition. 
There were 18 subjects in each of the four conditions. 
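
The within-session assignment described above can be illustrated with a short Python sketch. It is not the authors' procedure, only one plausible way to assign subjects at random while keeping the four cells as nearly equal as possible; the function names, the session size, and the seed are hypothetical.

```python
import random
from collections import Counter

CONDITIONS = [
    "attractive-before",
    "attractive-after",
    "expert-before",
    "expert-after",
]


def assign_session(subject_ids, rng):
    """Randomly assign one session's subjects to the four conditions,
    keeping cell sizes within one subject of each other."""
    base, extra = divmod(len(subject_ids), len(CONDITIONS))
    # If the session size is not a multiple of four, the conditions that
    # receive one extra subject are themselves chosen at random.
    bonus = set(rng.sample(CONDITIONS, extra))
    labels = []
    for condition in CONDITIONS:
        labels.extend([condition] * (base + (condition in bonus)))
    rng.shuffle(labels)
    return dict(zip(subject_ids, labels))


def condition_sizes(assignment):
    """Number of subjects per condition in an assignment."""
    return Counter(assignment.values())


if __name__ == "__main__":
    rng = random.Random(1972)  # hypothetical seed, for reproducibility only
    session = assign_session(list(range(24)), rng)  # hypothetical session of 24
    print(condition_sizes(session))
```

Balancing within each session does not by itself guarantee exactly 18 subjects per cell over three sessions; that also depends on the session sizes, which the text does not report.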

At the beginning of each session, the experimenter 
told the subjects that the study concerned impression 
formation. Each of them would read a speech and then 
rate the personality characteristics of the person who 
delivered it. Booklets were distributed containing a 
fictitious speech arguing that every college student 
should receive a broad, general education. 

In the before conditions, a page containing a descrip- 
tion and photograph of the communicator was placed 
before the text of the speech. In the after conditions, the 
page containing the description and photograph of the 
communicator was placed after the text of the speech. 

In the attractive conditions, the material describing 
the communicator was as follows: 


This speech was given by a sophomore majoring 
in education at a large Midwestern state univers- 
ity. He was elected vice president of his freshman 
class by popular vote. A photograph of him ap- 
pears below. 


Printed on the page was a 1 × 2 inch photograph of an
attractive, smiling, young man.

In the expert conditions, the material describing the 
communicator was as follows: 


This speech was given by a professor of education 
at a large Midwestern state university. He is one 
of the country’s leading experts on the problems 
of higher education. A photograph of him appears
below.


Printed on the page was a 1 × 2 inch photograph of an
unattractive, frowning, middle-aged man. 

When the subjects had finished reading the speech, 
the speech booklets were collected, and each subject was 
given a rating form with the same code number as their 
speech booklet. The subjects were asked to indicate how
well various characteristics applied to the communicator
by circling a number on a scale from 0 (extremely
inappropriate) to 20 (extremely appropriate) for each
characteristic. The characteristics were attractive,
competent, cold, earnest, expert, friendly, frank,
knowledgeable, likable, sincere, and unpleasant. The subjects
were told that their ratings would be kept completely
confidential and were not asked to put their names on
the rating forms.

After the subjects had completed the rating forms, the
experimenter told them that there was more to the 
study than he had mentioned at the beginning. He said 
that another purpose was to study the relationship be- 
tween impressions of a person and memory for what the 
person had said. He was now going to pass out a memory 
questionnaire to determine how well they could recall 
exactly what the communicator said in the speech. He 
explained that he could not reveal this earlier because,
if he had, they might have made a special effort to 
memorize the speech. The subjects were warned that 
the memory test was difficult. 

The rating forms were collected, and each subject 
was given a booklet for the memory test with the same 
code number as their speech booklet and rating form. 
The subjects were not asked to put their names on the 
memory test booklet. The first section of the booklet 
contained 20 multiple-choice questions with three 
alternatives each. The subjects were instructed to 
choose the word or phrase that completed each state- 
ment in the way which was closest to what was said in 
the speech. 

After all of the memory test booklets were distrib- 
uted, the experimenter interrupted the subjects and 
said he should have mentioned that the last page of the 
booklet was included to get their personal reactions to 
some statements in order to see if there was a
relationship between personal reactions and memory. Some of
the subjects invariably turned to the last page, and the
experimenter immediately cautioned everyone not to
look at the last page until they had answered all of
the memory items, and not to return to the memory
items once they had begun to give their personal
reactions. The last page contained 10 Likert-type items
concerning general versus specialized education, with
seven alternatives from “strongly agree” to “strongly
disagree.” Six of the items were in favor of general
education, and the other four were opposed to general
education. A measure of favorability to general education
was calculated by assigning scores from +3 to −3 to
the alternatives for each item and then summing over
the 10 items; the more positive the score, the greater
the favorability to general education.
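
The favorability score can be sketched as follows. The sketch assumes that each response is coded from +3 (strongly agree) to −3 (strongly disagree) and that the four items opposed to general education are reverse-keyed before summing, which is what the statement that more positive scores mean greater favorability implies. Which item positions were favorable is not reported, so `PRO_ITEMS` is purely hypothetical.

```python
# Hypothetical item positions (0-based) of the six items that favor
# general education; the remaining four are treated as opposed.
PRO_ITEMS = frozenset({0, 1, 2, 3, 4, 5})
N_ITEMS = 10


def favorability(responses, pro_items=PRO_ITEMS):
    """Sum ten agreement codes (+3 strongly agree ... -3 strongly disagree)
    into a favorability score, reverse-keying the opposed items."""
    if len(responses) != N_ITEMS:
        raise ValueError(f"expected {N_ITEMS} responses, got {len(responses)}")
    total = 0
    for index, code in enumerate(responses):
        if not -3 <= code <= 3:
            raise ValueError(f"response {code} is outside the -3..+3 range")
        # Agreement with a favorable item and disagreement with an
        # opposed item both raise the score.
        total += code if index in pro_items else -code
    return total


if __name__ == "__main__":
    # A respondent maximally favorable to general education scores +30.
    print(favorability([3, 3, 3, 3, 3, 3, -3, -3, -3, -3]))
```

Under this coding the possible scores run from −30 to +30, which gives a scale against which cell means such as those in Table 3 can be read.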

After the subjects had answered all of the items in the
booklet, the experimenter asked them to write a few
sentences on the back describing their reactions to the
study. He told them that they could say anything they
wanted about the study. None of the subjects made
comments indicating suspicion about the procedure.
Finally, before dismissing the subjects, the experimenter 
explained the necessity of their not discussing the 
experiment with anyone. 


RESULTS AND DISCUSSION 


It was assumed that the communicator 
would be perceived as more attractive in the 
attractive conditions than in the expert condi- 


TABLE 1
MEANS FOR THE RATINGS FOR ATTRACTIVE

                  When information about
                  communicator was received
Communicator      Before        After
Attractive         15.2         15.7
Expert              8.9         [illegible]

Note.—N = 18 for each cell.


TABLE 2
MEANS FOR THE RATINGS FOR EXPERT

                  When information about
                  communicator was received
Communicator      Before        After
Attractive         11.5          9.0
Expert             14.0         15.8

Note.—N = 18 for each cell.


tions. The means for the ratings for the adjec- 
tive “attractive” for the four experimental 
conditions are presented in Table 1. It can be 
seen from Table 1 that ratings of the communi- 
cator’s attractiveness were higher in the attrac- 
tive conditions than in the expert conditions. 
An analysis of variance showed that the main 
effect for the attractive-expert variable 
was highly significant (F = 48.32, df = 1/68,
p < .001). The main effects for the before-
after variable and the interaction were not 
significant. 

It was also assumed that the communicator
would be perceived as more expert in the
expert conditions than in the attractive
conditions. The means for the ratings for the
adjective “expert” for the four experimental
conditions appear in Table 2. As can be seen from
Table 2, ratings of the communicator’s expert- 
ness were higher in the expert conditions than 
in the attractive conditions. An analysis of 
variance showed that the main effect for the 
attractive-expert variable was highly signifi- 
cant (F = 19.84, df = 1/68, p < .001). The 
main effect for the before-after variable was 
not significant. The interaction between the 
attractive-expert variable and the before-after 
variable was significant (F = 6.60, df = 1/68, 
p < .025). The difference in the ratings for
“expert” between the attractive and expert 
conditions was greater for the after conditions 
than for the before conditions. Although this 
interaction was not predicted, it is easily un- 
derstandable in terms of better recall of the 
information about the communicator in the 
after conditions. When it was received closer 
in time to the ratings, the information about 
the communicator had more effect on the 
ratings of his expertness. 

Further evidence that the communicator was
perceived as more expert in the expert
conditions than in the attractive conditions is
provided by the subjects’ ratings for the adjective
“knowledgeable.” The ratings for knowledgeable
were significantly higher for the expert
conditions than for the attractive conditions
(F = 4.93, df = 1/68, p < .05). The only
other significant effects for the ratings for the
adjectives were a significant interaction for
the ratings for “competent” and a significant
main effect of the before-after variable for the
ratings for “frank.” For the ratings for
competent, the difference between the attractive
and expert conditions was greater for the after
conditions than for the before conditions
(F = 4.46, df = 1/68, p < .05). This
interaction is similar to the interaction for the
ratings for expert and can be understood in
the same way. The ratings for frank were higher
for the before conditions than for the after
conditions (F = 4.60, df = 1/68, p < .05). The
subjects may have felt that the communicator’s
identity should have been revealed before the
communication. When it was not, they may
have held the communicator responsible, and
consequently perceived him as being less frank.
Before presenting the results for the mea- 
sure of agreement with the communicator’s 
position, it should be mentioned that there 
were no differences between the conditions in 
the number of correct answers on the memory 
test. An analysis of variance of the scores on 
the memory test did not yield any significant
effects. The subjects averaged approximately 
14 correct answers on the 20 memory items. 
It was hypothesized that whether the 
audience receives information about the 
communicator before or after receiving the 
communication will have less effect on the com- 
munication’s persuasiveness when the com- 
municator is attractive than when he is expert. 
From this hypothesis it would be expected that 
the difference in favorability to the communi- 
cator’s position between the attractive-before 
condition and the attractive-after condition 
would be less than the difference between the 
expert-before condition and the expert-after 
condition. The means for the measure of
favorability to general education for the four
experimental conditions are presented in Table 3. It
can be seen from Table 3 that while the mean 
for the expert-after condition was considerably 
lower than the mean for the expert-before
condition, the mean for the attractive-after
condition was actually slightly higher than the
mean for the attractive-before condition. An 
analysis of variance showed that the interaction 
between the attractive-expert variable and the 
before-after variable was significant beyond 
the .05 level (F = 4.20, df = 1/68). Neither of 
the main effects approached significance. 

In designing the study, an attempt was made 
to describe the attractive and expert communi- 
cators so that they would be about equally 
persuasive. As can be seen from Table 3, this at- 
tempt was successful. The mean for the at- 
tractive-before condition was very similar to 
the mean for the expert-before condition; the 
difference did not approach significance. How- 
ever, the difference between the mean for the 
attractive-after condition and the mean for 
the expert-after condition was significant
(t = 2.67, df = 1/34, p < .02). Looking at the
results from a different angle, the difference 
between the expert-before condition and ex- 
pert-after condition was significant (t = 2.60, 
df = 1/34, p < .02), while the difference be- 
tween the attractive-before condition and the 
attractive-after condition was not. 

The results provide good support for the 
hypothesis, based on Kelman’s (1961) dis- 


3 All significance tests are two-tailed. 


TABLE 3

MEANS FOR THE MEASURE OF AGREEMENT WITH
THE COMMUNICATOR’S POSITION

                  When information about
                  communicator was received
Communicator      Before        After
Attractive          8.2          9.3
Expert              9.5          1.9

Note.—The higher the scores, the greater the agreement
with the communicator’s position. N = 18 for each cell.


tinction between identification and internaliza- 
tion, that whether the audience receives in- 
formation about the communicator before or 
after the communication will have less effect 
on the communication’s persuasiveness when 
the communicator is attractive than when he is 
expert. They demonstrate that the processes
which produce agreement with an attractive
communicator are not identical to the processes
which produce agreement with an expert
communicator.
Four degrees of deception were used in a partial replication of the Lerner and
Simmons study of observers’ reactions to an “innocent victim.” All of the 158 college 
student subjects watched the videotape sequence of a girl receiving electric shocks 
that was used in that study. Half of the subjects were told that the session was live 
and that they would watch the girl go through a second session (midpoint
condition); the other half were told that they were watching a tape of a complete session
carried out earlier (past-event condition). It was predicted and found that only 
subjects completely deceived as to both the purpose of the study (to measure re- 
actions to victims) and the method (that the “victim” was only pretending to 
receive shocks) would reject the victim more in the midpoint as compared to the 
past-event condition. Implications of this study both for the use of deception and 
for theory in the area of reactions to victims are discussed. 


Increasingly in recent years, ethical and 
methodological objections to the use of decep- 
tion in psychological experimentation have 
been raised. One of these objections, the purely 
ethical position of Kelman (1967), that it is 
just plain wrong to use deception, is a matter 
of values not amenable to proof or disproof. 
Most of the other objections that have been 
raised, however, could be put to empirical test. 

For example, Baumrind (1964) maintained
that the deceptions in Milgram’s (1964) obedience
study lowered subjects’ self-esteem and
their trust in experimenters. Vinacke (1954)
suggested that deceived subjects were harmed
with respect to both their own well-being and
their conceptions of the psychological laboratory.
Kelman maintained that deception leads
to subsequent loss of naïveté. The few studies
done to date in response to criticisms of this 
sort indicate that under some conditions, de- 
ception does not greatly affect the subject with 
respect either to his own “mental health” 
(MacKinney, 1955; Milgram, 1964) or to his 
usefulness in subsequent experiments (Brock 
& Becker, 1966; Fillenbaum, 1966; Stricker 
& Messick, 1967). 

Asking subjects to play a role has been sug- 
gested as a possible alternative to deception 


1 This study presents, in revised form, data from a 
senior honor’s thesis written by the first author and 
submitted to the Department of Psychology, University 
of Pennsylvania, May 1968. We would like to thank 
the department for supporting the research financially. 

* Requests for reprints should be sent to Jane Allyn 
Piliavin, Home Management House, University of 
Wisconsin, Madison, Wisconsin 53706.
by Kelman (1967). Two studies indicate that
responses thus obtained differ in several ways
from responses under conditions of deception
(Deutsch, 1960) or hypnosis (Rosenberg,
1960). Freedman (1969) has recently made the
very strong suggestion that role-playing studies
are of no use whatsoever. However, Greenberg
(1967) found that role playing produced the 
same general results as deception, in the context 
of Schachter’s (1959) experiment on the rela- 
tionship between anxiety level, birth order, and 
affiliative tendencies. He suggested that the 
success of the technique may depend on the 
subjects’ ability to empathize and on their 
degree of active involvement in the role. 

The studies cited above were all designed to 
replicate, with nondeceived subjects, work 
initially done using deception. Role
playing, or responding “as if,” has, of course,
been used as a technique in studies initially
designed for it (e.g., Rosenberg & Abelson, 
1960). Furthermore, “role playing by com- 
puter,” or simulation, is a growing practice 
(e.g., Guetzkow, Alger, Brody, Noel, & Snyder, 
1963). It is clear that when the experimental 
situation is not one in which the subject would 
know what response would be the “good” re- 
sponse, or when the responses under scrutiny 
are not under cognitive control, role playing 
will be a valid method. It is only under circum- 
stances in which the subject can choose to 
“make a good impression” that role playing 
may well not lead to valid results. It is for the 
study of such problems that the deception 
technique was developed. 
The only argument in favor of deception is,
of course, the pragmatic one that certain ques- 
tions cannot be answered without the use of 
deception—that informed subjects behave in 
ways quite different from those who are de- 
ceived. In 1954, Vinacke suggested that this 
assertion could be checked by experiments 
comparing groups of deceived and nondeceived 
subjects to see whether results were indeed 
different. Except when role playing has been 
used as an additional technique, such a com- 
parison has been made only once, by Hornbeck 
(1967). In a study involving the writing of 
counterattitudinal essays, he found no reliable 
differences in attitude change between deceived 
and nondeceived junior high students. However,
the relationships between “performance-variable
clusters” (e.g., such things as quality
of essay) and certain attitude change variables 
did depend in part on level of deception. 

The present study was designed to compare 
groups of subjects who are deceived to various 
degrees with both minimally role-playing and 
non-role-playing informed subjects. The pur- 
pose of the study was to discover to what extent 
a finding obtained under deception can be obtained
with subjects “undeceived” to varying
degrees. No real test of any alternatives to de- 
ception was attempted. The context of the 
study was that used by Lerner and Simmons 
(1966) in their study of reactions to a victim 
of misfortune. These authors hypothesized that 
people have a need to believe that the world is 
basically just and that events occur for some 
good reason. Therefore, if they cannot attribute 
the misfortunes of others to any obvious action 
on the part of those others, they will have to 
assume that something bad within the victims 
is responsible. They will consequently have a 
lower opinion of them than they would of the 
same people not involved in misfortune. Two 
additional conditions are necessary: the ob- 
server must believe that the suffering will con- 
tinue and that he is powerless to help the 
victims. 

To test this hypothesis, Lerner and Simmons 
showed subjects a videotape of a supposedly 
severely shocked student (the victim). After 
viewing the tape, the subjects were given differ- 
ent explanations of, and amounts of control in, 
the future treatment of the victim. Those who 
believed they had changed her from a continuing
shock to a future “reward” condition
(reward group) or who believed that the task 
was over (end point group) or was conducted 
sometime in the past (past-event group) rated 
the victim more favorably than did those who 
believed she would continue to be in the shock 
condition (midpoint, reward-decision, and 
martyr groups). 

Whether these results actually support the 
“just world” hypothesis has been questioned
(Piliavin, Hardyck, & Vadum, 1967). However, 
the design is useful for comparing deceived and 
nondeceived groups. In the present study, two 
of the Lerner-Simmons conditions were used, 
the “past-event” and the “midpoint.” The 
effects of each of four deception conditions are 
studied within each group, yielding eight ex- 
perimental groups. 

The degrees of deception were as follows. 
Two groups (deception) were deceived as to 
the purpose and the method of the experiment. 
Two groups (partial truth) were deceived as to 
method, but were informed as to purpose. Two 
more groups (complete truth) were informed 
as to purpose and method. The final groups
(role-playing) were informed as to purpose and 
method, but were asked to react in the way 
they thought that subjects in the “deception” 
group probably would (a minimal role-playing 
instruction). 
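
The 2 × 4 design just described can be enumerated in a few lines of Python. This is only an illustrative sketch: the field names (knows_purpose, knows_method, role_play) are labels assumed here for clarity and do not appear in the original experimental materials.

```python
from itertools import product

# The two Lerner-Simmons timing conditions retained in the present study.
TIMING = ("past-event", "midpoint")

# What each deception level tells the subject. Field names are hypothetical
# labels chosen for this sketch, not terms from the original booklets.
DECEPTION_LEVELS = {
    "deception":      {"knows_purpose": False, "knows_method": False, "role_play": False},
    "partial truth":  {"knows_purpose": True,  "knows_method": False, "role_play": False},
    "complete truth": {"knows_purpose": True,  "knows_method": True,  "role_play": False},
    "role playing":   {"knows_purpose": True,  "knows_method": True,  "role_play": True},
}


def experimental_groups():
    """Cross the two timing conditions with the four deception levels."""
    return [(timing, level) for timing, level in product(TIMING, DECEPTION_LEVELS)]


groups = experimental_groups()
print(len(groups))  # 8 experimental groups
```

Only the "deception" level leaves the subject uninformed about both purpose and method, which is the cell in which the hypothesis predicts the devaluation effect.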

The following is the specific hypothesis in- 
vestigated in this study: The effect found by 
Lerner and Simmons, namely, greater devalua- 
tion of the victim in the midpoint as compared 
to the end point condition, will be found only 
among deceived subjects, not among subjects 
in the partial- and complete-truth and role- 
playing groups. No conclusions can be drawn 
concerning which set of findings—those ob- 
tained under deception or those obtained with- 
out the use of deception—should be accepted 
as the valid findings, if there is the predicted 
discrepancy between them. 


METHOD 
Subjects 


Subjects were 158 male and female student volunteers 
from introductory psychology courses at the University 
of Pennsylvania. They were run in nine sessions of 19-21 
subjects each, with all eight experimental conditions 
run at each of the sessions. 


Apparatus 


The professional television facilities of the Annenberg 
School of Communications, including monitors, inter- 
com facilities, and videotape recorder, were employed 
to facilitate the illusion that the shock session being 
witnessed could have been live (as was required for the 
midpoint condition).³ The tape sequence used, a copy
of the Lerner-Simmons tape, begins as a lab-coated 
female experimenter enters a small room with a college- 
age female (the victim). The victim is then strapped 
into an elaborate piece of electrical apparatus and told 
that she will be shown a sequence of nonsense-syllable 
pairs. After the first run through the list, only the first 
word of each pair appears, and she must produce the 
associated syllable. If she errs, a buzzer sounds and a
shock is delivered. The session is then run as indicated. 
The victim (who is, of course, only acting and is not 
being shocked) puts on a rather convincing performance, 
including stiffening of the limbs, gasps of pain, and 
occasional pleading looks and requests to stop directed 
at the experimenter. The tape sequence lasts about 
10 minutes. 


Procedure 


Once the subjects had arrived and been seated, book- 
lets containing premeasures, group-defining instruc- 
tions, and postmeasures were handed out. Booklets 
containing the differential instructions for the eight 
different conditions had been shuffled together before 
the session, so assignment to conditions was random. 
The premeasure for all groups consisted of rating a 
“typical female college student” on the same 15 bipolar 
adjective scales used by Lerner and Simmons as their 
index of attractiveness. Next, the subjects read one of 
eight explanations of the experiment, which set up the 
eight experimental conditions. The explanations were 
close to the same length, and the format was the same, 
so as to reduce suspicion from the accidental glance. 

The manipulation of the subjects’ expectations as to 
the future treatment of the victim was done in the 
following way. The past-event groups were told that a 
visiting faculty member and some students in the de- 
partment had, in the course of their work, made some 
tapes of experimental sessions; that one tape had been 
borrowed and would be shown to them; and that, after 
seeing it, they would answer some questions. The mid- 
point groups were told that a visiting faculty member 
and some students in the department were currently 
running some experimental sessions in another building 
and that an arrangement had been made whereby one 
session was to be seen live through a closed-circuit tele- 
vision apparatus. They were also told that after watch- 
ing the first part of the session, they would answer 
some questions and that they would then watch the 
session continue for an equally long period.⁴

The four levels of deception were then set up within 
the past-event and the midpoint groups. The deception 


3 We would like to thank George Gerbner and John
Massi of the Annenberg School of Communications, 
University of Pennsylvania, for their permission to use 
and assistance in using these facilities. 
groups were told that the experiment was designed to
discover particular cues that allow people to perceive
emotion in others; they were also led to believe that the
victim was actually receiving shocks for incorrect responses.
The partial-truth groups were told the real
purpose of the experiment, namely, to study reactions
to victims of misfortune. However, they were led to
believe that the victim was actually being shocked. The
complete-truth groups were told the real purpose and
the fact that the victim was only pretending to receive 
shocks. The role-playing groups were given the same 
information as the complete-truth groups: in addition, 
they were asked to play the role of an observer of a 
punished subject in a real learning experiment. Initially, 
a fifth condition, a partial-truth group informed as to 
method but uninformed as to purpose, had been in- 
cluded in the design. Unfortunately, it had to be 
dropped for reasons of space and time.

After the subjects had read the preliminary state- 
ment, they watched the videotape sequence. The most 
salient points of the differential explanations were then 
reemphasized in a brief statement that preceded the 
postmeasures. Subjects then filled out the postmea- 
sures, most important of which was a rating of the 
victim on the same 15 bipolar adjective scales on which 
they had previously rated a “typical female college 
student.” Several other measures were taken, including 
various checks on the manipulations. When the subjects 
had finished, they were completely debriefed and 
pledged to secrecy for the remainder of the semester. 


RESULTS 
Checks on the Manipulations 


Among the questions asked of subjects were 
two that can be used to check the effective- 
ness of the past-event-midpoint manipulation. 
These questions were the following: 


1. The learning experiment was
a. a live session carried out at the present. 
b. a tape of a session carried out at an earlier time. 
2. The girl you have been watching 
a. has completed her participation in the learning 
experiment 
b. will participate in another ten-minute session 
after a short break. 


Past-event subjects should answer “b” to 
Question 1 and “a” to Question 2, while midpoint
subjects should answer “a” and “b,” respectively.
A 3 X 2 table (the middle category
contains the two other possible combinations 
of answers) yields a chi-square of 63.60 
(df = 2), which is significant well beyond the 
.001 level of confidence. The manipulation 


4 Copies of the entire experimental booklets are to be 
found in the complete thesis and may be obtained from 
the Graduate Library of the Department of Psychology, 
University of Pennsylvania, for xerox costs and postage. 
seems to have successfully set up different perceptions
of the learning session and of the
subjects’ expectations concerning the girl’s 
future. (The most popular wrong response for 
both conditions was the “tape-continue” re- 
sponse. Almost none of the subjects answered 
both questions as the other condition was ex- 
pected to). 

One other question can be used to evaluate 
the effectiveness of the manipulation separating 
the complete-truth and role-playing groups 
from the deceived and partial-truth groups: 


The girl you have been watching was 


a. receiving strong electric shocks. 
b. acting as though she were receiving shocks. 


The “a” response was given by 87% of those 
who should have given it, namely, the decep- 
tion and partial-truth groups; and the “b” re- 
sponse was given by 97% of those in the other 
two groups. Chi-square in this case is over 100,
and the probability is well beyond .001. 
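
The size of this chi-square can be made concrete with a short sketch. The cell counts below are not reported in the text; they are reconstructed here under the assumptions that every subject answered the question and that the rounded percentages (87% and 97%) apply to the 79 subjects in each pair of groups, as implied by the ns in Table 1. The function itself is the ordinary Pearson statistic without any continuity correction.

```python
def pearson_chi_square(table):
    """Return the Pearson chi-square statistic and df for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# ASSUMED counts (rows: deceived-as-to-method groups, informed groups;
# columns: answered "a", answered "b"). 69/79 is about 87%, 77/79 about 97%.
observed = [[69, 10],
            [2, 77]]

chi2, df = pearson_chi_square(observed)
print(round(chi2, 2), df)  # 114.82 1
```

Under these assumed counts the statistic is consistent with the "over 100" reported above. The same function applied to any 3 X 2 table returns df = 2, matching the first manipulation check.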

Thus, the manipulations we can check have 
been highly effective. We do not know how 
well the subjects accepted the role-playing in- 
struction, nor do we know whether partial- 
truth subjects were affected by the information 
that we were studying response to victims of 
misfortune. 


Index of Attractiveness of Victim 


The dependent measure used by Lerner and 
Simmons and, therefore, by the present in- 
vestigators, is calculated by subtracting each 
rating given to the “typical female college 
student” from the rating given to the victim 
and then summing these 15 difference scores to 
obtain an overall index. If the index is positive, 
the victim is relatively positively evaluated; 
if negative, she is devalued. Table 1 presents 
the means, ns, and standard deviations of the 
index of attractiveness for each of the experi- 
mental conditions. 
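
The computation of the index can be sketched as follows. The ratings shown are invented for illustration only, and the sketch assumes each of the 15 scales has already been keyed so that a higher number is the more favorable rating; neither the scale range nor the keying is specified in the text above.

```python
N_SCALES = 15  # number of bipolar adjective scales in the index


def attractiveness_index(victim_ratings, typical_student_ratings):
    """Sum of (victim - typical student) differences over the 15 scales.

    A positive index means the victim is rated more favorably than the
    "typical female college student"; a negative index means devaluation.
    """
    if len(victim_ratings) != N_SCALES or len(typical_student_ratings) != N_SCALES:
        raise ValueError("expected one rating per scale on all 15 scales")
    return sum(v - t for v, t in zip(victim_ratings, typical_student_ratings))


# Hypothetical subject: rates the typical student 5 on every scale and
# rates the victim one point lower on 10 of the 15 scales.
typical = [5] * 15
victim = [4] * 10 + [5] * 5

print(attractiveness_index(victim, typical))  # -10, i.e., the victim is devalued
```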

The crucial comparisons for our hypothesis 
involve the four pairs of past-event and mid- 
point groups within the levels of deception. Of 
these, as predicted, the only one in which t was
significant is the deception condition (t = 2.22,
df = 38, p < .05, two-tailed). In none of the
other comparisons was t greater than 1.
Further comparisons reveal that the midpoint
deception group differs from each of the other


TABLE 1

MEANS, ns, AND STANDARD DEVIATIONS OF INDEX OF
ATTRACTIVENESS OF VICTIM BY CONDITION

Group         Deception   Partial truth   Complete truth   Role play

Past-event
  M              1.58          3.40             5.55           -.40
  SD            10.8          14.3             19.5           19.5
  n             19            20               20             20
Midpoint
  M             -8.27          2.89             1.05           1.50
  SD            15.5          14.0             15.3           15.2
  n             21            19               19             20

Note.—For the deception, partial-truth, complete-truth, and
role-play conditions, t = 2.22, p < .05, t < 1, t < 1, and
t < 1, respectively.


midpoint groups as well (for the partial-truth
group, t = 2.31, p < .05; for the complete-truth
group, t = 1.86, p = .08; for the role-playing
group, t = 1.94, p = .07).
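
A comparison of this kind can be approximated from the summary statistics in Table 1. The sketch below assumes an equal-variance (pooled) independent-groups t test and treats the tabled standard deviations as sample values with an n - 1 denominator; the text does not state either choice. Because the table entries are rounded, the result is close to, but not identical with, the reported value.

```python
from math import sqrt


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Independent-groups t with pooled variance, from summary statistics."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error, df


# Deception condition: past-event group vs. midpoint group (Table 1).
t, df = pooled_t(1.58, 10.8, 19, -8.27, 15.5, 21)
print(round(t, 2), df)  # about 2.31 with df = 38 (the text reports t = 2.22)
```

The degrees of freedom agree with the reported df = 38. The small gap between roughly 2.31 and the reported 2.22 may reflect rounding in the table or a different convention for the standard deviations.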


DISCUSSION


It seems clear from these findings that in this 
particular situation, complete deception is 
necessary for the differential devaluation effect 
to occur. As soon as the subjects were informed 
concerning the purpose of the experiment, even 
though they remained deceived as to method, 
the subjects in the midpoint condition no longer 
rated the subject less favorably than did sub- 
jects in the past-event condition. It is of course 
possible that had we included the second 
partial-truth group, informed as to method 
but uninformed as to purpose, this group 
might have reproduced the Lerner and Sim- 
mons effect. In fact, since this condition would 
likely have provided few clues as to what was 
a desirable response, the chances are that re- 
sponses would at least have been closer to those 
of the deceived subjects. It is also clear that a 
simple role-playing instruction of this kind is 
ineffective in leading the subjects to produce 
the same results as deceived subjects. 

On the surface, these findings seem to sup- 
port the position of those who use deception in 
experiments. The “well-substantiated” result
cannot be replicated without the use of decep- 
tion. Looked at in another light, however, the 
data may be viewed somewhat differently. If 
we are, from the results of these laboratory 
experiments, to conclude something about 
people’s reactions in the real world, perhaps we 
should examine the nature of the deception a
bit differently. The situation is set up as the 
observation of the emotional responses of some- 
one in a learning situation. Hopefully, since 
this is the instruction, this is how subjects view 
the situation. When, instead, you suggest to 
them that they are observing a victim of 
misfortune, it appears that their reaction to 
the victim is very different—they do not 
devalue her at all, even though they believe 
that she is really being shocked. Perhaps, then, 
in the real world, this effect will not occur 
either if the observer perceives the victim as 
truly a helpless victim of misfortune, rather 
than as an individual who is being punished for 
his behavior. This suggestion bears a striking
resemblance to the findings of Piliavin et al. 
(1967), who showed that if Lerner and Sim- 
mons’ victim is made to appear truly helpless, 
because the machine has been wrongly pro- 
grammed, she is not devalued in the midpoint 
as compared to the end point condition. 

If the results of this study are viewed in this 
way, the conclusion to be drawn concerning the 
use of deception is very different. Perhaps 
under some conditions the unthinking use of 
deception as the only research technique can 
obscure theoretically relevant concerns, such 
as the subject’s definition of the situation. The 
obvious suggestion to be made on the basis of 
this line of reasoning is that investigators use 
both deception and nondeception on occasion 
to see what subjects can tell us about their 
differential perceptions under the two con- 
ditions. Beyond this suggestion, the results of 
this study cannot be of help in answering the 
question “What method can we substitute for
deception?” Naturalistic studies can help, as 
can the use of “unobtrusive measures” (Webb, 
Campbell, Schwartz, & Sechrest, 1966). 
Clearly, if we wish to avoid the use of decep- 
tion, we must learn to think in terms of a non- 
deception method first, rather than starting 
with deception and then seeing whether 
nondeceived subjects “still act right.”
BLACK POWER IDEOLOGY AND COLLEGE STUDENTS’ ATTITUDES
TOWARD THEIR OWN AND OTHER RACIAL GROUPS¹

Negro depreciation of fellow Negroes and overidealization of whites have focused on
the visible signs of racial identification, particularly skin color. Black power ideology 
has attempted to reverse the negative connotations of blackness and promote positive
attitudes toward blackness, as well as toward black persons. In an exploratory
attempt to discover how successfully this goal was being achieved, 272 white 
and 176 Negro college students were administered a Black Power Ideology Scale 
and were asked to rate the concepts friend, enemy, ideal person, colored person, 
black person, white person, and Negro person on the semantic differential. Within
each racial group separately, analyses of variance were computed to compare the re- 
sponses of students high in black power orientation to the responses of students low 
in black power orientation. In both racial groups, subjects high in black power 
orientation placed a lower evaluation upon white person and a higher evaluation 
upon black person than subjects low in black power beliefs; however, other findings
varied by race of the subjects.


Minority group identification with the 
majority group’s idealization of itself and 
depreciation of the minority have for some time 
been recognized as a major threat to the 
development of healthy self-esteem and pro- 
tective group solidarity among minority group 
members (Barron, 1967; Lewin, 1951; Rose & 
Rose, 1948). The prevalence of anti-Negro 
attitudes among Negroes has been extensively 
documented. Clark and Clark’s (1947) classic 
study of two hundred and fifty-three 2- to 
7-year-old Negro children revealed that a 
majority of the youngsters at each age level 
preferred the white doll and stated that the 
Negro doll “looked bad.” Subsequent similar 
research has replicated and extended these 
findings (see literature review in Proshansky & 
Newton, 1968). Radke-Yarrow, Campbell, and 
Yarrow (1958) found that Negro girls in an 
interracial camp devalued Negro peers but 
overvalued white girls. Morland (1962) found 
that 58% of Negro nursery school children, 
but only 10% of the white children in a sample 
of 407, preferred to play with children of a race 
other than their own. Bayton, Austin, and 


1 The authors would like to express their appreciation 
to Craig Barton and Kevin Houlihan for assistance in 
the computer processing of the data and to Merton 
Krause, Daniel Solomon, and Daniel Druckman for
helpful comments regarding the manuscript. 

2 Requests for reprints should be sent to Elise E. 
Lessing, Institute for Juvenile Research, 1140 S. Paulina
Street, Chicago, Illinois 60612.
Burke (1965) found that Negroes produced
more favorable scores on several personality 
traits when directed to respond as if they were 
white than when responding as Negroes. 

One of the many factors possibly contribut- 
ing to Negro self-depreciation is the language 
custom of color coding racial groups by designa- 
tions such as white, yellow, and black for 
Caucasian, Oriental, and Negro, respectively, 
so that racial concepts are tied to a host of 
general cultural symbols with implications 
regarding goodness and badness (Fager, 1967, 
p. 29; Proshansky & Newton, 1968; Williams, 
1966). In a series of extremely ingenious re- 
searches, Williams and his collaborators (Mor- 
land & Williams, 1969; Williams, 1966; 
Williams & Carter, 1967) demonstrated that 
whites produced similar associations on the 
semantic differential to colors, persons desig- 
nated by those colors, and ethnic groups 
associated with a particular color. Thus, whites 
viewed the color white, white person, and 
Caucasian as more similar (and more “good,”
but less “powerful”) than the color black,
black person, and Negro. Negro subjects did
not link colors and concepts as closely; how- 
ever, they also rated white and white person as 
slightly more good than the color black and 
black person. Williams (1966) expressed 
pessimism regarding 


the efforts of the Black Muslims and others to arbitrarily
reverse the conventional symbolism by associating
black with goodness and white with badness.
It would seem doubtful that such deliberate efforts at
reversal can generally succeed in a culture where the 
symbolism of white as good and black as bad is so 
thoroughly entrenched in literature, religion, the mass 
media, etc. [p. 539]. 


The black power movement has set as one of 
its tasks the very reversal of symbolic associa- 
tions which Williams considered so improbable. 
Wright (1967) urged that “Negroes must, 
then, be proud of the variegated blackness 
which is theirs [p. 65].” Ladner (1967) described
the efforts of cosmopolitan Mississippi
black power advocates to promote “black
consciousness,” defined as “a set of ideas and 
behavior patterns affirming the beauty of 
blackness and dispelling any negative images 
that black people may have incorporated about 
blackness [p. 9].” In addition, blacks were
advised to free themselves from domination 
by whites. Franklin (1969) affirmed that 


the ideology of Black Power gives supreme importance
to the problem of the Negro’s image of himself. . . .
The meaning of blackness as viewed by whites is
turned into its opposite—beauty and strength rather
than ugliness and submissiveness. . . . Finally, and
most important, there is the ongoing effort to imbue
the individual Negro with an ethos of power and
self-respect . . . [p. 291].


Pinderhughes (1969) described the black power
movement as “a massive sociotherapy, group
therapy, and individual therapy movement
whose advocates are attempting to develop
more adaptive psychology, role relationships,
and group structure for blacks by developing
bonds . . . and self-esteem for black people
[p. 109].”

The present study was undertaken as an 
exploratory investigation of the extent to 
which the cognitive and emotional restructur- 
ing advocated by black power advocates was 
actually being accomplished. The semantic 
differential technique used in the series of 
studies by Williams and his associates 
(Williams, 1966; Williams & Carter, 1967) 
was selected for use in the current study for 
two reasons. It was considered desirable to be 
able to compare the findings of the current 
study with extensive earlier data on college 
students. More important, the D statistic,
developed by Osgood, Suci, and Tannenbaum
(1957, pp. 89-97) as an index of the distance
between pairs of concepts in three-dimensional
semantic space, was judged to provide an
excellent means of evaluating complex hy- 
potheses regarding attitudes. For example, 
Williams (1966, pp. 533, 538) was able to 
demonstrate not only that Negro college 
students rated black person lower than white 
person on the evaluation dimension of the 
semantic differential, but that they rated black 
person as more distant from their generic 
concept of person. 
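
For readers unfamiliar with the D statistic, the following sketch shows the computation as a straight-line (Euclidean) distance between two concepts located by their evaluation, potency, and activity factor scores. The factor scores used here are invented solely to illustrate the arithmetic and are not data from this or any earlier study.

```python
from math import sqrt


def d_statistic(profile_a, profile_b):
    """Distance between two concepts in semantic space.

    Each profile is a sequence of factor scores in the same order,
    here (evaluation, potency, activity).
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must have the same number of dimensions")
    return sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


# Hypothetical factor scores (evaluation, potency, activity).
ideal_person = (6.0, 5.0, 5.0)
some_concept = (3.0, 5.0, 1.0)

print(d_statistic(ideal_person, some_concept))  # 5.0
```

A smaller D indicates that the two concepts are rated more alike across all three dimensions at once, which is what allows hypotheses about "distance" between, say, a racial label and the concept friend to be tested.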

Eleven specific hypotheses were formulated 
on the basis of both the earlier Williams 
studies and the writings of the black power 
advocates previously cited. It was hypothe- 
sized that students high in black power 
orientation, when compared with students low 
in black power orientation with regard to
semantic differential responses, would (a)
produce a significantly higher evaluation factor
score for black person, (b) produce a significantly
lower evaluation factor score for white
person, (c) perceive a significantly smaller 
distance between black person and ideal 
person and between black person and friend, 
(d) perceive a significantly larger distance between
black person and enemy, and (e) perceive a
significantly smaller distance between white
person and enemy. It was further hypothesized
that when reactions to the three labels black 
person, Negro person, and colored person were 
compared, students high in black power 
orientation (f) would produce a significantly
higher evaluation factor score for black person 
than for Negro or colored person, (g) would 
perceive a significantly smaller distance be- 
tween black person and friend and between 
black person and ideal person than between 
the other two nonwhite concepts and friend 
and ideal person, and (h) would perceive a 
significantly larger distance between black 
person and enemy than between the other two 
nonwhite concepts and enemy. Conversely, it 
was hypothesized that students low in black 
power orientation (i) would produce a sig- 
nificantly lower evaluation factor score for 
black person than for Negro person or colored 
person, (j) would perceive a significantly 
larger distance between black person and friend 
and between black person and ideal person 
than between the other two nonwhite concepts 
and friend and ideal person, and (k) would 
perceive a significantly smaller distance between
black person and enemy than between
the other two nonwhite concepts and enemy. 

The 11 hypotheses presented above were 
considered to be equally applicable to both 
white and Negro subjects. That is, while the 
proportion of whites and Negroes accepting 
black power ideology would be expected to 
differ significantly, the cognitive consequences 
of such acceptance would be expected to be 
similar. 


METHOD 
Subjects 


Subjects were 272 white and 176 Negro college 
students enrolled in five colleges and universities. In 
each of the three institutions in the Chicago area, both 
authors conducted the testing as an interracial team 
during regular class sessions. At the other two schools,
one a racially integrated state university in the Midwest 
and the other a black university in the South, re- 
sponsibility for the testing was assumed by black 
faculty members in order that all five groups of 
subjects would see the primary responsibility for the 
research taken by a Negro. In addition to the classroom 
testing, one small group of Negro girls was tested at a 
sorority meeting.⁴ Subject recruitment was directed
at getting the widest possible range of attitudes in 
regard to both black power ideology and conceptions 
of various racial groups. The testing was conducted 
between March 1968 and January 1969. It was not 
possible to measure or control for temporal effects or 
for the effects of the assassination of Dr. Martin Luther 
King in April 1968, since different types of samples 
were being tested at different times. Since the major 
focus of the research was the attitudes of Negro 
students, it was fortunate that over three-fourths of 
the Negro students were tested within the 2-month 
period of January-February 1969. 

The final sample of 448 subjects consisted of the 
subjects retained in the study after the elimination of 
125 subjects for the following reasons: over 40 years 
of age (5 subjects); member of religious order (20 
subjects); failure to record sex, race, or paternal 
occupation (27 subjects); nonwhite other than Negro 
(9 subjects); excessive omissions⁵ (60 subjects); unvarying
“neutral” or illogical responses to semantic
differential items (4 subjects). 
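
The attrition figures above can be checked with a brief tally. All counts are taken directly from the preceding paragraph; the dictionary keys are shorthand labels introduced here, and the total number originally tested is a derived figure that the text does not state explicitly.

```python
# Reasons for elimination and the number of subjects removed for each.
EXCLUSIONS = {
    "over 40 years of age": 5,
    "member of religious order": 20,
    "sex, race, or paternal occupation not recorded": 27,
    "nonwhite other than Negro": 9,
    "excessive omissions": 60,
    "unvarying neutral or illogical responses": 4,
}

excluded = sum(EXCLUSIONS.values())
final_sample = 272 + 176          # white and Negro students retained
tested = final_sample + excluded  # derived: everyone who completed the forms

print(excluded, final_sample, tested)  # 125 448 573
```

The itemized reasons sum to the 125 eliminations reported, which implies that roughly 22% of those tested were dropped, nearly half of them for excessive omissions.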


3 The assistance of Edgar G. Epps and Wesley J.
Lyda is gratefully acknowledged. 

4 The authors wish to thank the Beta Chapter of the 
Alpha Kappa Alpha sorority for its participation.

5 On the semantic differential, failure to rate all
concepts on at least 19 of the 20 scales was considered 
to constitute “excessive omissions.” Since only 5 scales 
were used for the activity and potency factors, the 
prorating of many cases for the omission of more than 
one scale could have created much distortion in the 
means for either of these two factors.


Instruments 


The major independent variable, acceptance of 
black power ideology, was measured by means of a 
23-item Black Power Ideology Scale (BPIS), developed 
by the first author for this study. The BPIS, which is 
described in detail elsewhere (Lessing & Zagorin, 1970), 
requires that subjects respond on a 4-point scale of 
“strongly agree,” “agree,” “disagree,” and “strongly 
disagree” to items such as: “While integration may 
be desirable, black (Negro) Americans should first 
try to get a fair share of economic and political power 
in America.” The instrument yields both factor scores 
and total scores. However, in the present study, only 
the total score with a possible range of 0-92 was used 
as a measure of level of general black power orientation. 
This total score was previously found to discriminate 
successfully between criterion groups consisting of 
members and nonmembers of black student associations, 
as well as between subjects describing themselves as 
favorable to black power and those describing them- 
selves as unfavorable (Lessing & Zagorin, 1970). 

The dependent variable, attitude toward various 
racial groups, was measured by means of the semantic 
differential technique (Osgood et al., 1957). The choice 
of scales and concepts was based largely upon the 
studies of Williams and his associates (Williams, 1966; 
Williams & Carter, 1967). The concepts to be rated 
were friend, enemy, ideal person, black person, white 
person, colored person, and Negro person. The first 
three were always presented in the order listed. The 
ethnic concepts were presented in counterbalanced 
order, with one-fourth of the subjects rating each of 
the four ethnic concepts as his or her first racial concept. 
Each concept was rated on 20 scales, 10 representing 
the evaluation factor, 5 representing potency, and 5 
representing activity. The individual scales were as 
follows: kind-cruel, clean-dirty, bad-good, honest- 
dishonest, unpleasant-pleasant, worthless-valuable, 
fair-unfair, cowardly—brave, friendly-unfriendly, un- 
successful-successful (evaluation); thick—thin, strong- 
weak, small-large, soft-hard, heavy-light (potency); 
and cold-hot, active-passive, slow-fast, sharp-dull, 
violent-moderate (activity). 
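
As an illustration of how the 20 scales could be reduced to three factor scores, the sketch below averages the ratings within each factor. It rests on assumptions not stated in the text: ratings run from 1 (left-hand pole) to 7 (right-hand pole), each scale is keyed so that a higher score means more favorable, more potent, or more active, and the choice of the "high" pole for each scale (for example, violent as the active pole) is a guess made for the example.

```python
# scale name -> (factor, True if the left-hand pole is treated as the high pole)
SCALES = {
    "kind-cruel": ("evaluation", True),
    "clean-dirty": ("evaluation", True),
    "bad-good": ("evaluation", False),
    "honest-dishonest": ("evaluation", True),
    "unpleasant-pleasant": ("evaluation", False),
    "worthless-valuable": ("evaluation", False),
    "fair-unfair": ("evaluation", True),
    "cowardly-brave": ("evaluation", False),
    "friendly-unfriendly": ("evaluation", True),
    "unsuccessful-successful": ("evaluation", False),
    "thick-thin": ("potency", True),
    "strong-weak": ("potency", True),
    "small-large": ("potency", False),
    "soft-hard": ("potency", False),
    "heavy-light": ("potency", True),
    "cold-hot": ("activity", False),
    "active-passive": ("activity", True),
    "slow-fast": ("activity", False),
    "sharp-dull": ("activity", True),
    "violent-moderate": ("activity", True),
}


def factor_scores(ratings):
    """Average the keyed 1-7 ratings within each factor.

    ratings: dict mapping scale name to a rating, where 1 is the left-hand
    pole and 7 the right-hand pole (an assumed convention).
    """
    totals, counts = {}, {}
    for scale, rating in ratings.items():
        factor, left_is_high = SCALES[scale]
        keyed = 8 - rating if left_is_high else rating  # reverse-key if needed
        totals[factor] = totals.get(factor, 0) + keyed
        counts[factor] = counts.get(factor, 0) + 1
    return {factor: totals[factor] / counts[factor] for factor in totals}
```

Under this keying, a concept rated at the favorable, potent, and active pole of every scale would receive 7.0 on all three factors.
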


Statistical Analysis 


The total BPIS scores of white and black students 
were compared by means of a three-way analysis of 
variance, with race, sex, and dichotomized social class 
level as independent variables and total BPIS score as 
dependent. Social class level was dichotomized on the 
basis of paternal occupation, which was classified by 
means of the codes in the Alphabetical Index of Occupa- 
tions and Industries (United States Bureau of the 
Census, 1960). All occupations at or above “clerical, 
sales, and kindred workers” were categorized as 
middle-class. The analyses of variance were performed 
by means of a computer program based on Bancroft's 
(1968, pp. 55-60) formulas for least-squares analysis 
of variance with unequal cell sizes. 

For the actual tests of hypotheses, many of which 
would involve tests of the significance of interactions, 
it was felt that an orthogonal design with equal cell 

TABLE 1 


RESULTS OF TWO SETS OF 21 ANALYSES OF VARIANCE, WITH SEX AND LEVEL OF BLACK 
POWER ORIENTATION (BPIS SCORES) AS INDEPENDENT VARIABLES AND SEMANTIC 
DIFFERENTIAL EVALUATION, POTENCY, AND ACTIVITY SCORES FOR SEVEN 
CONCEPTS AS DEPENDENT VARIABLES 


                              F values for white sample           F values for Negro sample 
Concept and semantic 
differential factor           Sex         BPIS        Sex X BPIS  Sex         BPIS        Sex X BPIS 
Friend (evaluation)           13.42**     3.32        .05         .07         .01         1.45 
Enemy (evaluation)            .13         2.24        .06         .50         .00         4.22* 
Ideal person (evaluation)     4.43*       5.05*       1.01        4.10*       .08         .31 
Colored person (evaluation)   9.09**      4.39*       1.48        .07         .79         1.03 
Black person (evaluation)     8.45**      16.48**     2.06        .28         6.84*       5.41* 
White person (evaluation)     1.16        4.68*       .88         .00         15.20**     .85 
Negro person (evaluation)     9.55**      9.87**      .23         [illegible] .39         .91 
Friend (potency)              .10         .20         2.99        1.38        1.48        1.18 
Enemy (potency)               .12         1.56        .02         1.26        .67         1.82 
Ideal person (potency)        3.47        .93         2.83        .90         .03         11.12** 
Colored person (potency)      .02         .04         [illegible] [illegible] 1.33        1.01 
Black person (potency)        2.08        .00         .08         4.55*       2.06        [illegible] 
White person (potency)        1.74        .00         .10         1.41        3.17        .03 
Negro person (potency)        .59         1.39        1.09        .14         .38         .04 
Friend (activity)             .43         2.61        .60         2.64        6.75*       .07 
Enemy (activity)              .00         .68         [illegible] .11         .03         .21 
Ideal person (activity)       .03         .93         .46         2.25        1.49        7.41** 
Colored person (activity)     3.50        1.34        .05         .00         1.45        .54 
Black person (activity)       .03         4.20*       .70         .09         12.94**     .01 
White person (activity)       .38         .65         2.49        1.11        7.81**      3.77 
Negro person (activity)       .42         3.36        .04         .00         [illegible] .05 


Note.—df = 1/232 for all F ratios for the white sample; df = 1/112 for all F ratios for the Negro sample. 


*p < .05. 
**p < .01. 


sizes for the appropriate analyses of variance would be 
desirable. Therefore, for whites and Negroes separately, 
cases were eliminated randomly until there were equal 
numbers of subjects in each subgroup (males high in 
black power orientation, males low in black power 
orientation, females high in black power orientation, 
and females low in black power orientation). Then for 
whites and Negroes separately, four sets of analyses 
of variance were performed. First, sex and black power 
orientation were the independent variables, and 21 
semantic differential scores served in turn as the 
dependent variables, namely, the evaluation, potency, 
and activity scores for the concepts friend, enemy, 
ideal person, colored person, black person, white 
person, and Negro person. Second, sex and black power 
orientation were the independent variables, and D 
scores for friend versus each of the four racial concepts 
served as dependent variables. Third, sex and black 
power were independent variables, while D scores for 
enemy versus each of the four racial concepts served 
as dependent variables. Finally, sex and black power 
were independent variables, while D scores for ideal 
person versus each of the four racial concepts were 
the dependent variables. These analyses were per- 
formed on the IBM 360 by means of the Mesa 98 
least-squares analysis of variance program developed 
by Finn (1968). The analysis of variance technique 
was utilized to analyze the D-score data in spite of 
Osgood et al.’s (1957) warning that the distribution 
of D is not known and is probably not normal. Boneau 


(1960, p. 63) concluded that for sample sizes over 30, 
and particularly for comparisons involving samples of 
equal size, violations of the assumptions underlying 
the t and F tests have negligible effects upon the power 
of these tests to reflect differences between population 
means on the basis of sample data. 
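
The random elimination of cases to reach equal cell sizes, described above, might look like the following sketch. The function name, the tuple layout of a subject, and the fixed random seed are illustrative assumptions; the text does not say how the random elimination was actually carried out.

```python
import random


def equalize_cells(subjects, key, seed=0):
    """Randomly drop cases until every cell has the size of the smallest cell.

    subjects: list of subject records.
    key: function returning the cell label of a subject, e.g. (sex, BPIS level).
    seed: fixed only so that the illustration is repeatable.
    """
    rng = random.Random(seed)
    cells = {}
    for subject in subjects:
        cells.setdefault(key(subject), []).append(subject)
    smallest = min(len(members) for members in cells.values())
    # rng.sample draws without replacement, so no case is retained twice.
    return {cell: rng.sample(members, smallest) for cell, members in cells.items()}


# Hypothetical records of the form (sex, BPIS level, subject id).
subjects = (
    [("M", "high", i) for i in range(5)]
    + [("M", "low", i) for i in range(3)]
    + [("F", "high", i) for i in range(4)]
    + [("F", "low", i) for i in range(6)]
)
balanced = equalize_cells(subjects, key=lambda s: (s[0], s[1]))
print({cell: len(members) for cell, members in balanced.items()})
```

Applied separately within each race, a procedure of this kind would leave four equal subgroups, which is consistent with the error degrees of freedom given in the note to Table 1 (232 = 236 - 4 and 112 = 116 - 4).
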


RESULTS 


The three-way analysis of variance with 
BPIS scores as the dependent variable yielded 
an F of 170.51 for race (p < .001), an F of 8.48 
for sex (p < .01), and an F of .03 for social 
class (p > .05). The 176 Negro subjects earned 
a mean BPIS score of 69.9, as compared with 
the mean of 58.5 earned by the 272 white 
subjects. While the significant F for sex 
reflects the fact that males earned a higher 
mean BPIS score than females, it was only 
among the two Negro subgroups that the sex 
difference was statistically significant. Negro 
males earned a mean BPIS score of 71.7, while 
Negro females earned a mean of 68.2. The sex 
differential may be partly a function of the 
militancy that is a part of black power ideology 
and partly a function of the ideology’s emphasis 
upon political and economic concerns. Negro 
males have previously been found to express 
more racial militancy than females (Inter- 
university Social Research Committee, 1967, 
p. 17; Murphy & Watson, 1967, p. 32). Males 
in general have been found to surpass females 
on measures of political and economic values 
(see Kinnick, 1966, p. 114; Lott & Lott, 1963, 
p. 220; Smith, 1962). In the validation study 
subsample of whites (see Lessing & Zagorin, 
1970), the sex difference reached statistical 
significance. However, the higher total BPIS 
score of white males was attributable to their 
higher score on the black culture subset of 
items. They did not surpass white females 
in advocating that Negroes achieve political 
and economic power. 

The results of the two sets of analyses of 
variance (one for whites and one for Negroes), 
with sex and level of black power orientation 
as independent variables and semantic differ- 
ential evaluation, potency, and activity scores 
as dependent variables, are presented in Table 
1. Table 2 contains the means for the high 
and low black power subgroups within each 
race. 

TABLE 2 


SEMANTIC DIFFERENTIAL FACTOR MEANS FOR HIGH AND LOW BLACK POWER (BPIS) 
SUBGROUPS OF THE WHITE AND NEGRO SAMPLES AND RESULTS OF 
t TESTS RELEVANT TO HYPOTHESES 


                              White sample              Negro sample 
Concept and semantic 
differential factor           High BPIS   Low BPIS    High BPIS   Low BPIS 
                              (n = 118)   (n = 118)   (n = 58)    (n = 58) 
Friend (evaluation)           5.97        6.14        6.14        6.13 
Enemy (evaluation)            2.64        2.47        2.49        2.48 
Ideal person (evaluation)     6.38a       6.57a       6.60        6.57 
Colored person (evaluation)   4.72a       4.45a,b     5.12        5.31 
Black person (evaluation)     4.70a       4.12a       5.72a       5.16 
White person (evaluation)     4.71a       4.98a       3.46a       4.37a 
Negro person (evaluation)     4.85a       4.44a,b     5.40        5.26 
Friend (potency)              4.36        4.32        4.38        4.57 
Enemy (potency)               4.29        4.41        3.98        4.13 
Ideal person (potency)        4.37        4.44        4.61        4.59 
Colored person (potency)      4.27        4.25        4.19        4.40 
Black person (potency)        4.46        4.46        4.96        4.72 
White person (potency)        4.11        4.12        3.63        3.90 
Negro person (potency)        4.40        4.31        4.32        4.42 
Friend (activity)             4.82        4.68        4.18a       4.41a 
Enemy (activity)              4.26        4.16        4.06        4.09 
Ideal person (activity)       5.02        4.94        5.00        4.85 
Colored person (activity)     4.32        4.21        4.29        4.52 
Black person (activity)       4.52a       4.32a       5.20a       4.62a 
White person (activity)       4.27        4.34        3.70a       4.22a 
Negro person (activity)       4.44        4.27        4.52        4.46 


Note.—Significant differences not relevant to hypotheses are not marked. 

a Difference between means of high and low BPIS groups is significant at the .05 level. 

b Difference between this mean and comparable mean for black person produced by same BPIS group is significant at the .05 level. 

Hypotheses a and b were upheld in both the 
white and the Negro samples. As predicted, 
the students who were high in black power 
orientation produced a significantly higher 
evaluation score for black person and a 
significantly lower evaluation score for white 
person than students who were low in black 
power orientation. The significant Sex X BPIS 
interaction obtained in the Negro sample in 
regard to the evaluation scores for black 
person reflects the greater differential among 
female, as compared with male, black power 
ideological subgroups in regard to favorable 
evaluation of black person. 

In addition to the predicted differences in 
attitudes toward individual concepts, other 
significant differences were found between high 
and low black power subjects in regard to 
attitudes toward both racial and nonracial 
concepts (see Table 1). In the white sample, 
subjects who were low in black power ideology 
assigned a higher evaluation factor score to 
ideal person, a lower activity factor score to 
black person, and a lower evaluation factor 
score to all three nonwhite ethnic concepts than 
did subjects high in black power orientation. 
Among the white subjects, low acceptance of 
black power ideology tended to be associated 
with a generally less favorable attitude toward 
Afro-Americans than the attitude held by 
black power advocates. 

In the Negro sample, students high in black 
power orientation, particularly males, tended 
to place a higher value upon being powerful 
and active than did low black power subjects, 
even when nonracial concepts were involved. 
Thus, the high black power Negro subjects 
perceived friend as significantly higher on the 
activity factor than did low black power 
Negro subjects (F = 6.75, p < .05; see Table 
1). The ratings of ideal person on potency and 
activity revealed a similar pattern of differ- 
ences that fell short of statistical significance. 

The results of the analyses of variance with 
level of black power orientation and type of 
ethnic concepts as independent variables and 
D scores as dependent variables are presented 
in Table 3. All three reference concepts (ideal 
person, friend, enemy) for the D scores pro- 
duced significant interactions between level of 
black power orientation (BPIS score) and the 
particular ethnic group being evaluated. There- 
fore, the hypotheses regarding differences 
between specific pairs of means within the 
interaction term were evaluated by means of 
Scheffé’s t. Table 4 contains the mean D values 
for the racial and BPIS subgroups and the 
results of the t tests for each hypothesis. 

TABLE 3 


RESULTS OF SIX TWO-WAY ANALYSES OF VARIANCE WITH LEVEL OF BLACK POWER ORIENTATION 
(BPIS SCORES) AND TYPE OF ETHNIC CONCEPT (COLORED, BLACK, WHITE, OR NEGRO) AS 
INDEPENDENT VARIABLES AND SEMANTIC DIFFERENTIAL DISTANCE (D) SCORES 
COMPUTED VERSUS IDEAL PERSON, FRIEND, AND 
ENEMY AS DEPENDENT VARIABLES 


                     F values for white sample                    F values for Negro sample 
Reference concept                                BPIS X                                       BPIS X 
for D scores         BPIS       Ethnic concept   Ethnic concept   BPIS       Ethnic concept   Ethnic concept 
Ideal person         1.06       163.66**         186.77**         3.12       46.04**          13.50** 
Friend               62.81**    3.37*            4.40**           8.47**     29.08**          6.75** 
Enemy                88.55**    72.96**          74.16**          .30        34.87**          9.23** 


Note.—For the white sample, df = 1/234 for the F for BPIS and 3/702 for the F for ethnic concept and interaction; for the 
Negro sample, df = 1/114 for the F for BPIS and 3/342 for the F for ethnic concept and interaction. 


*p < .05. 
**p < .01. 


Hypothesis c, which predicted that subjects 
high in black power orientation would perceive 
a significantly smaller distance between black 
person and both friend and ideal person than 
would subjects low in black power orientation, 
received partial support among both white and 
Negro subjects. 
Among white subjects, the hypothesis was 
confirmed in regard to black person and friend 
and among Negro subjects in regard to black 
person and ideal person. White subjects who 
were high in black power orientation actually 
saw black person and ideal person as more 
distant than did low black power subjects (D 
scores of 2.11 versus .99). However, the reason 
for this anomaly was not a paradoxical 
depreciation of blacks by pro-black-power whites, 
but, instead, a less differentiated and less 
extreme pattern of person perceptions on the 
part of high black power whites. These subjects 
saw all persons, white, black, colored, and 
Negro, as moderately distant from the ideal 
person (see D-score means in Table 4), while 
the low black power subjects had much more 
differentiated perceptions with some ethnic 
concepts being seen as very close to ideal person 
and some as extremely distant. 
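
For readers unfamiliar with the D statistic, the sketch below shows one common form of the semantic differential distance: the square root of the summed squared differences between two concept profiles. It is offered as an assumption about the general technique, not as the authors' exact computation; the text does not state whether D was computed over the three factor scores or over the individual scales, and the profile values in the example are hypothetical.

```python
import math


def d_score(profile_a, profile_b):
    """Distance between two concept profiles rated on the same dimensions."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must have the same number of dimensions")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


# Hypothetical (evaluation, potency, activity) profiles for one subject.
ideal_person = (6.5, 4.5, 5.0)
rated_concept = (4.5, 4.5, 4.0)
print(round(d_score(ideal_person, rated_concept), 2))
```

Because D would be computed for each subject and then averaged, the mean D values in Table 4 cannot be recovered by applying such a formula to the group means in Table 2.
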

As was predicted by Hypothesis d, both 
white and Negro students high in black power 
orientation perceived a significantly larger 
distance between black person and enemy than 
did students low in black power orientation. 
Hypothesis e, which predicted that students 
high in black power orientation would perceive 
a significantly smaller distance between white 
person and enemy than subjects low in black 


TABLE 4 


D-SCORE MEANS FOR HIGH AND LOW BLACK POWER (BPIS) SUBGROUPS OF THE WHITE AND 
NEGRO SAMPLES AND RESULTS OF SELECTED t TESTS RELEVANT TO HYPOTHESES 


                  Colored person            Black person              White person              Negro person 
Sample            Low BPIS    High BPIS     Low BPIS    High BPIS     Low BPIS    High BPIS     Low BPIS    High BPIS 

White 
  Ideal person    1.59b       2.09          .99a        2.11a         1.29        2.10          4.02b       2.01 
  Friend          2.86        1.85          2.44a       1.87a         2.51        1.78          2.81b       1.74 
  Enemy           2.49b       2.60          .96a        2.56a         1.24        [illegible]   [illegible] 2.64 
Negro 
  Ideal person    1.78        2.38b         2.12a       1.56a         2.71        3.84          1.92        2.06 
  Friend          1.70        2.30          1.99        1.90          2.42        3.44          1.76        2.08 
  Enemy           3.26        3.36          3.27a       3.95a         2.66a       2.02a         3.16        3.58 


Note.—D score represents distance between concept in left-hand column and concept in top row. Significant differences not 
relevant to hypotheses are not marked. 

a Difference between D-score means of low and high BPIS subgroups differed significantly; p for Scheffé’s t < .05. 

b Difference between this D-score mean and D-score mean for black person differed significantly for subjects within this BPIS 
level. 


power orientation, was upheld only in the 
Negro sample. In the white sample, the 
high-black-power-oriented subjects tended to 
view all types of persons, including white 
person, as more distant from enemy than they 
were viewed by low black power subjects. 
Across all ethnic concepts, the mean D be- 
tween concepts and enemy was 2.59 for high 
black power white subjects, as compared with 
1.35 for low black power subjects. The signifi- 
cance of this difference is indicated by the F 
of 88.55 (p < .001) for the main effect of BPIS 
scores in relation to enemy D scores (see 
Table 3). 

While Hypotheses a-e compared the atti- 
tudes of high and low black power orientation 
subgroups in regard to separate concepts, 
considered one at a time, Hypotheses f-k made 
predictions regarding perceptions of several 


concepts by the individuals within a single 
ideological subgroup. The results of the two 
sets of analyses of variance (one for whites 
and one for Negroes) with level of black power 
orientation and type of ethnic person as 
independent variables and, in turn, evaluation, 
potency, and activity scores as dependent 
variables are presented in Table 5. These 
analyses involved the same means that are 
presented in Table 2. However, instead of 
comparing the high versus low BPIS subgroups 
in regard to mean factor score for each ethnic 
concept (as was done in the analysis reported 
in Table 1), the high versus low BPIS sub- 
groups were compared in regard to mean factor 
score for all four ethnic concepts at the same 
time by treating the scores on the ethnic 
concepts as repeated measures on the same 
subjects. As was expected, there were generally 


TABLE 5 


RESULTS OF SIX TWO-WAY ANALYSES OF VARIANCE, WITH LEVEL OF BLACK POWER ORIENTATION 
(BPIS SCORES) AND TYPE OF ETHNIC CONCEPT (WHITE, BLACK, COLORED, OR NEGRO) AS 
INDEPENDENT VARIABLES AND SEMANTIC DIFFERENTIAL EVALUATION, POTENCY, AND 
ACTIVITY FACTOR SCORES AS DEPENDENT VARIABLES 


                     F values for white sample                    F values for Negro sample 
Factor serving as                                BPIS X                                        BPIS X 
dependent variable   BPIS       Ethnic concept   Ethnic concept   BPIS        Ethnic concept   Ethnic concept 
Evaluation           4.96*      20.05**          19.67**          .44         66.67**          12.46** 
Potency              .12        19.82**          .06              .97         29.13**          1.95 
Activity             2.80       3.95**           2.33             [illegible] 21.90**          [illegible] 


Note.—For the white sample, df = 1/234 for the F for BPIS and 3/702 for the F for ethnic concept and the interaction; for 
the Negro sample, df = 1/114 for the F for BPIS and 3/342 for the F for ethnic concept and the interaction. 

*p < .05. 
**p < .01. 

no main effects of BPIS. That is, except in the 
case of whites in regard to the evaluation 
factor, the mean evaluation, potency, and 
activity factor scores across all four racial 
concepts did not vary by level of black power 
orientation. (Since there were three nonwhite 
concepts and only one white concept, the 
white evaluation factor mean across all four 
concepts was depressed by the uniformly low 
evaluation scores that the low black power 
white subjects gave for all nonwhite concepts.) 
However, the predicted significant interaction 
between ethnic concept and level of black 
power orientation was obtained for the evalua- 
tion factor in both the white and the Negro 
samples. Therefore, Scheffé’s t (Hays, 1963, 
p. 484) was utilized in order to test the signifi- 
cance of the differences between pairs of means. 
Table 2 contains the results of these t tests. 

Hypothesis f, which predicted that students 
high in black power orientation would produce 
a higher evaluation score for black person 
than for Negro person and colored person, 
received no support in the white sample, but 
was partially confirmed in the Negro sample 
(for the ratings of black person versus colored 
person). Hypothesis i, which conversely pre- 
dicted that students low in black power 
orientation would produce a lower evaluation 
score for black person than for Negro person 
or colored person, was not upheld in either 
the white or the Negro sample. 

The other four hypotheses concerned with 
comparative reactions to the three nonwhite 
racial labels were stated in terms of D scores 
(see Tables 3 and 4). Again, there was quite 
limited evidence of any consistent pattern of 
differential response on the part of either the 
highly black-power-oriented subjects or the 
subjects low in black power orientation. 
Hypothesis g, which predicted that high black 
power subjects would perceive a smaller 
distance between black person and friend and 
between black person and ideal person than 
between the other two nonwhite concepts and 
friend and ideal person, received no support 
within the white sample, but was partially 
confirmed in the Negro sample (for black 
person versus ideal person D compared with 
colored person versus ideal person D). 

The converse of Hypothesis g, namely 
Hypothesis j, which predicted that low black 
power subjects would perceive a significantly 
larger distance between black person and 
friend and ideal person than between other 
nonwhite concepts and friend and ideal person 
was likewise not upheld in either the white or 
the Negro sample. In fact, among the white 
subjects, the findings were statistically signifi- 
cant in a direction opposite to that predicted. 
Whites low in black power orientation per- 
ceived black person, friend, and ideal person 
as similarly powerful and active. This simi- 
larity on two out of three factors (potency and 
activity) resulted in a relatively small D score 
between black person and friend or ideal 
person. Colored person and Negro person, on 
the other hand, were not seen as resembling 
friend or ideal person on any of the three 
factors: evaluation, potency, or activity. 

The final pair of hypotheses, h and k, likewise 
received only limited support. Hypothesis h, 
which predicted that subjects high 
in black power orientation would perceive a 
significantly larger distance between black 
person and enemy than between the other two 
nonwhite concepts and enemy, received no 
support in the white sample, but was partially 
confirmed in the Negro sample (in regard to 
black person versus enemy D, as compared 
with colored person versus enemy D). Hy- 
pothesis k, which predicted that students low 
in black power orientation would perceive a 
significantly smaller distance between black 
person and enemy than between the other two 
nonwhite concepts and enemy, was not upheld 
in the Negro sample and was only partially 
upheld in the white sample. White subjects 
who were low in black power orientation pro- 
duced the predicted significantly smaller black 
person versus enemy than colored person 
versus enemy D (.96 versus 2.49), but not 
because of a more negative reaction to black 
person than to colored person. Enemy and 
black person were seen as being similar in 
regard to having high potency, while colored 
person was more distant from the concept 
of enemy mainly because of being seen as 
much weaker. 


Discussion 


The present study was introduced as an 
exploratory investigation of the extent to 
which the cognitive and emotional restructuring 
sought by black power advocates was 
actually being accomplished. The specific question 
posed was: If one compares high and low 
scorers on a measure of black power ideology 
focused mainly upon its political and economic 
aspects, is there independent evidence 
that the high pro-black-power scorers express 
less depreciation of blacks and less idealization 
of whites than the low black power scorers? 
The data suggest a strongly affirmative answer 
to this question. 

In both the white and the Negro samples, 
those college students who were high in black 
power orientation placed a lower evaluation 
upon white person and, in the Negro sample 
only, saw white person as closer to enemy than 
did students who were low in black power 
orientation. White and Negro students who 
were high in black power orientation evaluated 
black person more favorably, saw black person 
as further from the concept of enemy, saw 
black person as closer to friend (in the white 
sample only), and saw black person as closer 
to ideal person (in the Negro sample only) 
than did fellow students who were low in black 
power ideology. Morland and Williams (1969) 
noted that while white students tested in 1968 
replicated earlier findings (Williams, 1966), 
their later sample of Negro students rated 
Caucasian less favorably and Negro more 
favorably than before. They attributed this 
difference to “the heightened sense of racial 
identity among Negro Americans [p. 110].” 
The present data indicate that in the last half 
of the 1960s, more was accomplished than a 
mere increase in favorable attitudes toward 
Negroes on the part of Negro students. While 
Williams (1966) expressed pessimism regarding 
the possibility of changing the connotations 
of color-coded ethnic designations “in a culture 
where the symbolism of white as good and 
black as bad is so thoroughly entrenched 
[p. 539],” the process of change seems to have 
made considerable headway among youthful 
black power advocates. Coslin, Cardwell, and 
Pargament (1969) reached a similar conclusion 
on the basis of findings obtained in an in- 
genious study of the ratings of attractiveness 
assigned to pictures of dark and light Negro 
girls by black power advocates as compared 
with integrationists. 


Bennett (1967) suggested that in times of 
crisis and social upheaval there is usually a 
resurgence of the always seething controversy 
regarding the appropriate label for Americans 
of African ancestry. At the time data were 
collected for this study, the young, the militant, 
and the advocates of black power were ex- 
pressing intense opposition to the term 
“Negro.” However, there was no clear agree- 
ment as to what ethnic label would be prefer- 
able. Of 2,000 reader respondents to a poll 
conducted by Ebony Magazine (Ebony, 1968), 
48% preferred “Afro-American,” 23.3% pre- 
ferred “black,” 12% chose “African American,” 
8.1% chose “Negro,” and 3% chose “colored.” 
While the editors were careful to indicate that 
their poll involved no systematic sampling 
procedures, findings from other sources con- 
verge to suggest that their findings at least 
reflected fairly accurately the degree of dis- 
sensus and the lack of any overwhelming 
preference for the term black (see Krystall, 
Friedman, Howze, & Epps, 1968). 

The data from the present study, when 
viewed in the light of these other studies, 
suggest that as of 1968-1969 the major effect 
of black power ideology upon young people 
was the removal of the odious connotations of 
black without giving it clear and unequivocal 
priority over other nonwhite ethnic designa- 
tions. In a study conducted prior to the 
emergence of black power ideology, Williams 
(1966, p. 533) found that white and Negro 
students evaluated black person considerably 
more unfavorably than Negro person on the 
semantic differential. The white and Negro 
students in the present study who were low in 
black power ideology likewise evaluated black 
person slightly (but not significantly) less 
favorably than Negro person. However, the 
students who were high in black power orien- 
tation did not show a statistically significant 
reversal of this trend. They evaluated black 
person only slightly more favorably than 
Negro person. Moreover, while they rated black 
person as significantly closer than Negro 
person or colored person to their concept of 
ideal person, they saw black, Negro, and 
colored persons as equally close to their concept 
of friend. White students who were high in 
black power orientation described Negro, 
black, and colored persons as being equally 
close to their concepts of both friend and ideal 
person. 

In spite of the lack of evidence for any clear 
preferences for one racial term over another, 
among both white and Negro students, each 
racial term appeared to have distinctive 
connotations. Moreover, while there was dis- 
agreement between the races and between high 
and low black power ideology subgroups 
regarding the positive or negative evaluative 
attitude expressed toward each ethnic term, 
there was considerable agreement regarding the 
potency and activity of each ethnic type. 
Blacks were seen as the strongest, most potent, 
and most active. Negroes were viewed as 
slightly less strong and active, whites as still 
less strong and active, with colored persons 
being viewed as the weakest and most passive. 
Though the word “Negro” was once held to 
have the meaning of “slave” by a pre-Civil 
War court (Isaacs, 1963, p. 65), the contention 
of Baird (cited in Bennett, 1967, p. 52) that 
“Negro” evokes a slave image in the minds of 
twentieth-century Americans is questionable 
on the basis of the present findings: Negro 
person was perceived as no weaker than white 
person. Clearly, however, the designation 
“black person” was the most evocative of the 
powerful image which black power advocates 
desire for Afro-Americans. 

Recent history, including publicity about 
black militancy, may have played some role in 
the semantic associations elicited in response to 
black person. Isaacs (1963) has traced in rich 
detail the interplay of historical events and 
social factors determining reactions to various 
racial labels. However, the connotation of 
black as “strong” and “active” antedates the 
black power movement and appears to be 
based on pervasive, cultural-linguistic associa- 
tions. Williams’ (1964) college student subjects 
rated the color black as “strong” and white 
as “weak” in a completely nonracial context. 
Williams and McMurtry (1970) demonstrated 
that these associations of color names and 
traits were well-established in subjects as 
young as 13 years of age. Moreover, Williams 
(1966, p. 533) documented the tendency for 
color-coded ethnic types to be rated in a 
manner consistent with the semantic associa- 
tions elicited by their color label. 

The correlates of high black power orientation 
are much more specific among the Negro 
subjects than among the whites. The Negro 
subjects, regardless of black power orientation, 
all expressed a moderately favorable attitude 
toward Negroes. Previous findings of anti- 
Negro bias among college students (Bayton 
et al., 1965; Maliver, 1965; Steckler, 1957) 
were not replicated, probably because of the 
decreased social acceptability of direct expres- 
sions of anti-Negro prejudice and the lack of 
any provision in the experimental design for 
indirect expressions of anti-Negro bias (e.g., 
by permitting the evaluation of lower-class 
Negroes or other types on whom anti-Negro 
feeling could be projected without being per- 
ceived as constituting bias). The almost 
complete limitation of significant differences 
between groups varying in level of black power 
orientation to the concepts black person and 
white person, on which the ideology focuses, 


is consistent with the interpretation that the 


A . Tirate 
differences represent effects of inte! nalizing 
s 


black power ideology. Since all person concep 


were rated before the administration Brie 
BPIS, there was no 
responding in polarized fashion to black and 
white persons as a result of having ideological 
issues made salient by the external stimulus 
of the BPIS items. Even though the high and 
low black power subjects differed on one 
nonracial concept, with high black power 
subjects conceiving of a ‘friend’ as more 
active than low black power subjects, it seems 
reasonable to qualify the interpretation of the 
direction of probable causality only to the 
extent of suggesting that subjects who valued 
activity and vigorous striving most highly 
were most amenable to action-oriented black 
power ideology. 

However, in the case of the white students, 
the data suggest that the high and low en- 
dorsers of black power ideology are quite 
different types of people, with level of black 
power orientation following from, rather than 
causing, the extensive personality differences. 
The white subjects who were low in black 
power orientation exhibited several reactions 
that have been identified as comprising the 
authoritarian personality syndrome (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford, 1950). 


The greater perfection which these subjects
required of their ideal person, as compared 
with the requirements of their high-black- 
power-oriented peers, would seem to reflect 
the authoritarian’s “rigid adherence to con- 
ventional, middle-class values [Adorno et al., 
1950, p. 248]” and overemphasis upon the 
importance of possessing certain ideal virtues. 
The low-black-power-oriented subject’s ideal 
person had to be more “kind, clean, good, 
honest, pleasant, valuable, fair, brave, friendly, 
and successful” than the ideal person of other 
subjects. The low-black-power-oriented sub-
jects manifested interpersonal attitudes sug-
gestive of the general “misanthropy” which 
Sullivan and Adelson (1954) proposed was one 
of the underlying bases for the ethnic prejudice 
of the authoritarian. These students saw every 
type of person (including fellow whites) as 
closer to enemy and further from friend than 
did their ideologically divergent peers. Their 
more negative attitude toward Negroes, in 
comparison with high-black-power-oriented 
white peers, was partly a simple exaggeration 
of their general malevolence. 

Serious, carefully reasoned objections to 
black power ideology have been made by 
Negro writers and scientists who are committed 
to the cause of full racial equality (Clark, 
1969; Rustin, 1969). Similarly, within the 
Negro sample of this study, the subjects who 
were relatively unfavorable toward black 
power ideology were not more negative toward 
Negroes as a racial group than were the black 
power advocates. They could justify the claim 
that they merely opposed the methods and 
specific black emphasis of black power ideology. 
However, in the white sample, being against 
black power was simply an aspect of being 
somewhat against black people. The data are 
somewhat similar to Lessing and Zagorin’s 
(1969) finding that among white high school 
students, militant methods were rejected 
mainly by those who were opposed to civil 
rights goals per se. One could not, of course, 
completely equate rejection of black power 
ideology by whites with racial prejudice. How- 
ever, it may be that among college student sub- 
jects, the concern of black power ideology with 
problems of identity and autonomy stimulates 
a sympathetic resonance unless racial prejudice 
blocks the empathic response. 


Thus far, the findings have been discussed 
as if they reflected genuine, meaningful atti- 
tudes on the part of the students tested. 
However, it is impossible to avoid the issue 
of whether the students were expressing mere 
verbalisms. Though Poussaint (1969, pp. 
150-151) has described widespread changes in 
Negro behavior which have accompanied the 
growth of the black power movement, the 
present exploratory study did not include the 
collection of any data regarding the behavioral 
correlates of a belief in black power. Still, the 
importance of purely verbal reactions should 
not be discounted. If the students high in 
black power orientation were merely verbal- 
izing pro-black attitudes which did not reflect 
their own deepest convictions nor guide their 
social behavior, then there are both positive 
and negative consequences which could follow. 
Empty rhetoric and hypocritical emphasis 
upon the form, rather than the substance of 
black pride, is likely to breed cynicism and 
increased alienation of Negroes from each 
other. Poussaint has castigated the critics of 
the black power movement “who want to 
condemn the whole movement because they 
.. . saw ‘a black nationalist dating a white 
girl [p. 151].’” However, continuation of the 
old, well-documented pattern of outgroup 
preference and in-group rejection (e.g., see 
Proshansky & Newton, 1968, pp. 186-189) to a 
verbal accompaniment of black pride slogans 
is hardly likely to build group cohesiveness. 
On the other hand, even mere verbalization of 
respect for fellow blacks does reduce at least 
the quantity of negative images available for 
Negroes to internalize into their self-concept 
as representative of the views of significant 
others. (See Mead, 1934, pp. 158-160, for this 
view of the genesis of the self-concept.) There 
is also the further possibility that inner 
attitudes would increasingly match not-yet- 
internalized pro-black attitudes as part of a 
process of “dissonance reduction” (Festinger, 
1957). Janis and King (1954), for example, 
found that individuals who had been induced 
to argue publicly against their beliefs tended 
to shift their private beliefs in the direction of 
greater consistency with their counteratti- 
tudinal, public role playing. 

The present research, which suggests that 
acceptance or rejection of black power ideology 




has verbal attitudinal correlates, is only a 
beginning step toward accumulating much 
needed data regarding the effect of black power 
ideology upon maladaptive behavior patterns 
among Negroes. In view of past research 
demonstrating a complex relationship between 
attitudes and behavior (see Wicker, 1969), 
there is great need for research designed not 
only to delineate differential behavioral corre- 
lates of black power ideology, but also to 
determine whether various particular actions 
are mediated by specific black power beliefs. 
COST, DEPENDENCY, AND HELPING¹


ERIC SCHAPS²
Department of Psychiatry, University of Chicago 


A field experiment examined how helping was affected by dependency and the costs 
of helping. Sixty-four shoe salesmen were randomly assigned to dependency condi- 
tions. In low dependency, accomplices of the experimenter posed as normal shoe 
store customers, while in high dependency they limped into the store with the heel 
of their shoe broken and announced that they could walk no further. In low cost, the 
store was relatively empty, while in high cost the store was fuller. Salesmen work on 
a commission basis, and it is especially costly for them to persist in showing shoes to 
difficult customers (as the accomplices were) when the store is full and other cus- 
tomers can be served. A behavioral index of service by the salesman showed a
Cost X Dependency interaction. Under high-cost conditions, females received a
nonsignificantly lower level of service when they were dependent, whereas
when costs were low, dependency elicited significantly greater service. 


Many studies have shown that dependent 
persons are more likely to be helped than 
nondependent persons (Berkowitz & Daniels, 
1963, 1964; Bryan, 1966; Goranson & Berko- 
witz, 1966; Rosen & Bielefeld, 1967; Schopler 
& Bateson, 1965). However, it does not seem 
that dependency invariably leads to help, since 
in some experiments there have been depen- 
dency conditions where help was not found. 
Since dependency is not sufficient for helping, 
it would seem that a promising research focus 
at present is one of searching for interactive 
variables that determine when dependency 
does or does not cause help. 

Schopler and Bateson (1965), Krebs (1970), 
and Schwartz (1970) have noted that costs 
of giving help were probably low in studies 
which demonstrated a positive relationship 
between dependency and help. What happens 
if the costs of helping increase? While two 
studies (Schopler & Bateson, 1965, Experiment 
II; Wagner & Wheeler, 1969) have shown that 
less help was given to dependent and nonde-
pendent persons when it was costly to give aid,
other studies (Schopler & Bateson, 1965, 
Experiment I; Schwartz, 1970) have failed to 
find this main effect. 
In addition to a main effect of cost, there is 
also the plausible possibility of an interaction
of costs and dependency. The help given
dependent persons is likely to decrease as costs 
rise, and it may decrease either to or below a 
base-line level, defined as the amount of help 
received by less dependent persons. Thus, high
costs may result in either a simple inhibition
of any helping effect or in exploitation of the 
dependent person’s vulnerability as the po- 
tential helper is prompted to pursue his own 
self-interest. Whereas there is no unambiguous 
experimental support in the literature for such 
an interaction (Schopler & Bateson, 1965, 
Experiments I and II; Schwartz, 1970; Wagner 
& Wheeler, 1969), evidence for exploitation of 
dependent persons can be inferred from socio- 
logical studies where the costs of helping are 
embedded in strong conflict of interest situa- 
tions. For example, Caplovitz (1967) and 
Siegel (1965) have pointed out that merchants 
in ghetto areas charge high prices and are con- 
scious of their customers’ poor shopping habits 
and restricted physical mobility. Of course, 
while high prices may be a conscious strategy 
on the part of these merchants, it is also possi- 
ble that they may simply reflect higher operat- 
ing costs. 

The foregoing discussion points to the 
importance of obtaining a Cost X Dependency 
interaction and of specifying the inferences 
that result from the particular form it takes. 
Helping caused by high dependency might 
be inferred if a high-dependency—low-cost 
condition resulted in more help than a low- 
dependency—low-cost condition. The help-in-
hibiting character of high costs could be
inferred if there were less help in high-de- 
pendency-high-cost than in the high-depen- 
dency—low-cost condition. And an exploitation 
effect could be inferred if there were less help 
in the high-dependency-high-cost than in the 
low-dependency-high-cost condition. In order 
to rule out some of the restrictions to external 
validity and interpretation associated with 
laboratory experiments, the Cost X Depen- 
dency prediction was tested in a natural field 
setting using naive subjects, unobtrusive meas- 
ures, and unobtrusive treatments that pre- 
sumably had high experimental realism (Aron- 
son & Carlsmith, 1968). 
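
The three inferences described above amount to three pairwise comparisons among the four cell means. A minimal Python sketch follows. The function name, the dictionary keys, and the example values are hypothetical and are not taken from the study, and only the direction of each difference is checked; an actual analysis would test each difference for significance.

```python
def classify_pattern(means):
    """Report which of the three inferences the four cell means point toward.

    `means` maps (dependency, cost) pairs, each "high" or "low", to mean help.
    Only the direction of each difference is checked, not its significance.
    """
    # Helping: high dependency draws more help than low dependency at low cost.
    helping = means[("high", "low")] > means[("low", "low")]
    # Inhibition: for dependent persons, high cost draws less help than low cost.
    inhibition = means[("high", "high")] < means[("high", "low")]
    # Exploitation: at high cost, dependent persons get less help than nondependent.
    exploitation = means[("high", "high")] < means[("low", "high")]
    return {"helping": helping, "inhibition": inhibition, "exploitation": exploitation}


# Hypothetical cell means, for illustration only (not data from the study).
example = {
    ("high", "low"): 320.0,
    ("low", "low"): 300.0,
    ("high", "high"): 280.0,
    ("low", "high"): 295.0,
}
print(classify_pattern(example))
```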


METHOD 
Subjects 


Subjects were 64 shoe salesmen, each of whom worked 
in a different exclusive women’s shoe store in the 
Chicago Metropolitan area. Exclusive stores were 
chosen because they insist that salesmen wait on only 
one customer at a time. Exclusivity was assessed by 
asking two experienced shoe store managers to name 
exclusive stores, and they jointly identified 46. The 
remaining 18 were selected by calling 60 stores that 
seemed exclusive in order to learn the size and price 
range of their inventories. Twenty-eight were elimi- 
nated because of insufficient inventories or low prices. 
The consulting managers then chose 18 from among the 
32 remaining stores. This made a total of 64 independent 
stores, and these were randomly assigned to treatment 
conditions. 


Independent Variables 


Dependency was manipulated by having the experi- 
mental accomplice enter the store either with or with- 
out a broken heel on one of her shoes. In high-depend- 
ency conditions, the accomplice limped into the store 
with a broken heel and told the salesman that she could 
not walk any farther because of the discomfort. Pre- 
sumably, this comment made the salesman feel that the 
accomplice’s need for shoes was great and that she could 
not easily leave the store without making a purchase. 
In low-dependency conditions (no broken heel), the 
salesman would not have such cognitions. 

Costs were not experimentally manipulated; instead, 
the naturally occurring ratio of customers to salesmen 
in the store was used to determine costs. Low cost was 
defined as one or fewer customers per salesman, and 


high cost, as more than one customer per salesman. 
When there are few customers, the salesman can attend 
to any one customer knowing that there is no potential 
commission to be gained from other customers (low 
cost). However, when unattended customers are wait- 
ing to be served, the salesman may realize that he can 
increase his commissions by prompting a customer to 
make a quick purchase and then waiting on other 
customers (high cost). 

Since the cost ratio changed as genuine customers 
entered and left the store, it was possible for the ratio 
to fluctuate beyond the designated limits in both cost 
conditions. To minimize this problem, accomplices were 
instructed to enter stores in low-cost conditions only 
when the ratio of customers to salesmen was .75 or 
less, and they were instructed to enter in high-cost 
conditions only if the ratio was 1.5 or higher. 
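
The cost definition and the stricter entry rule can be written as two small checks on the customer-to-salesman ratio. The sketch below is illustrative only; the function names and arguments are hypothetical, while the cut-offs (1.0, .75, and 1.5) are the ones stated in the text.

```python
def cost_condition(customers, salesmen):
    """Classify a store by the definition in the text: one or fewer customers
    per salesman is low cost, more than one is high cost."""
    ratio = customers / salesmen
    return "low" if ratio <= 1.0 else "high"


def may_enter(customers, salesmen, assigned):
    """Apply the stricter entry rule given to accomplices: enter a low-cost
    store only at a ratio of .75 or less, and a high-cost store only at a
    ratio of 1.5 or higher."""
    ratio = customers / salesmen
    if assigned == "low":
        return ratio <= 0.75
    if assigned == "high":
        return ratio >= 1.5
    raise ValueError("assigned must be 'low' or 'high'")


# Five customers and four salesmen is high cost by definition (ratio 1.25),
# but falls short of the 1.5 entry threshold, so the team would wait.
print(cost_condition(5, 4), may_enter(5, 4, "high"))
```

The gap between the definitional cut-off and the entry thresholds gives each condition a margin, so that a customer entering or leaving during the visit is less likely to move the store into the other condition.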

A third independent variable was teams. Each female 
accomplice was accompanied by a male accomplice. 
There were eight such teams, and all 16 persons were 
undergraduates at Northwestern University. The one 
criterion for selecting female accomplices was that they 
have common shoe sizes. Only sizes 6½-9, and widths 
AA to B were considered acceptable, since these sizes 
are the most heavily stocked. Selection of male accom- 
plices was unsystematic. 


Procedure 


So that the experiment could be completed before 
the end of the school year, accomplices were allowed 
to visit stores in the sequences most convenient for 
them. Accomplices entered a store when the cost ratio 
was appropriate. The female bore nearly the entire 
burden of interacting with the salesman. In high-de- 
pendency conditions, she first mentioned her discomfort 
and pain, and after that the high- and low-dependency 
procedures did not differ. 

The female asked to see white leather shoes (a style 
category in season at the time and known to be heavily 
stocked). She looked at the first selection of shoes 
brought to her by the salesman and ascertained their 
prices. She used a standard list of memorized excuses 
for turning down all of the shoes first shown to her, 
but she singled out the second most expensive pair to 
be left with her, saying: ‘‘They’re the ones I like best, 
but I’m not really excited by them.” She proceeded to 
turn down all other styles brought to her until the 
salesman ended the interaction by saying that he had 
no more shoes of the kind requested. (Previous experi- 
ence in the shoe trade indicated that this statement does 
not necessarily reflect the exhaustion of appropriate 
styles. Rather, it is the salesman’s most powerful tech-
nique for forcing the customer to choose from among the
available alternatives. Once she has chosen, the sales- 
man is then free to seek further sales and commissions 
if there are customers to be waited on.) 

The male accomplice, obviously a college student, sat 
next to the female, ostensibly reading from an open 
book. He seemed oblivious to his partner’s interaction 
with the salesman, and if the salesman tried to engage 
him in conversation, he replied that he was cramming 
for an upcoming test and could not afford the time to 
judge shoes. Only four or five salesmen initiated con-
versation with male accomplices, and these few cases do
not seem to be systematically related to treatments. The 
task of the male accomplice was to record details of the 
interaction. While it would have been preferable if the 
female had been alone (because the presence of the male 
partner may have inhibited exploitation), pilot testing 
showed that it was impossible for the female adequately 
to perform both a customer role and a data-collector 
role. To make the data-gathering function unobtrusive,
the male noted details of the interaction only when the
salesman was in the stockroom getting more shoes.

In order to minimize the effects of experimenter ex- 
pectancies (Rosenthal, 1966), accomplices were not 
told the true purpose of the experiment until it was 
over. Instead, they were told that the study was con- 
cerned with the salesmen’s “long-term-memory proc- 
esses in the normal course of occupational experience.” 
The cost manipulation was explained as an attempt 
to ascertain the effects of pressure and distraction on 
memory, while the dependency manipulation was ex- 
plained as an attempt to ascertain the effects of an 
atypical event on memory. Accomplices therefore be- 
lieved that the salesmen would be later contacted to 
determine their recall of details of the interaction (the 
records of which were being kept by the male accom- 
plice). At the end of the experiment, 10 of the experi- 
mental accomplices reported believing this explanation. 
The remaining 6 thought that they, the accomplices, 
were the subjects of the experiment and that the 
postvisit questionnaire (see Dependent Variables) was 
the real focus of interest in the study. No accomplice 
reported any explanation even remotely close to the 
real purpose. 


Dependent Variables 


The major dependent variables of the study were 
related to the service provided by the salesmen. The 
number of pairs shown, the time spent in serving, and 
the number of trips to the stockroom for additional 
shoes were the three service measures. As expected, 
they were highly correlated. The range of their inter- 
correlations was from .49 to .63, and each was signifi- 
cantly different from zero with all ps < .001. A service 
index was therefore constructed by converting each 
service variable to z scores with a mean of 100 and a 
standard deviation of 20. The three z scores were then 
simply summed.³
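
The construction of the service index can be sketched in a few lines of Python. Everything specific here is an assumption made for illustration: the function names, the use of the population standard deviation (the text does not say which form was used), and the four rows of example data, which are invented and are not the study's observations.

```python
from statistics import mean, pstdev


def rescale(values, new_mean=100.0, new_sd=20.0):
    """Standardize `values` and rescale to the stated mean and SD.

    Assumption: the population SD is used; the source does not specify.
    """
    m = mean(values)
    sd = pstdev(values)
    return [new_mean + new_sd * (v - m) / sd for v in values]


def service_index(pairs_shown, minutes_served, stockroom_trips):
    """Sum the three rescaled service measures for each salesman."""
    columns = [rescale(c) for c in (pairs_shown, minutes_served, stockroom_trips)]
    return [sum(scores) for scores in zip(*columns)]


# Invented example data for four salesmen (illustration only).
pairs = [8, 12, 15, 9]
minutes = [20, 35, 40, 25]
trips = [2, 4, 5, 3]
index = service_index(pairs, minutes, trips)
print([round(x, 1) for x in index])
```

Because each rescaled measure has a mean of 100, the summed index has a mean of 300 across salesmen. Rescaling before summing gives the three measures equal weight regardless of their original units.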

A final dependent variable was a self-rating of the 
tension experienced by female accomplices during the 


3 An attempt was made to measure the price of shoes 
shown. But since it was implausible to ask after the 
price of every pair shown, indirect and probably un- 
reliable methods of assessing price were used. More- 
over, there is little variability in the price range of 
shoes within a particular style category in most ex- 
clusive shoe stores. Since the salesman’s commission is 
typically 10%-12%, there is little to be gained from 
selling a more expensive pair of shoes to the customer. 
More is gained by finding other customers. Thus, it 
was not anticipated that price would vary with treat- 
ments, and it did not. See Schaps (1970) for further 
details.

interaction. These ratings were made immediately
after leaving the store. Their purpose was not known 
to the accomplices, and was to ascertain, albeit in- 
directly, whether the interaction had closed because 
the salesman had wanted to force a decision to purchase 
or whether it had closed because the salesman had read 
signs of distress on the female accomplice’s face. The 
interaction could have been full of tension for the 
accomplice, since she was to leave without making a 
purchase after seeing many pairs of shoes. Though ex- 
tensive pilot testing and training (over a 2-month 
period) had offered assurance that this problem was 
minimized, the postvisit ratings enabled a test of 
whether the various experimental conditions evoked 
different degrees of tension. 


Data Analysis 


The overall design was a 2 X 2 X 8 factorial, with 
2 levels of cost, 2 levels of dependency, and 8 teams of 
accomplices. It was planned that there would be two 
stores in each of 32 cells. But one store had to be
reassigned from low to high cost because the ratio rose
well above the appropriate low-cost limit soon after the
experimental team entered, and remained at a high level
until the interaction was completed. Thus, there were
30 cells with two scores, 1 cell with three scores, and 1
cell with only a single score.

The design permitted two kinds of statistical analysis.
The first was a totally between-subjects analysis in
which the error term was provided by the differences
between the salesmen in the 64 stores. The second
utilized the fact that each team experienced every cost
and dependency condition, and so the effects of cost
and dependency were tested against error terms based 
on the variance within the teams. 
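
The cell structure of the design can be enumerated directly. The sketch below is illustrative: the variable names are hypothetical, and because the text does not say which team or dependency condition the reassigned store belonged to, the cell used for the reassignment is an arbitrary placeholder.

```python
from collections import Counter
from itertools import product

costs = ("low", "high")
dependencies = ("low", "high")
teams = range(1, 9)  # eight teams of accomplices

# Planned design: 2 x 2 x 8 = 32 cells with two stores each.
cells = list(product(costs, dependencies, teams))
planned = {cell: 2 for cell in cells}
print(len(cells), sum(planned.values()))

# One store moved from low to high cost. Team 1 and high dependency are
# placeholders; the source does not identify the cell involved.
actual = dict(planned)
actual[("low", "high", 1)] -= 1
actual[("high", "high", 1)] += 1
print(Counter(actual.values()))
```

The resulting tally of cell sizes matches the description in the text: 30 cells with two scores, 1 cell with three, and 1 cell with one.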


RESULTS AND DISCUSSION 


Cost X Dependency Interaction 


Was there a Cost X Dependency inter- 
action? Table 1 presents the appropriate 
analysis of variance as well as the F values 
derived from the analyses based on the be- 
tween-salesmen and within-teams error terms. 
It can be seen that the interaction was signifi-
cant at the .06 level⁴ in one analysis and at
the .01 level in the other. The interaction is 
depicted in Figure 1. 
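
The two F values for the interaction can be recovered from the mean squares in Table 1 by dividing by the appropriate error term. The sketch below is only an arithmetic check; the dictionary keys and function name are hypothetical, and the mean squares are copied from the table.

```python
# Mean squares copied from Table 1.
MS = {
    "cost": 11194.39,
    "teams_x_cost": 2551.56,
    "cost_x_dependency": 7865.87,
    "teams_x_cost_x_dependency": 613.67,
    "between_salesmen_error": 2006.32,
}


def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by the error mean square."""
    return ms_effect / ms_error


# Within-teams analysis: each effect is tested against its interaction with teams.
fa_interaction = f_ratio(MS["cost_x_dependency"], MS["teams_x_cost_x_dependency"])
fa_cost = f_ratio(MS["cost"], MS["teams_x_cost"])

# Between-salesmen analysis: each effect is tested against the between-salesmen error.
fb_interaction = f_ratio(MS["cost_x_dependency"], MS["between_salesmen_error"])

print(round(fa_interaction, 2), round(fa_cost, 2), round(fb_interaction, 2))
```

The within-teams F for the interaction is much larger than the between-salesmen F because the Teams X Cost X Dependency mean square is small relative to the between-salesmen error.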

What do the contrasts tell us about the 
forces that mediated the interaction? Con- 
trasts were computed using the conservative 
between-salesmen error term, and they permit 
four conclusions. First, there was the predicted 
helping when the accomplice was dependent 
and costs were low, for the dependency con- 
ditions differed within low cost (t = 2.52, 


4 All tests are two-tailed. 
TABLE 1
ANALYSIS OF VARIANCE OF SERVICE INDEX

Source                    df        MS       Fa          Fb
Within teams              31    3172.82
  Teams (A)                7    3925.72                  1.96
  Cost (B)                 1   11194.39    4.39*         5.58***
  A X B                    7    2551.56                  1.27
  Dependency (C)           1     868.04    <1            <1
  A X C                    7    4113.23                  2.05
  B X C                    1    7865.87   12.82****      3.92**
  A X B X C                7     613.67                  <1
Between-salesmen error    32    2006.32
Total variance            63    2580.31

a F computed from the within-teams error term that is
listed directly below the effect being tested.
b F computed from the within-cells error term due to the
difference between salesmen.
* p < .10, df = 1/7.
** p < .06, df = 1/32.
*** p < .05, df = 1/32.
**** p < .01, df = 1/7.


p < .01). Second, it is clear that less help was 
given to a dependent other when the costs of 
giving help increased, for the cost conditions 
differed within high dependency (t = 4.62, 
p < .001). Third, there was no clear exploita- 
tion effect, for the two dependency conditions 
did not significantly differ within high cost. 
And last of all, the two cost conditions did 
not differ within low dependency, indicating 
that a reduction of costs is probably not suffi- 
cient to induce helping in the absence of some 
stimulus (like dependency) that can arouse 
motivation to help. 


Alternative Explanations 


Accomplices visited stores in whatever 
sequence they liked, and it is possible that 
they visited stores in the deviant high- 
dependency-low-cost cell at a later date than 
they visited stores in other conditions. If they 
did, and if the sequence of visits affects service, 
then the helping effect may have been caused 
not by dependency but by the sequence of 
visits. To examine the possibility, a four- 
level sequence variable was constructed by 
placing the first two visits of each team into 
one block, the third and fourth visits of each 
team into a second block, and so on until four 
blocks were formed. The blocks of visits were 
then entered into an analysis of variance to- 
gether with costs and dependency. The se- 
quence of visits did not affect the service index 
in any way. 
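
The blocking of visits can be expressed as a simple mapping from a team's visit order to a block number. The function name is hypothetical, and the sketch assumes eight visits per team (64 stores divided among 8 teams).

```python
def sequence_block(visit_number, visits_per_block=2):
    """Map a team's visit order (1, 2, 3, ...) to its sequence block."""
    if visit_number < 1:
        raise ValueError("visit_number must be 1 or greater")
    return (visit_number - 1) // visits_per_block + 1


# Assumption: each team made eight visits, giving four blocks of two.
blocks = [sequence_block(v) for v in range(1, 9)]
print(blocks)
```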

The rationale for the study was predicated 
on the salesman’s terminating the interaction. 


Since the interaction may have had different 
properties across experimental conditions, it 
may have been the accomplices and not the 
salesmen who signaled, however covertly, that 
they wanted the interaction to close. The ex- 
tensive preexperimental training was designed 
to rule out this possibility, as were the self- 
report tension data from the female accom- 
plices. We might expect the accomplices to 
report little tension if they had inadvertently 
wanted to prolong the interaction and high 
tension if they had wanted to terminate it. 
The tension ratings were analyzed in 2 X 2 
analyses of variance (Cost X Dependency), 
and none of the resulting F values was greater 
than one. Furthermore, the teams were split 
at the median into those who scored high and 
low on tension ratings, and this variable was 
analyzed together with cost and dependency 
in 2 X 2 X 2 analyses of variance; there were 
no effects of tension. Hence, these analyses 
provide indirect support that it was not the 
accomplices who terminated the interaction. 
They also indirectly suggest that reactance 
(Brehm, 1966) did not occur among salesmen 
in the high-dependency-high-cost condition.

The cost of helping was not directly manip-
ulated. It is possible, therefore, that some
variable correlated with low cost (an empty 
store) may have caused the helping effect. 
For instance, stores are more likely to be empty 
on weekdays than on Saturdays, and they are 
more likely to be empty during midmorning 
or midafternoon than at lunchtime or in the 
early evening. But any alternative interpreta- 
tion of this kind must explain why an empty 


Fig. 1. Service index as a function of whether a heel
was or was not broken (dependency) and whether the
store had less than .75 customers per salesman or more
than 1.50 (cost).


store facilitated helping when the accomplice 
was dependent and failed to facilitate helping 
when she was not dependent. That is, an alter- 
native interpretation which invokes artifacts 
related to cost logically has to involve an arti- 
fact that interacts with dependency. No such 
artifact comes to mind. 


Significance of the Present Findings 


It has been frequently speculated that the 
cost of giving help is an important deter- 
minant of helping. Yet, the literature provides 
inconclusive support either for a main effect of 
cost or for a Cost X Dependency interaction. 
What the present field experiment makes clear 
is that high costs inhibit the help given to a 
dependent female, and that lowering the costs 
of helping will not produce a strong aiding 
response unless there is some cue, such as 
dependency, that activates a social responsi- 
bility norm. Taken in conjunction with the 
experiments of Daniels and Berkowitz (1963) 
on dependency, liking, and help, and in con- 
junction with the experiment of Gruder and 
Cook (1970) on dependency, sex, and help, 
the present study of dependency, costs, and 
help suggests that dependency is most likely 
to elicit helping when a dependent adult is 
female, is not disliked, and when it is not costly to 
give aid. When dependent adults are male, or 

are disliked, or when giving help is costly, any 
social responsibility norm that facilitates help-
ing may be at best weakened or at worst
neutralized. 

The suppression of helping was not of suffi- 
cient magnitude to be interpretable as exploita- 
tion, since the dependency conditions did not 
differ within high costs. But the trend was in 

the direction of an exploitation effect similar 
to the one that has been inferred in nonexperi- 
mental studies of poor people in urban settings. 
An exploitation effect might have been more 
likely in the present study had there been no 
male observer or had there been a larger 
or more salient negative correlation between 
the dependent person’s outcomes and those of 
the potential helper. However, it must be 
stressed that such exploitation would not be the 
result of interpersonal hostility or the like. 
It would be the result of a social and economic 
situation that makes it possible for one person 
to pursue his own self-interest only at another's 
expense, and of a power difference that makes 
the dependent person particularly vulnerable 
to exploitation. 
TOWARD A PSYCHOLOGICAL THEORY OF CROWDING¹


J. A. DESOR²


Cornell University 


A broad psychological correlate of population size, overall level of social stimulation, 
was postulated as the variable controlling human judgments of crowding. Seventy 
human subjects were presented with scaled-down rooms and human figures and 
asked to place as many people as possible in the rooms without overcrowding them. 
Room area was constant, while architectural features affecting interpersonal percep-
tion (partitions, disparity of linear dimensions, number of doors) varied. Results
support the theory that “being crowded” is reception of excessive social stimulation 
and not merely a lack of space. Secondary results indicate one’s criterion of crowding 
varies with ongoing activity. No replicable sex differences were observed. 


“Crowded” is defined by Webster (1970) 
as “(1) filled with people or things; packed. (2) 
packed too full. (3) close together; inconve- 
niently lacking room [p. 436].” Psychologists 
have not expanded significantly upon Web- 
ster’s definition in spite of the obvious need 
for interjecting experimentally based advice 
into decisions currently being made with re- 
spect to urban crowding and overpopulation. 
Research to date has concentrated on correlat- 
ing changes in physiology and behavior with 
increases in the size of animal populations. 
There is ample evidence that animals do re- 
spond to the size of their population and re- 
spond pathologically when numbers are high 
(e.g., Calhoun, 1962; Christian, 1959, 1961, 
1963). The observed pathologies are ones that 
tend to reduce population size and are consid- 
ered natural mechanisms of population control 
(Christian, 1961; Wynne-Edwards, 1962). 
Based on this, one might apply the word 
“crowded” to “that population size at which 
pathologies tending to reduce the population 
appear.” Christian (1963) advanced the theory 
that these changes are responses to the psych- 
ological stress of high levels of “intraspecific 
competition, social strife, sociopsychologic 
pressures, or whatever other comparable des- 
ignation one may choose to use for the inter- 
actions between mammals in the same popula- 
tion [p. 326].” One might argue that “overall 


1 This research was submitted to the Graduate
Faculty of Cornell University in partial fulfillment of
the degree of Doctor of Philosophy. 

2 Requests for reprints should be sent to J. A. Desor, 
who is now at the Monell Chemical Senses Center, 
3500 Market Street, University of Pennsylvania, 
Philadelphia, Pennsylvania 19104. 
level of stimulation from social sources” be the 
designation used until experimentation implies 
a more specific one. This broad term describes 
a rather direct and obvious psychological cor- 
relate of population size and suggests a testable 
psychological theory of crowding. The theory 
being considered is that “being crowded” is 
“receiving excessive stimulation from social 
sources.” It is the purpose here to provide an 
experimental link between this theory derived 
from population biology and whatever it is 
people mean when they use the word that found 
its way into Webster’s dictionary. The hy- 
pothesis here is that subjects’ judgments of 
crowded are controlled by overall level of 
social stimulation. Further, subjects will judge 
a situation to be “less crowded” if the level of 
social stimulation is reduced. 

An area of applied relevance amenable to 
testing this hypothesis is architectural design. 
Any architectural feature of a space that re- 
duces interpersonal perception within that 
space should reduce the level of crowding 
there. Design features in this category include
use of sound-absorbing materials for walls and 
ceiling, partitioning, minimizing the number of 
mirrors, doors, and windows that view “peo-
pled spaces,” maximizing the disparity between
the linear dimensions of the space, use of low 
lighting levels and high nonsocial noise levels, 
and any other feature that a designer can con- 
trol which interferes with the transmission of 
stimulating energy within the space. Obvi- 
ously, increasing the amount of space per per- 
son, either by decreasing the number of people 
or enlarging the space, will permit reduction of 
interpersonal perception. Decreasing the den-
sity of people is by no means the only method of
alleviating crowding if the proposed theory is
correct. This is somewhat contradictory to the 
commonsense notion as reflected by Webster 
that being crowded is merely a matter of having 
insufficient space per person or “inconveniently 
lacking room.” 

The three design features selected for testing 
were partitions, number of doors, and disparity 
of linear dimensions. Clearly, partitioning insu- 
lates those on one side from those on the other. 
Further, the more insulating the material used 
for partitioning, the more effective it should be 
in reducing crowding in the space. A door can 
be viewed as a hole or break in a partition, and 
the greater the number of such holes, the less 
effective the partition in reducing crowding. 
For the third, disparity in linear dimensions, 
it can be shown mathematically that as the di- 
mensions of a plane area are made more dis- 
parate, the mean distance between pairs of 
points, summed over all pairs, increases. Thus, 
with all else constant, making a room more 
rectangular will increase the mean interper-
sonal distance and thereby reduce overall level
of interpersonal perception. Specific experi- 
mental hypotheses are: When presented with 
partitioned and unpartitioned spaces, two- 
door and six-door rooms, or rectangular and 
square rooms, all of equal area and containing 
the same number of people, subjects will indi- 
cate the first of each pair is the “less crowded.” 
If Webster’s more narrow connotation is cor- 
rect, and judgments of crowding are controlled 
by the amount of space per person, then all 
six rooms will be judged as equally crowded. 
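
The geometric claim behind the third hypothesis can be checked numerically. What follows is a minimal sketch and not part of the original report: it approximates the mean distance between pairs of points in a 35 × 15 foot rectangle and in a square of equal area by averaging over a grid of cell midpoints. The function name, the grid resolution, and the choice of Python are assumptions made for illustration only.

```python
import math
from itertools import combinations


def mean_pair_distance(width, height, n=20):
    """Approximate the mean distance between pairs of points in a
    width x height rectangle, using an n x n grid of cell midpoints."""
    points = [((i + 0.5) * width / n, (j + 0.5) * height / n)
              for i in range(n) for j in range(n)]
    total = 0.0
    count = 0
    # Average the Euclidean distance over every distinct pair of grid points.
    for (x1, y1), (x2, y2) in combinations(points, 2):
        total += math.hypot(x2 - x1, y2 - y1)
        count += 1
    return total / count


AREA = 35 * 15           # square feet, the rectangular room dimensions
SIDE = math.sqrt(AREA)   # side of a square room of equal area

rect_mean = mean_pair_distance(35, 15)
square_mean = mean_pair_distance(SIDE, SIDE)
print(f"rectangle 35 x 15 ft: mean pair distance {rect_mean:.2f} ft")
print(f"square {SIDE:.1f} x {SIDE:.1f} ft: mean pair distance {square_mean:.2f} ft")
```

With area held constant, the rectangle yields the larger mean distance, which is the direction the hypothesis requires. The grid average is only an approximation of the continuous mean, and it says nothing about how subjects actually arranged the figures.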


GENERAL METHOD 


In order to present the alternatives to the subjects 
for judgmental responses with minimal extraneous vari- 
ability, scaled-down rooms and small figures represent- 
ing people were used. Rooms were as simply constructed 
as possible and were devoid of roomlike features except 
those being tested. To give the subjects maximum free- 
dom to express their criterion of crowded, each was pre- 
sented with empty rooms of the relevant types, a supply 
of figures, and instructions to “place as many people as 
you can here without overcrowding them.” It was as- 
sumed that these instructions would generate a criterion 
of “crowded” for each subject that he would apply con- 
sistently across rooms. Presumably, the addition of one 
more figure into a subject's display would push the 
situation over his cutoff between “uncrowded” and
“crowded” or “tolerably crowded” and “overcrowded.” 
Differences in number of figures placed were taken as a 
reflection of the effect on crowding of the architectural
feature varied. If more figures were placed in one room
than another in filling them to their maximum un- 
crowded capacity, then it was assumed that the features 
characteristic of that room reduced crowding. 

Type of room and ongoing activity assumed were con- 
trolled. In all cases, the room was termed “a lounge 
area.” All hypotheses were tested across four activities 
to test their generality. Two predominantly interactive
activities and two predominantly coactive activities
were selected. One of each of these is typically carried
out while standing; the other, while sitting. Instructions
for the four specified (a) “standing and talking at a 
cocktail party,” (b) “in an airport where people are 
standing and waiting between flights,” (c) “sitting 
around talking and relaxing,” and (d) “sitting and 
reading.” In all cases, instructions were printed on 
cards accompanying the rooms and read: 


Imagine this is a lounge area (where people are stand-
ing and talking at a cocktail party, in an airport where
people are standing and waiting between flights, etc.).
Place as many people as you can here without over-
crowding them. That is, imagine you are one of
the people (at the cocktail party, waiting between
flights, etc.), and put as many people as you can
here without making it too crowded.


This method of presenting scaled-down rooms and
figures with printed instructions defining the situation 
was used to replicate the natural observations of 
Sommer on preferred distance and orientation between 
conversing dyads (Sommer, 1961, 1962) and orientation 
of interacting and coacting dyads (Sommer, 1965). Con-
clusions reached using this method were identical to
those drawn by Sommer, and results from this ab-
stracted situation differed no more from field results
than the latter differ from each other (Desor, 1970).


STUDY I: EFFECT OF PARTITIONS
Method 


Subjects. Subjects were 10 male and 10 female 
undergraduate students.

Apparatus. Sixteen scaled rooms were constructed of 
beige fiberboard. They represented rooms 35 feet long 
and 15 feet wide with walls 8 feet high. There was one 
door centered in each of the shorter walls. Scale: 18.13 
inches of real distance was represented by 1 inch. 
Twelve of these rooms were bisected by partitions 
paralleling the shorter walls. Four were divided by beige 
fiberboard barriers half as high as the walls. Four were
divided by walls made of Plexiglas, and four were 
divided by walls made of beige fiberboard. The floor of 
each was covered with an unmarked sheet of white 
paper. Partitions were positioned so that the paper 
could be placed under them and removed easily. 

A large supply of figures representing people 5 feet 
8 inches tall were available. Each figure was 3.75 inches 
high and was constructed by fitting a rounded-head 
clothespin with arms made of pipe cleaners. A straight 
pin was glued to the base of each figure so that it stood 
erect when pushed into the fiberboard floors of the 
rooms described above. 
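
The stated scale can be cross-checked against the stated dimensions. The short sketch below is an illustration and not part of the original apparatus description: it converts the real-world lengths given above into model inches at 18.13 inches per model inch. The helper name and the derived model dimensions of the rooms are assumptions computed from the text, not reported measurements.

```python
SCALE = 18.13  # inches of real distance per model inch, as stated in the text


def feet_to_model_inches(feet, inches=0.0):
    """Convert a real-world length (feet plus inches) to model inches."""
    return (feet * 12 + inches) / SCALE


room_length = feet_to_model_inches(35)     # 35-foot wall
room_width = feet_to_model_inches(15)      # 15-foot wall
wall_height = feet_to_model_inches(8)      # 8-foot wall height
figure_height = feet_to_model_inches(5, 8) # a person 5 feet 8 inches tall

print(f"room: {room_length:.2f} x {room_width:.2f} in, walls {wall_height:.2f} in")
print(f"figure: {figure_height:.2f} in")
```

The computed figure height agrees with the 3.75-inch figures described, which suggests the scale was chosen so that the clothespin figures represented people 5 feet 8 inches tall.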
The 16 scaled rooms were placed on tables around
the perimeter of a large laboratory. They were oriented
with the longer walls parallel to the table edge nearest
the subject. Each was accompanied by a card with in-
structions printed on it. Each room type was paired
once with each of the four sets of instructions described
above (see General Method). Order of presentation was
balanced across subjects.

Subjects were tested individually. Upon entering, 
each was told “Imagine these [pointing] are rooms and
those [indicating] are people. Read the instructions that
are with each room and follow them.” He was then left
undisturbed. After the subject had completed five
rooms, the experimenter began recording data at the
first room and lagged approximately five rooms behind
the subject throughout. Number of figures placed was
recorded, and marks indicating the position and orien- 
tation of figures were made on the recording sheet 
covering the floor. 


Results


Partition differences accounted for a signifi- 
cant part of the variance in number of figures 
placed (F = 7.36, df = 3/54, p < .001). Com- 
parisons between means indicated that fewer 
figures were placed in unpartitioned rooms than 
in partitioned rooms (F = 15.7, p < .05),
but the numbers placed in the three types of 
partitioned rooms did not differ from each 
other. More figures were placed when inter- 
acting activities were assumed (F = 17.7, df 
= 1/18, p < .001) and when figures were as- 
sumed to be standing (F = 29.3, df = 1/18, 
p < .001). Though there were no overall sex 
differences, there was a Sex × Interact/
Coact interaction (F = 13.1, df = 1/18, p
< .001). The main effect for interacting versus
coacting activities was marked for females and 
minimal for males. Mean number of figures 
placed by each subject ranged from 10.5 to 
[illegible].


Discussion 


It appears that partitioning an enclosed
space reduces the degree to which people in the 
space are crowded. No differences were found 
between the three types of partitions used, 
however. A waist-high barrier, a glass wall, and 
a solid wall were equally effective in reducing 
the level of crowding as judged in this situation.
It may be that the differences in perceptual iso-
lation offered by the three were not as salient
in the scaled-down displays as they would be in
real situations. A second possibility is that some
characteristic of the displays other than the
insulating qualities of partitions resulted in the 
subjects placing more figures in partitioned 
spaces. The most significant variable con- 
founded with partitioning was traffic flow 
through the space. Unpartitioned rooms had 
one door in each of the shorter walls, thus al- 
lowing people to pass through the room. When 
partitions were added, two adjacent rooms, 
each having one door, were created. Subjects
were placing figures in large rooms through 
which traffic could flow and pairs of smaller 
rooms in which traffic was restricted. Further 
tests of partitioning in which traffic flow was 
controlled were conducted and are presented 
below. 


STUDY II: EFFECT OF ROOM
SIZE (PARTITIONING)


The difference between partitioned and un- 
partitioned spaces is the difference between 
having several small rooms and having one 
large room. To say that partitioning permits 
greater numbers of people to be accommodated 
with no increase in the level of crowding is to 
say that people can be placed at greater densi- 
ties in smaller rooms without increasing the 
level of crowding. To supplement Study I, the 
effects of partitions were tested in another way, 
by varying room size. Traffic variables were 
eliminated by using single rooms, each with 
one door. If judgments of crowding are con- 
trolled by the amount of space per person, 
then figures should be placed at equal densities 
in all rooms. Alternatively, if the controlling 
variable is overall level of social stimulation, 
then figures should be placed at greater densi- 
ties in smaller rooms. 


Method 


Subjects. Subjects were 10 male graduate students, 10 
male undergraduate students, and 10 female under- 
graduate students. 

Apparatus. A total of five rooms as described in Study
I were used in two tests. One pair represented rooms 14
feet × 34 feet and 14 feet × 17 feet. These were accom-
panied by the “cocktail party” instructions described
previously. The other three represented rooms with
floor dimensions of 35 feet × 15 feet, 23.3 feet × 15 feet,
and 11.6 feet × 15 feet. These three were accompanied
by “sitting and reading” instructions. Supplies of figures
as described in Study I were available.

The 10 graduate students were presented with the
pair of rooms accompanied by cocktail party instruc-
tions. The 20 undergraduates were given the set of
three rooms with areas in the ratio 1:2:3. Otherwise, 
the procedure was identical to that in Study I. 


Results 


For the first pair of rooms, the mean numbers 
of figures placed were 34.1 in the large room 
and 18.9 in the room half as large. The first 
mean is less than double the second (t = 2.43,
p < .05).

For the set of three rooms, the mean num- 
bers of figures placed were 17.6 (large room), 
11.6 (room two-thirds as large), and 7.9 (room 
one-third as large). Mean density placed in the 
smallest room was greater than that placed in 
the medium room (t = 3.3, p < .005) and in 
the large room (t = 3.8, p < .005). Mean den-
sities placed in the large and medium rooms did
not differ from each other, but 13 of the 20 sub-
jects placed figures at higher densities in the
medium room (SD = 1.4, p < .08).
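
The reported means can be converted to densities to make the comparison concrete. The sketch below is an illustration only: it divides each reported mean number of figures by the floor area given in the Apparatus section. These are densities of the group means, not means of per-subject densities, so they need not reproduce the per-subject statistics reported above; the room labels are hypothetical names introduced here.

```python
# (label, floor area in square feet, reported mean number of figures placed)
rooms = [
    ("pair, large", 14 * 34, 34.1),
    ("pair, small", 14 * 17, 18.9),
    ("three, large", 35 * 15, 17.6),
    ("three, medium", 23.3 * 15, 11.6),
    ("three, small", 11.6 * 15, 7.9),
]

# Figures per square foot for each room, computed from the group means.
densities = {label: mean / area for label, area, mean in rooms}

for label, area, mean in rooms:
    print(f"{label:14s} area {area:6.1f} sq ft  mean {mean:5.1f}  "
          f"density {densities[label]:.4f} per sq ft")
```

On these figures the smaller room in the pair and the smallest room in the set of three carry the highest densities, consistent with the direction of the tests reported above.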


Discussion 


A clear difference in the density judged 
“maximum uncrowded” appeared when a small 
room was compared to rooms double and triple 
its size. A similar difference did not appear 
when comparing a room with one two-thirds its 
size, although there is some evidence for a
trend in that direction. Where differences were
observed, they were consistent with the parti-
tion effect observed in Study I. These cannot
be attributed to differences in traffic flow and 
support the original hypothesis that parti- 
tioning reduces crowding through perceptual 
isolation. 


STUDY III: EFFECT OF LINEAR DIMENSIONS
AND NUMBER OF DOORS 


Method 


Subjects. Subjects were 10 male and 10 female un-
dergraduate students.

Apparatus. Twelve scaled rooms of the type de- 
scribed in Study I were used. Four were identical to the 
unpartitioned rooms used there. Four differed from these 
only in that they had four additional doors, two in each 
of the longer walls. The remaining four rooms were 
equal in area to the rest, but were square rather than 
rectangular. Each of these had two doors oriented to- 
ward the subject in the same way as those in the two- 
door rectangle. Other than differences in the room types 
used, the experimental situation was identical to that 
used in Study I. 


Results 


Differences attributable to architectural 
features were significant (F = 9.2, df = 2/36, 
p < .001). More figures were placed in two- 
door rectangular rooms than in two-door 
square rooms (F = 9.3, p < .05) and more 
were placed in two-door rectangles than in six- 
door rectangles (F = 17.0, p < .01). More 
figures were placed when interacting activities 
were assumed (F = 6.0, df = 1/18, p < .05) 
and when figures were assumed to be standing 
(F = 24.6, df = 1/18, p < .001). There were 
interactions between room type and the in- 
teract versus coact variable (F = 4.4, df = 
2/36, p < .05) and between these two and the 
stand-sit dimension (F = 3.4, df = 2/36, p 
< .05). This was due to irregular effects when 
square rooms were used. The coact versus in- 
teract effect was small for square rooms. Fur- 
ther, when sitting and reading was assumed, 
the mean number of figures placed was slightly 
greater in square rooms than in either of the 
rectangular rooms. No sex differences were ob- 
served. Mean number of figures placed by 
individual subjects ranged from 9.8 to 49.4.


Discussion 


Both effects predicted by the theory that 
being crowded is reception of excessive social 
stimulation were supported by the data. Mak-
ing linear dimensions of a room more disparate
and decreasing the number of doors resulted 
in more figures being accommodated when 
level of crowding was judged “maximum 
uncrowded.” 


CONCLUSIONS 


Three fairly minor variations on a two-door 
rectangular space resulted in subjects placing 
different numbers of figures in that space when 
instructed to “place as many people as possi- 
ble without overcrowding them.” Cutting four 
more doors in the walls, adding a bisecting par- 
tition, or squaring the area resulted in different 
densities being judged “maximum uncrowded.” 
Subjects allocated less space per person when 
partitions were added, number of doors was 
less, or area was rectangular rather than 
square. Further, space allocation per person 
was less when room area was reduced by one-
half or two-thirds. All of these effects were pre-
dicted by assuming only that judgments of
crowded are controlled by overall level of
social stimulation.

The commonly held notion that crowding is 
reducible by providing more space was sup-
ported incidentally. Subjects clearly judged that
large rooms hold more people without over- 
crowding them than do smaller rooms. Thus, 
although judgments of crowding varied with 
space, they reflected that crowding is much 
more than merely a matter of space. One might 
conclude that Webster’s definition of crowded 
be amended to read “close together; inconve- 
niently lacking room and other means of re- 
ducing perception of others.” 

To briefly discuss deviant results, predicted 
differences between types of partitions used 
were not found. It is possible that they will ap- 
pear in more sensitive tests for them. Alterna- 
tively, research directed at what constitutes 
a socially acceptable “barrier” to interaction 
may show that waist-high barriers, glass walls, 
and solid walls are functionally equivalent. 
Perhaps norms are such that people on the op- 
posite side of anything even suggesting segre- 
gation can be ignored, “filtered out,” deper- 
sonalized, or however else one wishes to express 
psychological nonperception of others. Though 
a few unexpected interactions appeared, those 
tested more than once were not replicable. 
The only one not retested was the one reflecting
deviance when square rooms were used.

Secondary results indicated that density 
considered maximum uncrowded varies with 
ongoing activity. From highest to lowest den- 
sity, the four activities were ordered cocktail 
party (standing), waiting in an airport (stand- 
ing), sitting and talking, and sitting and read- 
ing. These differences suggest that crowding 
is a far more complicated area than implied
above. Though level of social stimulation ap-
pears to control judgments of crowding when
other things are held constant, it seems those 
“other things” are at least as important in 
determining when a situation is crowded. Un- 
raveling the variables relevant to crowding that 
differentiate activities poses a rather formid- 
able challenge for future research. 

Individual differences were large. Since one’s 
criterion of crowded probably reflects his 
tolerance of conspecifics, it is possible that the 
method used here can be applied to studying 
individual differences in tolerance for and 
responses to crowding, as well as related 
variables.

CONSENSUS AND MODALITY IN THE ATTRIBUTION PROCESS:
THE ROLE OF SIMILARITY AND INFORMATION¹

After groups of subjects judged the relative success of two students, an exchange of 
evaluations was arranged. Half of the subjects were led to believe that they would 
get an evaluation from a group member who had the same information about the 
students; half, that the group member had different information. Subjects were
also led to believe either that the group member was similar or dissimilar in terms
of his interpersonal orientation and style of judging other people. Then subjects
received agreeing or disagreeing evaluations. A Similarity × Information within
Agree interaction supported the attribution theory predictions that when the other
had the same information, his agreement would increase confidence more if he
were dissimilar than if he were similar, but when the other had different informa-
tion, his agreement would increase confidence more if he were similar. A statisti-
cal trend suggested that similar disagreers reduced confidence more than dissimilar
disagreers.


Whenever a person makes a judgment— 
about himself, about another person, or 
about an object—he runs the risk of being 
mistaken. Consequently, people are often 
concerned with discovering whether or not 
they are right. They frequently attempt to 
learn if their judgments are correct by com- 
paring them with the judgments of others 
(Festinger, 1954). The information that a 
person receives through such a comparison
typically affects the confidence with which he
maintains his judgment. The present study
is directly concerned with the way confidence
is affected when a person finds that others
agree or disagree with a judgment he has made.

The possibility that a person’s judgment
may be correct or incorrect may be rephrased
in terms of attribution theory (Heider, 1944,
1958; Jones & Davis, 1965; Kelley, 1967).
Attribution theory concerns the human tend-
ency to see behavior, one’s own and others’,
as caused primarily by the person or the
environment. Any judgment, evaluation, or
emotion may reflect the judge’s peculiar
characteristics or be attributed entirely to the

1 This study is based on a doctoral dissertation sub-
mitted to Duke University. The author would like to
vided by National Science Foundation Grant G-8857,

compelling stimulus qualities of the entity
being perceived. 

In the realm of judgment, to say that a 
response reflects the entity implies the ac-
curacy of the judgment. The response mirrors
or describes the entity without distortion. 
Conversely, an inaccurate response is attribut- 
able either to the person or to misleading 
features in the context of judgment. The 
judge is either a flawed perceiver, or the 
entity is obscured by the qualities of its 
surround. 

Thus, the person wanting to know whether 
his judgment is correct needs to know whether 
it is attributable to the entity. Kelley (1967) 
suggested that a person can attribute his 
response to the entity if he responds differen- 
tially or distinctively to the entity, if he re- 
sponds in the same way over repeated inter- 
actions with it and through different modes of 
transaction, and if other people respond in 
the same way. That is, an entity response is 
marked by distinctiveness, consistency across 
time and modality, and consensus across others. 

It is the dimensions of consensus and 
modality that concern the present investiga- 
tion. Kelley’s model implies that if a person 
wants to establish that a response accurately 
reflects an entity, he must find whether other 
people respond to the entity in the same way 
and whether it manifests itself in the same 
way through other modalities. If others agree, 
he can be more certain that the response is
entity caused. If not, it will be regarded as
person caused, and if the person is to under- 
stand the response fully he must discover what 
personal characteristics led him to respond in 
the unusual way he did. Also, if other mani- 
festations or other information lead to the same 
response, he can be more certain that the 
response is entity caused. If not, the person 
must discover what features of his original 
information distorted the true attributes of 
the entity. 

These considerations make no mention of 
the characteristics of the person or persons 
constituting the consensus. While Kelley’s 
implications may be true in general, it would 
seem that one’s reaction to comparison in- 
formation would depend in part on the char- 
acteristics of the other. The present investiga- 
tion focuses on two dimensions that might 
affect the person’s response to agreement or 
disagreement: the other’s degree of similarity 
and his information about or modality of 
transaction with the entity. It asks how a 
person’s confidence in his judgment is in- 
fluenced by feedback from others who vary in 
their similarity and their information. 

Let us first consider the case in which an- 
other person is exposed to the same informa- 
tion about the entity that the subject has 
been and both reach the same conclusion 
about its essential characteristics. Assuming 
that the person feels some degree of un-
certainty about his judgment, how might his
confidence be affected when similar or dis- 
similar others agree? 

The person’s uncertainty reflects his ap- 
prehension that his judgment is person caused. 
He may doubt his own objectivity or per- 
ceptiveness. If a similar person agrees with 
the uncertain judge, his confidence should be 
increased on the basis that another agrees. 
Yet, the similar other may share the char- 
acteristic which generates the inaccurate 
judgment. Thus, by comparing with similar 
others, the person runs the risk of discovering 
an agreeing consensus which will lead him 
confidently to make an entity attribution 
when in fact the consensus combines a number 
of person-caused judgments. If a dissimilar
other agrees, there is a correction for bias: he
is less likely to share the error-producing
characteristic. Thus, his agreement helps to
rule out the possibility that the judgment is
person caused.

This reasoning suggests that if a person
makes a judgment about which he is not 
entirely confident, his confidence should be 
more increased by agreement from a dissimilar 
other than from a similar other. The similar 
other may share the characteristics which 
could be generating an incorrect response; 
the dissimilar other is less likely to. 

In other terms, if a person is unsure about a 
judgment, his confidence might be more in- 
creased if another person with a different 
perspective or viewpoint on the matter agreed. 
Such agreement would tell him that his judg- 
ment is not merely a function of his peculiar 
point of view. It suggests that people with a 
variety of different orientations reach the same 
conclusion, that the supporting consensus is a 
heterogeneous one. 

This discussion omits consideration of
modality, one of the key dimensions in Kelley’s
(1967) attribution model. A response can be 
attributed to the entity if, among other things, 
it is consistent across modes of interaction with 
the entity. An uncertain judge should be more 
confident if he knows that his response is 
consistent across modalities, that is, that other 
information would lead him to the same 
judgment. 

Suppose that the person himself has no
exposure to other information but his judg-
ment of the entity is supported by someone
else who has been so exposed. If the other were
similar, the person might be able to assume
that since they share a common orientation,
they would respond similarly to the different
information. He would have reliable data
suggesting that his response to the entity
would be consistent across at least one other
modality.


It should be noted that when a person finds
that a similar other with different information
agrees, he can increase his confidence on two
counts. First, he has discovered that his re-
sponse is consensual, and, second, he can
assume that it would be consistent across
modality.


The case where the agreer with different
information is dissimilar becomes fairly com-
plex. The logical extension of the discussion
above might be that confidence should be
maximally increased by agreement from a
dissimilar other with different information. 
Wouldn’t the person have bias-correcting con- 
sensus information in addition to consistency 
information? Such a conclusion overlooks an 
important new source of ambiguity. 

It was said above that if the other is similar, 
the person can assume that he would respond 
to the new information as he did to the 
original. Their common orientation permits 
this assumption. If the other is dissimilar, 
the person simply cannot assume similar 
responses to the different information. Thus, 
the dissimilar other’s agreement on the basis of 
different information tells him very little. 
Since the two are dissimilar, the person could 
as well assume that if he had access to the
other information, it would lead him to the
opposite conclusion. He has no reliable way of
approximating his response to that information.

Thus, under conditions where the other
person has the same information, the judge’s
confidence should be increased more by agree-
ment from a dissimilar other than from a
similar other. Under conditions where the 
other has different information, the person’s 
confidence should be increased more by agree- 
ment from a similar other than from a dis- 
similar other. These two predictions can be 
combined to yield a prediction of a statistical 
interaction among the four instances in which 
a judge receives agreeing feedback. This pre-
dicted interaction is referred to as the agree-
ment hypothesis.

Let us next consider instances in which a
person finds that another person reaches a
different conclusion about an entity. If the
disagreer has the same information and is
similar, he should greatly reduce the person’s
confidence in his judgment. The other’s dis-
agreement cannot be dismissed as a reflection
of personal characteristics. To the extent that
the two are similar, they share the same per-
spectives. By comparison, it should be easier
for the person to handle disagreement from a
dissimilar other. In such instances, people
generally assume that their own responses
accurately mirror the environment and that
the other’s response is idiosyncratic (Heider,
1958). The other’s dissimilarity conveniently 
facilitates this assumption.

When the disagreeing other has different
information, the situation is slightly more
complex. As with same information, the
similar other’s disagreement cannot be dis-
counted on the basis of personal biases. The
possibilities that remain are that the original
judge is correct, that the other person is cor-
rect, or that there is a complex entity-modality
interaction giving rise to the divergent opinions
of the two similar observers. In the latter
instance, the judge might take the similar
other’s disagreement to imply that while the 
entity manifested itself in one way through the 
original modality, it looked distinctly dif- 
ferent in the other modality. He would assume 
that if he had the other information he would 
see it as the other person did. 

Thus, the person would have to make some 
judgment as to which manifestation is more 
truly reflective of the entity’s attributes. 
Those who dismiss the other’s source of 
information would not greatly reduce their 
confidence; those who begin to suspect the 
representativeness of their transaction mo- 
dality would substantially decrease their con- 
fidence. A great deal of variability might be 
expected in this condition. 

A dissimilar other’s disagreement on the 
basis of different information would have no 
systematic impact on the person’s level of 
confidence. First, because he is dissimilar, his 
disagreement can easily be attributed to 
personal characteristics. Furthermore, his dis-
similarity makes it difficult for the person to
make any assumption as to how he would 
respond to the other’s information. 

The above reasoning yields two predictions 
about the way a person’s confidence in his 
judgment will be affected by disagreement 
from others. Under conditions where the 
other is similar, the judge’s confidence should 
be more decreased than where the other is 
dissimilar. Also, disagreements based on the 
same information should result in larger de- 
creases in confidence than corresponding in- 
stances where the disagreement is based on 
different information. Thus, the greatest de-
crease should be produced by similar dis-
agreers who have the same information. The
smallest decrease should be produced by 
dissimilar disagreers who have different 
information.

METHOD
Overview 


As part of an alleged study of college admissions 
procedures, small groups of subjects were shown video- 
taped segments, purported to be excerpts from the 
interviews of two prospective students. After the 
subjects had judged the students’ relative subsequent 
success, an exchange of evaluations was arranged. 
Half of the subjects were led to believe that they 
would get an evaluation from a group member who 
had seen the same segments; the others were led to 
believe that their evaluation came from a group mem- 
ber who had seen different segments. Before they read 
the other’s evaluation, the subjects were given their 
score and the other’s score on a fictitious test measuring 
interpersonal judgment style. Half of the subjects were 
led to believe that the other subject was similar in 
judging others; half were led to believe that he was 
markedly different. The evaluations that the subjects 
received either agreed or disagreed with their own. 
Thus, the design was a 2 (similar versus dissimilar)
X 2 (agree versus disagree) X 2 (same information
versus different information) factorial.
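
As an illustration only, the eight cells of this 2 X 2 X 2 design can be listed mechanically. The level names below follow the Overview; the function name is hypothetical and not part of the original report.

```python
from itertools import product

# Factor levels as named in the Overview of the Method section.
SIMILARITY = ("similar", "dissimilar")
AGREEMENT = ("agree", "disagree")
INFORMATION = ("same information", "different information")


def design_cells():
    """Return every cell of the 2 x 2 x 2 between-subjects design."""
    return list(product(SIMILARITY, AGREEMENT, INFORMATION))


if __name__ == "__main__":
    for cell in design_cells():
        print(" / ".join(cell))
```

Each subject falls in exactly one of these cells, which is why the error term of the later analysis has N minus 8 degrees of freedom.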


Subjects 


One hundred and twenty-one undergraduate men 
enrolled in introductory psychology classes at Duke 
University served as subjects in the experiment. 
Subjects were scheduled in groups of four and five for 
an experiment entitled “student selection study.” 
When only three subjects arrived, the group was run 
in same-information conditions. 


Experimental Materials 


Videotapes. The experiment required the subjects to
decide which of two Duke students was more successful 
academically. The theoretical discussion in the intro- 
duction assumes at least a modicum of uncertainty. 
Furthermore, if initial confidence levels were too high, 
there would be a ceiling on possible increases. Con- 
sequently, the tapes were designed not to favor one 
student so much that initial confidence ratings would 
be too high. 

The tapes consisted of eight interview segments, 
four of one student and four of another. The scripts 
were written to make one student appear more intel- 
ligent and creative. However, he appeared to be some- 
one who might pursue a variety of extracurricular 
interests. The other, in contrast, seemed like one who 
would study diligently without concerning himself 
with other matters. Thus, while the first appeared to 
have more potential, the subjects could choose the 
second on the basis of his self-disciplined single- 
mindedness. 

Similarity manipulation. Specially prepared “Interpersonal
Judgment Profile” sheets were used to manipulate
similarity. The subjects were given two profiles,
their own and that of the subject whose evaluation 
of the two students they received. The profiles allegedly
indicated their standing on a test which measured
one’s orientation toward other people or style of
evaluating personal characteristics. Subjects were told
that they could compare the two profiles to determine 
how similar their own style was to that of the other 
subject. The profiles were almost identical in the 
similar conditions and very divergent in the dissimilar 
conditions. 


Procedure 


The experimenter greeted the subjects and ushered 
them into a room with several chairs and a videotape 
monitor and recorder. 

The experimenter explained that the study con- 
cerned college admissions procedures: how admissions 
decisions were made and especially what the role of 
interviews was in such decisions. It was explained that 
in order to explore these questions, they would be 
asked to make some judgments about two Duke 
applicants whose admissions interviews had been 
recorded. 

He added that only short segments of the tape would 
be played, showing the two applicants discussing such 
issues as coeducation and curriculum relevance. Then 
he explained that both applicants had been accepted 
and were presently juniors at Duke. Both were doing 
well, but in terms of grades, faculty evaluation of 
independent work, and faculty assessment of each 
one’s potential for contributing superior work to his 
major field, one student was clearly better. The subjects’
task would be to identify the better student.
The experimenter said he was interested in seeing how 
accurate they could be, individually and as a group, 
in making judgments. 

In the same-information conditions, the experi- 
menter started the videotape without further comment. 
In the different-information conditions, he explained 
that when committees considered various applicants, 
it often happened that not all committee members 
had precisely the same information. The experimenter 
said that in an attempt to duplicate this feature of 
committees, he was showing some of them, chosen at 
random, a different tape. He added that the second 
tape merely showed different segments of the same 
two interviews. 

Then he led two subjects into an adjoining room. 
Both subjects were then in fact shown the same 
segments in different rooms. After six interview seg- 
ments were shown, all subjects were brought together 
in the main room. 

After the tapes were played, the subjects were asked 
to complete an evaluation sheet indicating which 
student they thought was better and explaining their 
choice. After collecting these evaluations, the experimenter
explained that he was interested in which
student the group thought was more successful and 
that one of the ways that the group’s choice would be 
determined was on the basis of their individual prefer- 
ences. Each of their preferences would be counted as a 
vote, but not all the votes would be weighted equally.
He explained that each choice would be weighted according
to the confidence with which it was held. The
experimenter emphasized the importance of each subject
indicating exactly how confident he felt. He said
preferences held with certainty should be weighted
heavily, but uncertain preferences should be weighted
lightly to avoid misleading the group.

The subjects were told that they could indicate their
confidence using numbers from 1 to 100. Finally, they 
were reminded that this was a preliminary indication, 
that they would be asked to indicate their confidence 
later when they had more information, and that the 
later rating would be used in determining the group's 
choice.
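
The weighting rule described to the subjects can be sketched as follows. This is a minimal illustration under an assumed rule (summing confidence ratings per student); the article does not state an exact formula, and the function name and ballots are hypothetical.

```python
from collections import defaultdict


def weighted_group_choice(ballots):
    """Pick the option with the largest summed confidence.

    ballots: list of (choice, confidence) pairs, with confidence on the
    1-100 scale described to the subjects. Summing the ratings is an
    assumption made for illustration only.
    """
    totals = defaultdict(float)
    for choice, confidence in ballots:
        if not 1 <= confidence <= 100:
            raise ValueError("confidence must lie between 1 and 100")
        totals[choice] += confidence
    winner = max(totals, key=totals.get)
    return winner, dict(totals)


if __name__ == "__main__":
    # Hypothetical ballots: three hesitant votes against one certain vote.
    ballots = [("first", 30), ("first", 25), ("first", 20), ("second", 95)]
    print(weighted_group_choice(ballots))
```

Under this rule a single confident member can outweigh several uncertain ones, which is the consequence the instructions stressed in order to motivate careful ratings.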

The measure of confidence change was presented with 
rather lengthy instruction in order to achieve two goals. 
First, it was hoped that the experimenter’s explanation 
would make it plain to the subjects that this confidence 
rating was important. Second, by tying the ratings to 
real consequences, the group’s chances of performing 
relatively well, it may have provided motivation for 
the subjects to use care and honesty in indicating their 
level of certainty. 

After collecting the ratings, the experimenter told the 
subjects that it was rare for committee members to go 
to a group discussion without having any indication of 
the feelings of the other members. Usually, they had
discovered informally how others felt. He said that 
this feature of group process was going to be duplicated 
in the present experiment and that each subject would 
be given the evaluation of one of the other group 
members. 

The experimenter added that in real groups when a 
member received another’s opinion, he did not merely 
get a written evaluation from a stranger. He generally 
knew the other person and knew whether that other 
judged people in the same way that he did. Again, 
they were told that a feature of group process would be 
simulated. Each subject would be given his own score, 
and the other subject’s score, on a test embedded in the 
series of questionnaires that they had all taken at the 
start of the semester. The experimenter explained the 
scale and how by carefully comparing profiles they 
could determine how similarly they and the other sub- 
ject judged people. 

After explaining the profiles, the experimenter asked 
the subjects to read the other person’s evaluation sheet. 
Subjects assigned to the agree conditions received an 
evaluation, prepared ahead of time, favoring the same 
student they had chosen. Disagree subjects received 
an evaluation favoring the other student. 

After the subjects had read the other person’s 
evaluation, the experimenter said he would show them 
another short tape segment that might be useful in 
helping them resolve any final questions about the 
two students. This segment was included so that 
subjects could change their confidence ratings without 
admitting being directly influenced by the other person. 

When the last tape segment was finished, the ex- 
perimenter distributed a questionnaire containing the 
final measures of preference and confidence. The 
experimenter instructed the subjects to be especially 
thoughtful in answering the confidence rating since 
this time it would be used to weight their vote in the 
group decision. 

The completion of the questionnaire marked the 
conclusion of the experiment. There was a final dis- 
cussion in which the true purpose of the experiment 
was fully explained. 


RESULTS 


Ten subjects were excluded from the data 
analysis. Three subjects were suspicious of the 
experimental procedures, and two did not
understand all of them. Two subjects changed
their original judgment before receiving the
other’s evaluation, and one could not ac- 
curately recall his original judgment. Finally, 
two subjects reported that they deliberately 
misused the confidence rating so as to influence 
the group’s decision in a way which did not 
at all reflect their level of certainty. Of the 
111 subjects included in the analysis, 75 judged 
the first stimulus person to be better, and 36 
chose the second.³


Manipulation Checks 


Bearing on the effectiveness of the similarity 
variation was an item on a postexperimental
questionnaire asking whether the other's
evaluation was about what the subjects expected
or different from what was expected.
An analysis of variance performed on the 
data from this measure shows two significant 
results. First, there is a main effect for agree- 
ment (F = 58.63, df = 1/103, p < .001), in- 
dicating that agreeing evaluations were ex- 
pected more than disagreeing evaluations. 
More germane at present is the significant Sim- 
ilarity X Agreement interaction (F = 25.38, 
df = 1/103, p < .001). This comparison shows 
that the subjects expected a similar other to 
agree and a dissimilar other to disagree. Thus, 
the subjects who received an agreeing evalua- 
tion from a dissimilar other were mildly 
surprised. Also, while the subjects were 
moderately surprised when the dissimilar other
disagreed, disagreement from the similar 
other was totally unexpected. In sum, these 
data suggest that the similarity manipulation 


3 Since there was no theoretical reason to expect the 
predictions of the study to apply differentially on the 
basis of the subjects’ choice of stimulus person, the 
subjects were assigned to conditions regardless of their 
choice, and the data were analyzed and are presented 
without regard to choice of stimulus person. However, 
if the data are analyzed separately, it can be seen that 
the agreement hypothesis receives slightly stronger 
support from the majority of subjects who chose the 
first stimulus person. The hypotheses for the disagree 
conditions are somewhat more strongly supported by 
the 2-4 subjects per condition choosing the second 
stimulus person.

had impact: specifically, it created an expectancy
as to whether the other subject
would agree or disagree. There were no other 
effects or trends on this measure. Another 
item asked whether the other subject was 
similar or dissimilar. All subjects correctly 
recalled the manipulation. 

Subjects were also asked to recall whether 
or not the other’s opinion agreed. All subjects 
correctly grasped the agreement variation. 
Postexperimental questioning indicated that 
without exception, the subjects perceived the 
information manipulation as intended. All 
subjects in the different-information conditions 
realized that the evaluation they received 
came from a subject who had seen different 
interview segments. Subjects in the same- 
information conditions naturally assumed that 
the evaluation was based on the tape the group 
had just seen. 


Changes in Confidence⁴


A factorial analysis of variance and the 
Newman-Keuls procedure (Winer, 1962) re- 
vealed no significant differences on the initial 
confidence ratings. The only trend was toward 
a main effect for information (F = 2.94, 
df = 1/103, p < .10). Subjects in the different-information
conditions tended to be
initially less confident than those in the same- 
information conditions. This may have been 
due to the fact that different-information 
subjects knew of the existence of other inter- 
view segments before they indicated initial 
confidence levels. The knowledge of other 
information may have slightly increased 
uncertainty. 

The means for changes in confidence are 
presented in Table 1. The analysis of variance 
performed on these data is presented in 
Table 2. Table 2 reveals a main effect for 
agreement (F = 36.76, df = 1/103, p < .001). 
Not surprisingly, subjects who received agreeing
evaluations showed greater increases in
confidence than did subjects receiving disagreeing
evaluations.


4 Before conducting the experiment, it was decided
that if any subjects in the disagree conditions changed
their judgment from one stimulus person to the other,
their confidence would be treated as having changed
from the level of confidence in their first judgment to
zero, no confidence. Only one subject changed his
judgment.


TABLE 1
CHANGES IN CONFIDENCE

                 Same information        Different information
Condition      Similar   Dissimilar     Similar   Dissimilar

Agree
  Mean          +7.94      +9.33        +14.29      +3.50
  s²           100.06     151.95        160.99     120.73
  n                16         15            14         14
Disagree
  Mean         -10.00      -6.21         -8.08       +.38
  s²           104.17     200.64        292.27     201.92
  n                13         14            12         13


There is also a significant
Similarity X Agreement interaction (F = 4.96, 
df = 1/103, p < .05), indicating that subjects 
were on the whole more influenced by similar 
others. Overall, changes in confidence were 
more increased by similar agreers and more 
decreased by similar disagreers. Only among 
the agree-same-information conditions is there 
a reversal, with the dissimilar other being 
more influential than the similar other. The 
trend toward a Similarity X Agreement X In- 
formation interaction (F = 3.00, df = 1/103, 
p < .10) reflects the Similarity X Information 
interaction within the agree conditions and the 
lack of such an interaction in the disagree 
conditions. 

The Similarity X Information within Agree 
comparison tests the agreement hypothesis. 


TABLE 2
ANALYSIS OF VARIANCE OF CHANGES IN CONFIDENCE

Source               df        MS        F

Similarity (A)        1      14.00
Agreement (B)         1    5990.94    36.76**
Information (C)       1     140.57
A X B                 1     808.07     4.96*
A X C                 1      97.23
B X C                 1     110.06
A X B X C             1     489.48     3.00
Error               103     162.98

A X C (B1) a          1     545.56     4.12*
Error (B1) b         55     132.26

a Tests the interaction of the similarity and the information
variables for subjects in the agree conditions.
b Error term for subjects in the agree conditions.
*p < .05.
**p < .001.


Using the error term for the agree subjects
only (Winer, 1962), the comparison is sig- 
nificant (F = 4.12, df = 1/55, p < .05). Under 
same-information conditions, the dissimilar
other’s agreement yields a slightly larger increase
in confidence; under different-information
conditions, it is quite clearly the similar
other’s agreement that most strengthens
confidence.
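
As a check on the arithmetic, each F in Table 2 is the effect mean square divided by the relevant error mean square. The sketch below uses only the mean squares printed in the table; the variable and function names are illustrative.

```python
# Mean squares copied from Table 2.
MS_ERROR = 162.98        # overall error term, df = 103
MS_ERROR_AGREE = 132.26  # error term for agree subjects only, df = 55

# effect name -> (effect mean square, error mean square used to test it)
EFFECTS = {
    "Agreement (B)": (5990.94, MS_ERROR),
    "A X B": (808.07, MS_ERROR),
    "A X B X C": (489.48, MS_ERROR),
    "A X C within agree": (545.56, MS_ERROR_AGREE),
}


def f_ratio(ms_effect, ms_error):
    """F statistic for a 1-df effect: effect MS over error MS."""
    return ms_effect / ms_error


if __name__ == "__main__":
    for name, (ms_effect, ms_error) in EFFECTS.items():
        print(f"{name}: F = {f_ratio(ms_effect, ms_error):.2f}")
```

The ratios agree with the tabled values of 36.76, 4.96, 3.00, and 4.12.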

The evidence bearing on the role of the 
similarity and information variables in the 
case of disagreement is not particularly sup- 
portive. Table 1 indicates that the means are 
ordered as expected, but the predicted effects
are not significant. However, there is some 
support for the hypothesis that disagreement 
from similar others reduces confidence more. 
If the error term for the entire analysis is used,
there is a trend in the predicted direction
(F = 2.99, df = 1/103, p < .10). Also of
interest is the predicted large variance in
the similar-disagree-different-information condition.
It is significantly larger than the
pooled variance in the other seven conditions
(F = 1.98, df = 11/92, p < .05).
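
The variance comparison can be retraced from Table 1. The sketch below assumes the second row of each block in Table 1 gives cell variances and that pooling weights each variance by n - 1; names are illustrative.

```python
# (agreement, information, similarity) -> (variance, n), from Table 1.
CELLS = {
    ("agree", "same", "similar"): (100.06, 16),
    ("agree", "same", "dissimilar"): (151.95, 15),
    ("agree", "different", "similar"): (160.99, 14),
    ("agree", "different", "dissimilar"): (120.73, 14),
    ("disagree", "same", "similar"): (104.17, 13),
    ("disagree", "same", "dissimilar"): (200.64, 14),
    ("disagree", "different", "similar"): (292.27, 12),
    ("disagree", "different", "dissimilar"): (201.92, 13),
}


def pooled_variance(cells):
    """Pool (variance, n) pairs, weighting each variance by n - 1."""
    df = sum(n - 1 for _, n in cells)
    ss = sum(var * (n - 1) for var, n in cells)
    return ss / df, df


TARGET = ("disagree", "different", "similar")
target_var, target_n = CELLS[TARGET]
others = [value for key, value in CELLS.items() if key != TARGET]

error_ms, error_df = pooled_variance(list(CELLS.values()))
pooled_others, df_others = pooled_variance(others)
variance_ratio = target_var / pooled_others

if __name__ == "__main__":
    print(f"error MS = {error_ms:.2f} on {error_df} df")
    print(f"F = {variance_ratio:.2f}, df = {target_n - 1}/{df_others}")
```

Pooling all eight cells recovers the error mean square of Table 2 (162.98 on 103 df), and the ratio of the target cell variance to the pooled variance of the other seven cells gives F = 1.98 with df = 11/92.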


DISCUSSION


While the present investigation was not 
designed to test directly Kelley’s (1967) state- 
ment of attribution theory, it seems appro- 
priate to consider the bearing of the results on 
his original formulation before considering 
evidence for the extensions of that formula- 
tion proposed in the introduction. 

Kelley’s statement with regard to the con- 
sensus dimension is that if the perceiver finds 
that others agree with him, that his response 
is consensual, his confidence that the response 
is entity caused will be increased. The experi- 
ment provides direct support for this hypoth- 
esis. Agreeing others produced more of 
an increase in confidence than disagreeing 
others. 

With regard to the modality dimension, 
Kelley reasoned that a perceiver will be more 
confident that his response is entity caused 
if it is consistent across more than one mo- 
dality. Although the subjects in the present 
experiment have direct experience with only 
one modality, those in the different-information
conditions do receive such information
secondhand. If these subjects were fully
convinced that they would process the dif- 
ferent information as did an agreeing other, 
Kelley’s model would unequivocally predict 
an increase in confidence. 


Assuming the equivalence of self and similar
other, it might be argued that a comparison
of the similar-agree-same-information condition
and the similar-agree-different-information
condition tests the consistency-across-modality
notion. In the former cell, subjects
have only consensus information; in the latter,
they have both consensus and consistency
information. This simple effect is in the right
direction (see Table 1) but is not significant
(t = 1.53, df = 28). With this comparison, then,
we have only marginal support for the Kelley
hypothesis that information from more than
one modality increases confidence in entity
attribution. This is not too surprising, however,
since the subjects could not be entirely
certain that their own response would match
that of the similar other. It can be noted, however,
that the increase in confidence in the one
cell where the subjects have fairly reliable
consistency information, the similar-agree-different-information
condition, is greater than
the average increase in the other three agree
conditions (F = 4.53, df = 55, p < .05).
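
The simple effect reported above (t = 1.53, df = 28) can be retraced from the Table 1 means, variances, and cell sizes. The sketch assumes an independent-samples t test with a pooled variance, which is consistent with df = 16 + 14 - 2 = 28 but is not stated in the article; the function name is illustrative.

```python
from math import sqrt


def pooled_t(mean1, var1, n1, mean2, var2, n2):
    """Independent-samples t with pooled variance; returns (t, df)."""
    df = n1 + n2 - 2
    pooled = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    standard_error = sqrt(pooled * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error, df


if __name__ == "__main__":
    # Similar-agree-different-information versus
    # similar-agree-same-information (values from Table 1).
    t, df = pooled_t(14.29, 160.99, 14, 7.94, 100.06, 16)
    print(f"t = {t:.2f}, df = {df}")
```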

Proceeding to the experimental hypotheses 
extending Kelley’s model, one prediction was 
that when an agreer has the same information 
as the judge, he will be more influential when 
he is dissimilar than when he is similar. This 
prediction constitutes half of the agreement 
hypothesis that was supported by the Sim- 
ilarity X Information within Agree interaction. 
Here, the person should welcome support from 
different perspectives. They permit the best 
triangulation. The relevant comparison is in the 
right direction, but does not approach statisti- 
cal significance. Nevertheless, the effect stands 
in contrast to the other three comparisons of 
similar and dissimilar others where the greater 
influence of the similar other is substantial 
enough to yield the overall Agreement X Simi- 
larity interaction. Furthermore, it is distinct 
enough to generate a trend toward a three- 
way interaction. Thus, the predicted effect 
is not in itself significant, but is in clear con- 
trast to what is happening in other cells of the 
experiment. 


Perhaps dissimilar others are influential
because of the triangulation they provide, but 
for other reasons their agreement is not as 
impressive as it theoretically should be. One 
problem is that triangulation is only relevant,
theoretically, when a verifiable belief about
the true attributes of an entity is at issue.
However, when a value is in question, there is 
no attribute to be correctly judged, no entity 
to be objectively perceived. Rather, the 
person’s interest is in discovering whether 
he has reasons for favoring or opposing the 
entity. Here he should be more influenced by a 
cooriented peer (Jones & Gerard, 1967), a 
person who has the same basic interests and 
perspectives, that is, a similar other rather 
than a dissimilar other. 

Although the issue in question in the present 
study was intended to be clearly a belief issue, 
some subjects may have taken a more evalua- 
tive stance. As a result, they may not have 
been impressed by the dissimilar other’s 
agreement, or they may have been impressed 
by the agreement of a cooriented peer even 
though it provided little correction for bias. 
Perhaps people generally find themselves in
situations where values are at issue and 
triangulation is therefore irrelevant. If so, 
they may not “shift gears” fully enough to 
conform to the logic of the triangulation 
situation.

In sum, the agreement hypothesis is sup- 
ported by the predicted interaction, but the 
simple effect bearing on an important part 
of the overall hypothesis is not significant. 
(The simple effect favoring the similar other 
in the agree-different-information conditions 
is significant; t = 2.40, df = 26, p < .05.)
Thus, the triangulation notion must be viewed 
with caution, but something is going on that 
reverses a more general trend favoring the 
similar other, and triangulation would appear 
to be responsible. The hypotheses for the 
disagree conditions are not supported by any 
significant effects, although the means are 
ordered as predicted. There is a trend sup- 
porting the prediction that similar disagreers 
are more effective in reducing judgmental 
confidence. 

How do the predictions and results of the 
present study relate to existing literature? 
First, the overall Similarity X Agreement interaction
supports the position that has
characterized most social influence research, 
namely, that people are more influenced by 
similar others than dissimilar others. As noted 
above, however, this interaction conceals a 
reversal within agree-same-information cells 
where the dissimilar other has at least the same 
if not greater influence. 

In searching for additional support for the 
triangulation hypothesis, there is little re- 
search bearing directly on the present problem. 
In question is the relative impact of agree- 
ment from similar and dissimilar others. 
However, most of the social influence re- 
search—on conformity, attitude change, and 
social comparison—considers instances of dis- 
agreement. Subjects typically are exposed to 
judgments that are different from their own 
so that the amount of change from the sub- 
jects’ initial stand can be measured. It should 
be noted that the present study’s predictions
about disagreement are entirely consistent with 
positions such as social comparison theory 
(Festinger, 1954) which suggest that similar 
others are more influential in changing a 
person’s attitude. 

There are, however, a few relevant studies. 
Berscheid (1966) conducted a study to show 
that if a subject found that a dissimilar other 
agreed, he would experience dissonance. She 
predicted that subjects would reduce dis- 
sonance by moving away from their own 
initial position. She found that subjects 
diverged from the dissimilar other only when 
the dissimilarity was on values directly relevant 
to the experimental issue. It is important to 
note that Berscheid’s experimental issues were 
value-related policy issues. 

There is at least one study that can be 
interpreted as supporting the triangulation 
notion of the present study. Wheeler and 
Levine (1967) conducted a study showing a 
dissimilar model to be more effective in in- 
ducing the contagion of aggression. Consistent 
with Berscheid’s results and the belief-value 
discussion, they suggested that it might be 
dissonant to agree with a person with different 
values but perfectly consonant to agree with 
another person of different background and
experience. Thus, if neither the issue (belief-value)
nor the dimension of similarity (background-values)
suggests that the person and the
dissimilar other should have opposing interests
in a question, that other’s agreement would not 
seem inconsistent. It would simply be taken 
to indicate the broad consensus on which the 
judgment in question rests. If, on the other 
hand, due to differing value orientations, the 
person feels that he and the other should dis- 
agree, the other’s agreement might cause dis-


ROLE OF VALUE OF REWARD AND MODEL AFFECTIVE
RESPONSE IN VICARIOUS REINFORCEMENT¹


LAURIE LERNER² AND ROBERT L. WEISS


University of Oregon 


It was proposed that the basis for the experience of vicarious reinforcement 
in an observer was either the value of reward given to the model for performing
specific behaviors or the model’s affective response to the reward
given him, rather than a combination of the two variables. Sixty 7-year-old 
boys were exposed to a film in which reward value and affective response of 
model were independently varied, for instance, both positive, both negative, 
positive reward negative affective response, negative reward positive affective 
response. The number of specific behaviors imitated was determined following 
exposure to the modeled repertoire and model’s affective response to either 
of the rewards. To test for the extent of acquisition of the repertoire, positive 
inducements for replicating all remembered model behaviors were offered. 
Results indicated that children exposed to positive model affective response, 
regardless of reward value, imitated significantly more (p < .01) than those
exposed to negative model affective response; value of reward was not a 
significant source of variance nor was the interaction between reward and 
affective response significant. The test for acquisition indicated that under 
appropriate incentive conditions, the modeled responses were available to all 
groups. The central role played by model’s affective response in vicarious
reinforcement, and the conditioning model, are discussed.


Although the efficacy of vicarious reinforce- 
ment as a learning tool has been clearly 
established, little is known about the mecha- 
nisms by which its effects are produced. The 
purpose of this study was to attempt to 
isolate the variable or variables responsible 
for the efficacy of vicarious reinforcement. 

In a vicariously reinforcing event, perform- 
ers enact some sequence of behaviors, which 
results in either punishing or rewarding conse- 
quences to the performer. Upon presentation 
of these consequences, the performer generally 
exhibits appropriate emotional responses. Two 
elements in the sequence of events might 
serve as the basis for vicarious reinforcement: 
(a) the reinforcement given the model for 
his behavior and (b) the model’s affective 

1 This article is based on a master’s thesis submitted
by the first author to the Department of
Psychology, University of Oregon, under the direc- 
tion of the second author. This study was reported 
at the 1971 meeting of the Western Psychological 
Association, San Francisco, California. The authors 
wish to thank Stephen M. Johnson, Benson Schaeffer, 
and Myron Rothbart for helpful suggestions, and 
Cheryl Ramberg and Chris Chan for their assistance 
in running subjects. 

2 Requests for reprints should be sent to Laurie 
Lerner, Department of Psychology, University of 
Oregon, Eugene, Oregon 97403.

response to that reinforcement. The present
study attempts to determine the relative con- 
tribution of these two variables in the process 
of vicarious reinforcement. 

Much of the literature to date has either 
assumed that it is the entire model-reward- 
reaction package that is responsible for the 
experience of vicarious reinforcement, or that 
the “how” of vicarious reinforcement is not 
an issue. The effort, rather, has been to dem- 
onstrate that vicarious reinforcement, what- 
ever its mechanism, can indeed modify the 
behavior of an observer in a manner similar 
to that of direct reinforcement (Bandura, 
Grusec, & Menlove, 1967; Kanfer & Marston, 
1963; Walters, Leat, & Mezei, 1963). The few 
researchers who have focused on the process 
itself have approached the problem on a 
theoretical level that is perhaps prematurely 
speculative, given the current state of knowl- 
edge. Berger (1962), for example, in an at- 
tempt to define the elements of vicarious re- 
inforcement, made the distinction between 
true vicarious reinforcement and what would 
better be termed pseudovicarious reinforcement.
Berger, working from a classical conditioning
paradigm, defined vicarious reinforcement
as a state in which an observer responds
emotionally to a performer’s unconditioned
emotional response. However, since the per- 
former’s unconditioned emotional response is 
not directly observable, the observer must 
infer the nature of this response from the 
unconditioned stimulus and the performer’s 
overt unconditioned response, in combination. 
Berger defined as pseudovicarious reinforce- 
ment any emotional responses made by the 
observer to any stimuli other than the un- 
conditioned stimulus and the unconditioned 
response (overt) combination. If we extend 
Berger’s somewhat restricted definition of 
vicarious reinforcement by adapting it to an 
instrumental paradigm, we find that the experience
of vicarious reinforcement can exist
only when both the reinforcement to the
model (for Berger, the unconditioned stimulus),
and the model’s affective response (the
unconditioned response), are of the same 
valence, that is, are congruent in this regard. 
The absence of one of these cues, or the pre- 
sentation of incompatible cues, cannot result 
in the inferred emotional response necessary to 
the experience of vicarious reinforcement. 
Therefore, any alleged vicarious reinforcement 
would be more appropriately labeled as 
pseudovicarious in those instances in which 
either the reinforcement to the model or his 
overt affective cues were absent, and those 
instances in which incompatibility of these 
two stimuli prevented the inference of an 
unconditioned emotional response. Thus, it 
would seem that for Berger, both the rein- 
forcing stimulus and the model affective re- 
sponse are essential for the truly vicarious 
experience. 

Bandura (1969) also attempted to restrict 
the definition of vicarious reinforcement. For 
Bandura, only those situations in which the 
reinforcing stimulus given to the model has 
not previously acquired strong eliciting po- 
tency for an observer may be considered vi- 
carious reinforcement. Such a prior connection 
would imply that the observer’s emotional 
responses were being evoked by the reinforce- 
ment itself regardless of the model’s behavior. 

In addition to delineating what may not 
be appropriately labeled vicarious reinforce- 
ment, some investigators (Bandura, 1969; 
Stotland, 1969) have suggested mechanisms
to account for the vicarious experience.
Bandura (1969) has proposed that


modeled affective cues produce vicarious arousal
largely through an intervening self-stimulation process
involving imaginal representation of aversive or
pleasurable consequences occurring to oneself in
similar situations [p. 171].


Stotland has, in addition to the “imagine-self”
hypothesis, suggested that vicarious reinforcement
may be the result of a process
whereby the observer imagines how the person
felt who directly experienced the consequences
(“imagine-him” hypothesis). The imagine-self
and imagine-him hypotheses would seem
to differ in their relative emphasis on the
value of reward and model affective response 
variables. The imagine-self hypothesis sug- 
gests that the observer reacts, imaginally, as 
if it were he who were being reinforced;
vicarious reinforcement would therefore be
based on the observer’s evaluation of, and
affective response to, the reinforcing stimulus
given the model. Conversely, the imagine-him
hypothesis suggests that the observer imagines
how the model feels when reinforced; here,
the model’s affective response is the stimulus 
that determines vicarious reinforcement. 

The imagine-self and imagine-him formula- 
tions provide a vivid contrast to Berger’s 
dual-stimulus notion of vicarious reinforce- 
ment in that the former emphasize either the 
value of the reinforcing stimulus or the model 
affective response variable, rather than the 
combination of the two variables. Given cer- 
tain modeling situations, the three theories 
would make different predictions as to the 
degree, and even the affective valence of 
vicarious reinforcement, depending on their 
relative emphasis on the two variables in 
question. 

Clearly, neither the empirical nor the theo- 
retical literature on vicarious reinforcement 
to date has adequately dealt with the question 
of what variable or variables are responsible 
for the experience of vicarious reinforcement. 
The present experiment was designed to indi- 
cate whether value of reward, and model 
affective response, affected vicarious reinforcement,
either in concert or individually.

The effects of the two variables on the 
degree of vicarious reinforcement experienced 
were assessed indirectly by means of an imitation
paradigm. Children were exposed to a
film in which the reward to a model for per- 
forming certain activities, and the model’s 
affective response to the reward, were either 
(a) both positive, (b) both negative, (c) 
positive for the reward variable and negative
for the model affective response, or (d) nega- 
tive for the reward variable and positive for 
the model affective response. Following the 
film, the children were given a free play 
period in which they had the opportunity to 
use the same play materials that had been 
used by the model. The play period was 
followed by a test for response acquisition 
in which the children were offered attractive 
rewards for duplicating the model’s activities. 
The number of precisely imitated responses 
(which served as the dependent variable) 
was scored during both the free play period 
(which constituted the test for performance) 
and the test for response acquisition. 


METHOD 
Subjects 


Sixty second-grade boys drawn from three ele- 
mentary schools in the Bethel school district of 
Eugene, Oregon, servicing a community of farming 
families and lower middle-class blue-collar workers, 
participated in the experiment.


Design 

A 2 × 2 factorial design based on 2 levels of model
affective response (positive and negative) and 2
levels of reward value (high and low) was employed.
Children were randomly assigned to each of the
four resulting cells, with 10 subjects each in the two
congruent groups (++, −−) and 20 subjects each
in the two mixed groups (+−, −+).³

³ The two mixed groups provided the crucial test
of our hypothesis so that a larger N was used to
increase the power of significance testing.


Film Conditions


The films provided the observers with a repertoire
of to-be-modeled responses as well as providing an
affective, vicarious reinforcement component. Each
film was designed to provide one of the four possible
combinations of reward value and model affective
response. Each film was subdivided into eight content
sections, with 2-20 specific behaviors that could be
modeled and a maximum of 61 discrete behaviors that
could be coded. The film depicted the following sequence:
A woman entered a room with several toys
and told the model, as she gave him the toys, that if
he “played with the toys the right way, he would get
a prize, but that if he did not play with them the
right way, she would not be able to give him a
prize.” She then left the room, and the model per- 
formed the first half of the behavioral repertoire, 
which included novel block-tower building and dis- 
assembly. When the model completed the set of 
activities, the woman returned and gave the model 
his reward. The rewards and model’s affective re- 
sponse to them varied with the film, according to 
the following scheme: In the congruent-positive
condition (++), an attractive locomotive was paired
with positive model response. In the positive-mixed
condition (+−), the same reward was paired with
negative model response. In the negative-mixed
condition (−+), an old torn book was paired with
positive model response, and in the congruent-negative
condition (−−), the same unattractive
reward was paired with negative model response.
The woman then returned with several more play ma- 
terials, stating that the boy would now have another 
opportunity to earn a prize. The model then per- 
formed the second half of the behavioral repertoire, 
which included novel play with a Bobo doll, bean 
bags, and hula hoop. In each of the four film
conditions, a reward similar in valence to that 
presented earlier was given to the model as reward 
for these activities (robot instead of locomotive, old 
glove instead of old book), and the model’s affective 
response was identical to that of the earlier reward 
sequence. Positive model affective response was portrayed
for all four rewards by the model’s smiling
and exclaiming that “this was the greatest toy he 
had ever seen,” “the best reward he had ever gotten,” 
etc., while negative model affective response was 
portrayed by the model’s putting his reward aside 
and exclaiming that “this was a terrible reward,” 
“who would want something like that,” etc. 


Variations in Rated Attractiveness 


An ordinal scale was constructed to measure the 
appropriateness of objects to serve as rewards. Color 
photos of four objects not used in the films (checker- 
board, bowling pin, mirror, and magic slate) were 
shown to the subjects together with the four objects 
that were ultimately used as rewards. Children were 
asked to rank order the photos according to the 
principle “toys I would like to have most.” 


Valence Check 


In order to isolate the relative effects of reward 
value and model affective response, it was necessary 
to make certain that the children’s evaluations of the 
rewards used in the films were not affected by the 
model’s affective response to those rewards. To test 
for such possible changes, the photos of the eight 
items were presented to the children for their at- 
tractiveness ratings twice prior to, and once directly 
following, exposure to the films. A check on the 
stability of the relative valence of the film rewards 
was thereby achieved. 
Procedure


Valence pretests. Two weeks prior to film presenta- 
tion, each subject was individually shown the eight 
color photos. The photos were laid out in random 
order, and the child was asked to select the item he 
liked the most. This liked item was removed, and 
the child was asked to name the object he preferred 
most from the remaining photos. The procedure was 
repeated until all the items had been arranged in a 
preference hierarchy. As an additional check, once 
the child’s hierarchy had been established, pairs of 
random items were selected from the hierarchy, and 
the child was asked which of these items he preferred. 
These responses were then checked for consistency 
with the original hierarchy. The photos were pre- 
sented again, 1 week prior to film exposure, to 
determine whether the hierarchy was still valid. The 
manner of presentation was identical to the earlier 
pretest, but, in order to avoid the suggestion that 
their earlier hierarchies should be duplicated, each 
child was told to rank the pictures without regard
to his previous rankings.

Experimental procedure. The children were brought
individually into a semidarkened room. The experi- 
menter explained that she had some photos she would 
like to show the boy and that while she was out of 
the room getting them, he might like to watch some- 
thing on television. After the child was seated, the 
experimenter turned on the television console and 
left. One of the four videotapes, each of approxi- 
mately 5 minutes duration, was then shown by means 
of a videotape recorder that was placed unobtrusively
in the rear of the room. When the film ended,
the experimenter returned with the photos and ad- 
ministered the valence check for the third time. She 
then apologized for having to leave the room again 
for a short period of time, but indicated that she 
had some toys the child might like to play with in 
her absence. The experimenter then brought out the 
play materials used by the model. Before leaving the 
room again, the experimenter told the child that he 
could play with these toys in any way he wanted to. 
Each child was then given a 6-minute play period 
before the experimenter returned. Two inconspicu- 
ously placed observers independently recorded the 
child’s imitative responses on observer scoring sheets 
which listed each activity performed by the film 
model. One point was given for each precisely 
imitated response. After 6 minutes, the experimenter 
returned, stating that she would be gone for a while 
longer, but had some more toys for the child to 
play with. She then replaced the first set of play 
materials with the items that the model had used 
in the second half of the behavioral sequence. During 
her absence, the children’s matching responses were 
again recorded by the observers. Scores obtained
during these two free play periods constituted the
measure of performance.

⁴ Because it was necessary for the observers to
show the videotaped sequences to the subjects, observers
were not blind to conditions, but they were
blind with regard to the hypothesis being tested.

The test for acquisition followed upon the experimenter’s
return. The child was told that he would
now have an opportunity to earn a reward, just as
the model had, that he would be given four marbles
if he could demonstrate everything the boy in the
film did. All of the stimulus items from both film
segments were presented, and those instances of
model-imitated activity were recorded by the observers.
Hesitant children were given a maximum of
five proddings by the experimenter if the child had
imitated only one item and still had much of the
5-minute limit remaining. Proddings consisted of
encouraging comments, such as, “let me see what
else you remember.” The test for acquisition was
terminated if a child hesitated after five such
proddings. As soon as the child stated that he could
remember no more, he was given four marbles and
praised.


RESULTS 
Observer Agreement 


Observer agreement was calculated on five 
randomly selected pairs of scores on the test
for performance and on five randomly selected 
pairs of scores on the test for acquisition. 
Observer agreement for the 10 pairs of scores 
ranged from 87% to 100%, with only three 
instances below 90%. 


Valence Check 


The hierarchy of rated object attractiveness 
was trichotomized for purposes of statistical 
analysis. The first category listed those items 
that a child evaluated as highly desirable or 
attractive, the second category contained the 
four items placed in intermediate (i.e., 
neutral) position, and the third category listed 
those items evaluated as relatively unattrac- 
tive. Preferences were considered to be con- 
sistent if there were no between-category in- 
versions in any of the three valence tests. 
Changes within a category were judged to 
be of little psychological consequence. Con- 
sistency of placement in the most preferred 
and least preferred categories (one and three, 
respectively) was evaluated separately. 
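
The consistency criterion described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the text says the middle category held four of the eight items, so the sketch assumes the top and bottom categories held two items each, and the example rankings are hypothetical rather than data from the study.

```python
def category(rank, n_items=8, n_top=2, n_bottom=2):
    """Map a 1-based rank (1 = most liked) to category 1, 2, or 3.

    The 2/4/2 split is an assumption; the source states only that the
    middle category contained four items.
    """
    if rank <= n_top:
        return 1
    if rank > n_items - n_bottom:
        return 3
    return 2


def is_consistent(sessions, items, target):
    """True if every item in `items` stays in category `target` in all sessions.

    `sessions` is a list of rankings; each ranking lists item names from
    most to least preferred. Reorderings within a category are ignored.
    """
    for ranking in sessions:
        for item in items:
            rank = ranking.index(item) + 1
            if category(rank, n_items=len(ranking)) != target:
                return False
    return True


# Hypothetical rankings for two children over the three valence tests.
base = ["train", "robot", "checkerboard", "bowling pin",
        "mirror", "magic slate", "book", "glove"]
swapped = ["robot", "train", "mirror", "bowling pin",
           "checkerboard", "magic slate", "glove", "book"]
demoted = ["robot", "checkerboard", "train", "bowling pin",
           "mirror", "magic slate", "book", "glove"]

stable_child = [base, swapped, base]
unstable_child = [base, demoted, base]

print(is_consistent(stable_child, ["train", "robot"], target=1))
print(is_consistent(unstable_child, ["train", "robot"], target=1))
```

In this sketch the first child counts as consistent because the changes stay within categories, while the second does not because the train moves from the first to the second category in one session.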

The valence-check data yielded the follow- 
ing frequency distribution of consistent and 
inconsistent placements for the most pre- 
ferred and least preferred categories: Forty- 
nine of the 60 subjects consistently placed 
train and robot in the most preferred cate- 
gory over the three sessions, while 11 were 
inconsistent in their placements of these items.


Fifty of the 60 subjects were consistent in 
their placements of book and glove in the 
least preferred category over the three ses- 
sions, while 10 were inconsistent in their 
placements of these items. 

The binomial test for deviation from a
50-50 split was applied. The distribution of
frequency of consistent versus inconsistent
placements of train and robot in the most
preferred category yielded a z score of −4.8,
p = .00003 (one-tailed). Setting the probability
of deviation under the null hypothesis
from a 50-50 split at p = .0001, the null
hypothesis of no difference between $p_1$ and $p_2$
may be rejected. Thus, significantly more
children were consistent, relative to those who 
switched, in placing train and robot in the 
same category over the three sessions. 

The frequency distribution of consistent 
versus inconsistent placements of book and 
glove in the least preferred category yielded 
a z score of −5.04, p = .00003 (one-tailed),
so that the null hypothesis of no difference
between $p_1$ and $p_2$ may again be rejected.
Significantly more children were consistent in 
placing book and glove in the least preferred 
category over the three sessions than those 
switching categories. 
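
The two z scores reported above can be approximately reproduced with the normal approximation to the binomial. The sketch below assumes that the statistic was computed on the smaller (inconsistent) count with a continuity correction of 0.5; the source does not state the computing formula, so this is a plausible reconstruction and not the authors' documented procedure.

```python
import math


def binomial_z(x, n, p=0.5):
    """Normal-approximation z for a binomial count x out of n.

    A continuity correction of 0.5 is applied toward the expected count
    (an assumption; the source does not describe the formula used).
    """
    mean = n * p
    sd = math.sqrt(n * p * (1 - p))
    correction = 0.5 if x < mean else -0.5
    return (x + correction - mean) / sd


# Inconsistent placements: 11 of 60 (train and robot), 10 of 60 (book and glove).
z_train_robot = binomial_z(11, 60)
z_book_glove = binomial_z(10, 60)
print(round(z_train_robot, 2), round(z_book_glove, 2))
```

Under these assumptions the values come out near −4.78 and −5.03, close to the reported −4.8 and −5.04.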


Test for Performance 


Table 1 presents a comparison of the mean
number of imitative responses made by the
subjects in the four film conditions for the
performance and acquisition tests. Inspection
of the performance means suggests that subjects
in the positive model affective response
conditions imitated to a greater extent than
those subjects in the negative model affective
response condition. Value of the reward
would appear to be irrelevant to the degree
of imitation.


TABLE 1

COMPARISON OF PERFORMANCE AND ACQUISITION
MEANS AND STANDARD DEVIATIONS FOR THE
FOUR EXPERIMENTAL CONDITIONS

                 Performance        Acquisition
Condition         M       SD         M       SD
++              30.0    10.1       36.9     5.7
+−              20.6     6.4       34.4     7.2
−+              27.3     7.9       34.3     7.0
−−              21.2     8.2       32.5     6.9


TABLE 2

ANALYSIS OF VARIANCE OF PERFORMANCE IMITATION
SCORES BY VALUE OF REWARD AND MODEL
AFFECTIVE RESPONSE

                                      Performance         Acquisition
Source                         df      MS        F         MS       F
Value of reward (A)             1     14.66      .22      67.98    1.32
Model affective response (B)    1    800.60    11.80*     61.32    1.18
A × B                           1     36.52      .54       1.33    .026
Within cell error              56     67.80               51.60

* p < .01; F for df = 1/56, α = .10 is > 2.80.

An analysis of variance for unequal Ns 
was performed (Winer, 1962, p. 241) on 
these imitation scores and revealed a sig- 
nificant main effect for only the model affec- 
tive response variable (F = 11.8, df = 1/56, 
p<.01). Neither the main effect for value 
of reward nor the interaction approached 
significance (see Table 2). 
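
As a rough check on Table 2, the mean squares can be approximated from the cell means in Table 1. The sketch below assumes that the analysis for unequal Ns was an unweighted-means analysis, in which each 1-df effect is a contrast on the cell means scaled by the harmonic mean of the cell sizes; the source cites only Winer (1962, p. 241), so the method and the function name are assumptions. The within-cell error term is taken directly from Table 2, because it cannot be recovered from the means.

```python
from statistics import harmonic_mean


def unweighted_means_anova_2x2(cell_means, cell_ns, ms_error):
    """Approximate MS and F for a 2 x 2 design from cell means.

    cell_means[a][b]: a indexes reward value (0 = high, 1 = low),
    b indexes model affective response (0 = positive, 1 = negative).
    Assumes an unweighted-means solution (not confirmed by the source).
    """
    n_h = harmonic_mean([n for row in cell_ns for n in row])
    (m_pp, m_pn), (m_np, m_nn) = cell_means
    contrasts = {
        "reward": (m_pp + m_pn) - (m_np + m_nn),
        "model_response": (m_pp + m_np) - (m_pn + m_nn),
        "interaction": (m_pp - m_pn) - (m_np - m_nn),
    }
    results = {}
    for name, c in contrasts.items():
        ms = n_h * c ** 2 / 4  # df = 1, so MS equals SS
        results[name] = (ms, ms / ms_error)
    return results


# Performance means from Table 1: rows ++, +- and -+, --.
performance = unweighted_means_anova_2x2(
    cell_means=[[30.0, 20.6], [27.3, 21.2]],
    cell_ns=[[10, 20], [20, 10]],
    ms_error=67.80,
)
for effect, (ms, f) in performance.items():
    print(f"{effect}: MS = {ms:.2f}, F = {f:.2f}")
```

Under these assumptions the sketch gives values close to the tabled ones (for example, a mean square near 800 and an F near 11.8 for model affective response); small discrepancies are expected because the published means are rounded.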


Test for Acquisition 


Inspection of Table 1 suggests that chil- 
dren in all conditions were able to reproduce 
more of the model’s behaviors (when offered 
an attractive incentive for doing so) than 
they had reproduced during the test for per- 
formance. The acquisition means also suggest 
that subjects in all conditions were equally 
able to duplicate the modeled repertoire, re- 
gardless of the nature of film exposure. Mean 
scores indicate that children in all condi- 
tions were able to reproduce an average of 
slightly greater than one-half of the modeled 
repertoire during the test for acquisition. 

An analysis of variance (for unequal Ns) 
of the imitation scores obtained during the 
test for acquisition revealed no significant 
main effects for value of reward or model 
affective response, nor a significant interaction 
(see Table 2). Thus, children in all condi- 
tions were equally able to reproduce the 
activities of the model regardless of their 
experimental group membership. 
DISCUSSION


Results indicate that only one variable of 
the two investigated—the model’s affective 
response to the reward—was responsible for 
the efficacy of vicarious reinforcement. It was 
found that positive model affective response 
resulted in a significantly greater degree of 
imitative behavior than negative model af- 
fective response, regardless of the value of 
the rewards. 

In view of these findings, it is somewhat 
surprising that children in the two negative 
model response groups imitated even to the 
limited extent that they did. There are sev- 
eral possible explanations for imitation in 
these two groups. A limited degree of vicari- 
ous reinforcement may have been experienced 
as the result of the woman’s labeling of the 
reinforcement given to the model as a reward, 
regardless of the absolute value of the reinforcement.
Or, second, the limited verbal
reinforcement provided by her in her state- 
ment to the model that he “played with all 
the toys the right way” may have been 
experienced as vicarious reinforcement in the 

observers. Third, much of the imitative behavior
in these two groups may have been
the result of the mere observation of another 
person’s behavior, rather than the result of 
vicarious reinforcement (Bandura, 1965a).
Bandura (1965b) has also demonstrated that 
some degree of imitative behavior does occur, 
even when children have viewed a model who 
has been severely punished for his behavior. 
However, further research must answer the 
question of whether the presentation of a 
reward paired with negative model response 
would limit imitative behavior to the same 
extent found by Bandura in presentation of 
punishment. It does seem that if the rela- 
tionship between model response and vicarious 
reinforcement holds within an observation-of- 
punishment paradigm as well, equally nega- 
tive model reactions to the presentation of a 
reward and to the administration of punish- 
ment should result in a similarly limited 
degree of imitative behavior. 

Whether the relationship suggested by this 
experiment between model affective response 
and vicarious reinforcement would hold with 
reinforcements of greater positive or negative
value, or with less extreme model affective
cues, cannot be known from the present study.
Conceivably, with an extremely desirable or 
undesirable object, the value of the reinforce- 
ment could override any model reaction. It 
would seem, however, that for at least these 
7-year-old boys, the rewards employed were 
viewed as highly attractive or unattractive. 

The fact that all but three children who saw the
model react negatively to the (desirable) train
and robot, and all but three who saw the
model react positively to the (undesirable)
book and glove, ranked these items in the
preferred and nonpreferred positions, respectively,
even after viewing the films suggests
that the observers’ evaluations of the rewards
had not been influenced by the model’s reactions
to them. Perhaps this is due to the
relatively unambiguous value of the reinforce- 
ments. Had the reinforcements been less 
familiar to the children, or had they been 
relatively neutral in value, the model’s affec- 
tive response to them might have had greater 
significance for the observers, that is, might 
have defined for them the value of the reward. 
The relative importance of the model’s affec- 
tive cues in defining the value of the reward 
may exist only when the rewards used are not 
clearly perceived as high or low in value. 

In spite of the fact that children appeared 
to be relatively unaffected by the model’s af- 
fective response in terms of their own ranking 
evaluations of the items, model affective cues 
seem to be the basis for the experience of 
vicarious reinforcement. This suggests that 
the vicarious experience is not merely an in- 
formational or discriminative process whereby 
the observer learns that in order to obtain 
the model’s reward he must imitate the 
model’s behavior. Had the experience been
on the level of “if I do what the model did, 
I will be rewarded in the same manner,” the 
children who saw the model rewarded with the 
train and robot and who ranked these items 
as most preferred should have imitated in 
order to receive the same reward, regardless 
of the model’s affective response. Results do 
not, however, eliminate the possibility that 
vicarious reinforcement is a discriminative 
process with a different focus—one in which 
the discrimination is made that the model’s 
affective state is somehow related to his performance
of the behavioral repertoire. The
discrimination in this case is that “if I do 
what the model did, I will (ultimately) be 
happy or unhappy, as the model was.” 
Another alternative hypothesis that might 
initially be suggested by the results of the 
present study emphasizes the role of model 
attractiveness in imitation. However, such an 
argument requires an assumption that may 
be untenable at this point, namely, that chil- 
dren label people who make positive affective 
responses (i.e., generally “agreeable” people) 
as highly attractive, while labeling people 
who make negative affective responses (i.e., 
generally “disagreeable” people) as highly 
unattractive. An observer, then, regards the 
agreeable model as more imitable than the 
disagreeable model, so that one would predict 
greater imitation in the positive than the 
negative model affective response condition. 
Since model attractiveness was not rated in 
this study, it cannot be known whether the 
positively responding model was perceived as 
more attractive and, therefore, more imitable 
than the negatively responding model. How- 
ever, it would seem that if model attractive- 
ness were the variable determining the degree 
of imitation, an observer would be more likely 
to rate as attractive, and hence, to imitate, 
a model whose values are similar to his own, 
rather than imitating the generally agreeable 
model. An observer who devalues a book or
glove should be attracted to a model who
similarly devalues these items (−−), while
not being attracted to the model who responds
positively to the book and glove that he
himself does not value (−+). If perceived
similarity between model and observer were
indeed to heighten perceived model attractiveness
(as social psychological literature
would suggest), and if model attractiveness
were indeed the variable responsible for imitation,
one would predict relatively greater
imitation in the two congruent conditions,
where the model’s reactions are similar to the
observer’s, than in the two mixed conditions.
The data suggest that this is not the case. 
These results would seem to weaken the 
hypothesis that vicarious reinforcement is the 
result of an imagine-self process whereby an 
observer imagines the aversive or pleasurable 


consequences that he would feel in a situation 
similar to that of the model. If this were 
indeed the process, children viewing the
model receive a reward that they themselves
value would, in imagining their own feelings
given a similar situation, imagine pleasurable
feelings, which should lead to a great degree
of vicarious reinforcement and, hence,
greater imitation. However, children who
viewed the attractive-reinforcement, negative-model-response
combination (+−) apparently
did not go through an imagine-self
process, since their imitative behavior was 
limited. Stotland’s imagine-him hypothesis 
would seem to be a more tenable one at this 
point, in spite of the fact that Stotland (1969) 
found greater empathy in the imagine-self 
than the imagine-him condition. 

This hypothesis suggests that an observer 
imagines the feelings of the model, based on 
the model’s overt affective response, rather 
than the observer’s own evaluation of the 
reward, or the observer’s affective response 
to the reward. Using this hypothesis, one 
would predict negative model affective re- 
sponse to elicit negative emotional arousal in 
an observer, while positive model affective re- 
sponse should elicit positive emotional arousal 
in the observer; such a relationship is sug- 
gested by the data. However, the imagine-him 
hypothesis does not make the necessary link 
between model-elicited, observer affective 
response, and subsequent observer modeling 
behavior. The relationship between observer 
affective response and modeling behavior may 
be explained within a conditioning model. 
Such a paradigm would suggest that affective 
responses elicited in an observer (via the 
imagine-him process suggested by Stotland) 
become conditioned to the modeled repertoire. 
Thus, the subsequent initiation of matching 
responses by an observer generates emotional 
arousal similar to that of the model, in the 
now-performing observer. Given this frame- 
work, one would expect positive model affec- 
tive cues to elicit a comparable positive affec- 
tive response in an observer. This vicariously 
aroused positive affective response in the ob- 
server becomes conditioned to the modeled 
activities, which, when subsequently initiated 
by the observer, generate in him positive 
emotional arousal as he now performs each 
of the modeled activities. This self-generated,
positive emotional arousal should then lead 
to a greater degree of imitation. In a similar 
manner, the model’s negative affective re- 
sponse would result in a lesser degree of 
imitation. Such a relationship is clearly
demonstrated by the data. 

Bandura (1969, p. 169) suggested an 
alternative design for demonstrating the rela- 
tive contributions of model affective response 
and reinforcing stimuli to vicarious reinforce- 
ment. This design would involve showing 
either model affective response to some un- 
known reward or showing the actual reward 
given to the model, with no indication of the 
model’s affective response to that reward. 
These experimental manipulations, which 
were not undertaken in the present study, 
would certainly shed further light on the 
questions raised here. 
VALENCE OF SUCCESS AND FAILURE AS A FUNCTION OF
ACHIEVEMENT MOTIVES AND LOCUS OF CONTROL¹


STUART A. KARABENICK²


Eastern Michigan University 


Sixty-one male undergraduates performed 20 anagram and 20 substitution tasks.
After each series they rated their anticipated degree of satisfaction (valence of
success, $V_s$) upon success and dissatisfaction (valence of failure, $V_f$) upon failure
at 11 similar tasks assumed to have designated difficulty levels. Inverse linear
relationships between $V_s$ and subjective probability of success and between $V_f$ and
subjective probability of failure were found based on data averaged over all subjects,
supporting the inverse linear assumption made by Atkinson’s model of achievement-
motivated behavior. However, analyses of individual ratings also revealed significant
nonlinear trend components for half of the sample that were apparently
canceled by the averaging process. There were no significant relationships between
either projective (Thematic Apperception Test need for Achievement) or objective
(Test Anxiety Questionnaire) measures of approach or avoidance achievement
motives and either $V_s$ or $V_f$ linear slope regression coefficients, which failed
to support predictions derived from the Atkinson model. However, significant
relationships were found between both $V_s$ and $V_f$ slopes and individual differences in
locus of control attribution: persons with relatively high internal control beliefs
generated steeper slopes than persons with relatively low internal control beliefs.


Atkinson’s (Atkinson & Feather, 1966)
expectancy-value model of achievement motivation
asserts that the tendency to engage in
behavior is a multiplicative function of motive,
subjective probability, and incentive value.
A basic assumption of the model is that the
incentive value of success ($I_s$) is an inverse
linear function of the subjective probability
of success ($P_s$). Likewise, the incentive value
of failure ($I_f$) is assumed to be an inverse linear
function of the probability of failure ($P_f$); that
is, $I_s = 1 - P_s$ and $I_f = 1 - P_f$. Given these
relationships, the model predicts optimal performance
at and preference for tasks with
$P_s = .50$ for persons whose motive to approach
success is greater than the motive to avoid
failure ($M_s > M_f$). Persons with $M_f > M_s$
are predicted to perform better and to prefer
either extremely easy ($P_s = 1.00$) or extremely
difficult ($P_s = .00$) tasks.
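
The predictions just described can be illustrated with a small numerical sketch. It uses the multiplicative form and the two incentive definitions given above, and it additionally assumes that $P_f = 1 - P_s$ and that the avoidance tendency is subtracted from the approach tendency, as in the usual statement of the Atkinson model; these two points, the function names, and the motive values are assumptions made for illustration.

```python
import math


def resultant_tendency(p_s, m_s, m_f):
    """Approach minus avoidance tendency at subjective probability p_s.

    Uses I_s = 1 - P_s and I_f = 1 - P_f as given in the text; taking
    P_f = 1 - P_s and subtracting the avoidance term are assumptions.
    """
    i_s = 1 - p_s
    p_f = 1 - p_s
    i_f = 1 - p_f
    approach = m_s * p_s * i_s
    avoidance = m_f * p_f * i_f
    return approach - avoidance


def preferred_levels(m_s, m_f, steps=10):
    """Return the P_s values on an evenly spaced grid where the tendency peaks."""
    grid = [i / steps for i in range(steps + 1)]
    values = [resultant_tendency(p, m_s, m_f) for p in grid]
    best = max(values)
    return [p for p, v in zip(grid, values)
            if math.isclose(v, best, abs_tol=1e-12)]


# Hypothetical motive strengths, chosen only for illustration.
print(preferred_levels(m_s=3, m_f=1))  # approach motive dominant
print(preferred_levels(m_s=1, m_f=3))  # avoidance motive dominant
```

With the approach motive dominant, the tendency peaks at $P_s = .50$; with the avoidance motive dominant, the least negative values fall at the two extremes of the difficulty range.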

The inverse linear relationship is quite
specific and necessary, since almost any deviation
from linearity would result in altering the
model’s predictions with regard to risk taking
and performance. For example, if $I_s$ and $I_f$
were negatively accelerated (i.e., concave)
functions of $P_s$ and $P_f$, respectively, maximum
preference and performance for persons with
$M_s > M_f$ would occur with difficulty levels
less than .50 rather than at .50. Persons with
$M_f > M_s$ would avoid tasks with difficulty
levels greater than $P_s = .50$ (instead of at
$P_s = .50$, given inverse linearity) and have
maximum preference and performance expected
when $P_s < .50$. Almost an infinite
variety of predictions could be generated with
appropriate functional relationships between
incentive and probability (see Heckhausen,
1968).

Indirect support for the inverse linear as- 
sumption has come from studies confirming 
either risk-taking or performance predictions 
based on that assumption (e.g., Atkinson & 
Litwin, 1960; Karabenick & Youssef, 1968; 
Mahone, 1960; Moulton, 1965). However, few 
studies have directly tested the assumption. In 
one such study, Litwin (1966) had one group 
of subjects assess the probability of succeeding 
at various distances from a peg in a ring-toss 
game, while a second group assigned monetary 
values ranging from $.00 to $1 that they 
deemed appropriate prizes for success at each 
distance. Average monetary values assigned 
were inversely, and approximately, linearly
related to distance. Brown (1963) had sub- 
jects rate their chances of success at anagrams 
of five different lengths. Subjects also rated 
the degree of “pleasure” when they succeeded 
or “displeasure” when they failed to solve an 
anagram. Average rated positive affect was a
direct linear function of anagram length, while 
rated negative affect was an inverse linear 
function of length. Feather (1967) found simi- 
lar linear relationships between task difficulty 
and the number of points that subjects thought 
should be awarded for success or penalized 
for failure on a hypothetical performance task. 
While the degree of linearity was not deter- 
mined, there is also evidence that the prestige 
ranking of occupations is inversely related to 
the probability of successfully entering them 
(Atkinson & O’Connor, 1963). 
The evidence cited is generally consistent in
supporting the postulated inverse relationship.
However, some aspects of the data are problematic.
First, conclusions have been based
entirely on aggregate data. It is conceivable 
that the linear relationship found could have 
arisen from averaging offsetting nonlinear 
trends. An analysis that examines individual 
data would be required to determine whether 
such is the case. Second, subjects have typi- 
cally rated success and failure at tasks that 
have varied in difficulty (e.g., for different dis- 
tances or different complexity levels). Linearity 
could not be determined unless there was some 
assurance of equal subjective difficulty inter- 
vals between different tasks or that the sub- 
jective difficulty levels were known for each 
individual and the relationship between incentive
value and Ps was determined on the basis
of those Ps levels. Finally, even the use of
extremely easy or difficult tasks does not
guarantee subjective difficulty levels of Ps
= 1.00 or .00, respectively. Extreme values
would have to be included to determine
whether linearity exists throughout the difficulty
range. The first objective of the present
study was to test the inverse linear assumption
directly by having subjects, familiar with
rating task difficulty in terms of Ps, use Ps
levels as the basis for their affect ratings.
This allowed an assessment of the functional
relationships over the entire difficulty continuum
and did not require the assumption of
equal Ps intervals between tasks of different
difficulty.
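
The first concern raised above, that a linear aggregate trend can arise from averaging offsetting nonlinear individual trends, can be illustrated with a minimal sketch. The two "subjects" and their rating functions are hypothetical and are not data from any cited study; second differences are used here only as a simple linearity check on equally spaced Ps levels.

```python
# Hypothetical illustration: two nonlinear individual functions whose
# average is exactly linear in Ps. These are not data from any study.

PS_LEVELS = [i / 10 for i in range(11)]  # Ps = .00, .10, ..., 1.00


def second_differences(values):
    """Second differences of equally spaced values; all zero iff linear."""
    return [values[i - 1] - 2 * values[i] + values[i + 1]
            for i in range(1, len(values) - 1)]


# Subject A: concave rating function. Subject B: convex rating function.
subject_a = [1 - p ** 2 for p in PS_LEVELS]
subject_b = [(1 - p) ** 2 for p in PS_LEVELS]

# Averaging over the two subjects gives 1 - Ps, a straight line.
group_mean = [(a + b) / 2 for a, b in zip(subject_a, subject_b)]

if __name__ == "__main__":
    for label, series in (("A", subject_a), ("B", subject_b), ("mean", group_mean)):
        curvature = max(abs(d) for d in second_differences(series))
        print(f"{label}: largest |second difference| = {curvature:.4f}")
```

Each individual function is clearly curved, yet the averaged function shows no curvature, which is why only an analysis of individual data can rule out this possibility.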

The second focus of this study deals with the 
relationship between incentive value and 
individual differences in achievement motives. 
Specifically, the degree of anticipated affect
for success and failure has been interpreted as
a joint function of incentive value and motive
levels. The anticipated degree of satisfaction
upon succeeding at a task has been defined as
Ms × Is and termed the “valence of success”
(Vs; Atkinson & Feather, 1966). Likewise, the
anticipated “repulsiveness of failure” has
been defined as Mf × If and termed the
“valence of failure” (Vf).

One direct consequence of this relationship
is that ratings of anticipated success and failure
affect are, operationally, only indirect indexes
of Is and If, respectively. This presents little
problem since Ms and Mf can be assumed
constant for all Ps levels. A second consequence
is that the increase in Vs with increasing
task difficulty (lower Ps) should be proportional
to the level of Ms. Likewise, the increase
in Vf should be proportional to the level of
Mf. It follows that the linear slope of Vs as a
function of Ps is predicted to be steeper (more
negative) for persons high in Ms than for
persons low in Ms. Similarly, the linear slope
of Vf as a function of Ps should be steeper
(more negative) for persons high in Mf than
for those low in Mf. It also follows that the
level of Vs, averaged over the entire difficulty
(Ps) range, should be related to individual
differences in Ms and Mf, as are the slopes.
However, this depends upon having Vs = 0
at Ps = 1.0, and Vf = 0 at Ps = .00. If this
restriction does not hold empirically, valence
slopes and levels need not be related. The
relationship might also not hold if the valence
functions are nonlinear.
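
The slope and level deductions can be made concrete with a small sketch. It assumes the inverse linear form Is = 1 − Ps, so that Vs = Ms × (1 − Ps); the motive values and the `offset` parameter are hypothetical and serve only to show what happens when Vs is not 0 at Ps = 1.0.

```python
# Sketch of the slope/level deduction under the assumption Is = 1 - Ps.
# Motive strengths and the offset are hypothetical illustration values.

PS_LEVELS = [i / 10 for i in range(11)]  # Ps = .00, .10, ..., 1.00


def valence_of_success(ms, ps, offset=0.0):
    """Vs = Ms * (1 - Ps); a nonzero offset relaxes Vs = 0 at Ps = 1.0."""
    return offset + ms * (1 - ps)


def slope_and_level(values, xs=PS_LEVELS):
    """Least-squares slope of values on xs, and the mean of values."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(values) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, values))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx, mean_y


if __name__ == "__main__":
    for ms, offset in ((1.0, 0.0), (2.0, 0.0), (2.0, 3.0)):
        vs = [valence_of_success(ms, p, offset) for p in PS_LEVELS]
        slope, level = slope_and_level(vs)
        print(f"Ms={ms}, offset={offset}: slope={slope:.2f}, level={level:.2f}")
```

With no offset, the slope equals −Ms and the mean level equals Ms/2, so the two indexes carry the same information. Once an offset is allowed, the level shifts while the slope does not, which is the sense in which slopes and levels "need not be related."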

Evidence for the slope deductions has been 
contradictory. Litwin (1966) found supportive 
evidence, in that the average linear slope of 
assigned monetary values for success as a 
function of distance was higher for persons 
classified as Ms > My than for those classified 
My > Ms. Strodtbeck, McDonald, and Rosen 
(1957) assessed how pleased teenage boys 
would feel upon entering occupations at several 
prestige levels. Results confirmed the hypothesis
that Jewish boys, coming from backgrounds
with a history of high upward social
mobility (an indication of high achievement
motivation), would place greater value on high-
as opposed to low-prestige occupations than
would Italian boys, for whom such cultural
stresses did not exist. Occupational slope was
subsequently used by Morgan (1964) as a
measure of achievement motive level and
found to be related to several economic indexes.
However, the same study also contained evidence
that Thematic Apperception Test need
for Achievement (TAT n Ach, a measure of
Ms) and occupational slope scores were uncorrelated
for a group of college undergraduates.

Brown (1963) failed to find significant relationships
between valence slopes and individual
differences in achievement motives, and
there was even a tendency for low-anxious
(i.e., low Mf) subjects to give higher ratings
of “displeasure” following failure than high-anxiety
(high Mf) subjects, contrary to expectation.
Feather (1967) also found no correlations
between linear slope regression
coefficients of Vs and Vf and indexes of Ms
and Mf. He suggested that this may have
been due to the hypothetical setting in which
the valence ratings were obtained. Further
evidence is necessary to resolve the above
inconsistencies using actual performance situations.
In the present study, the subjects’
ratings were obtained subsequent to extensive
task performance. It was predicted that the
linear slope regression coefficients (and average
levels) of Vs would be related to Ms, and that Vf
linear slopes (and average levels) would be
related to Mf.

The third major aspect of this study con- 
cerns “locus of control” beliefs as a moderator 
of affect in achievement situations (Lefcourt, 
1966; Rotter, 1966; Weiner & Kukla, 1970). 
Essentially, locus of control refers to personal 
beliefs about control over events. A strong 
belief that outcomes are due primarily to one’s 
own behavior or personal characteristics indi- 
cates a high degree of “internal control” beliefs, 
while the attribution of outcomes to forces 
beyond one’s control (e.g., chance or fate) 
indicates high “external control” beliefs. As 
applied to achievement contexts, high internal 
control attribution would be expected to result 
in greater affective experiences on success and 
failure, since such outcomes would involve
greater personal responsibility.
Studies varying situational (e.g., skill versus
chance) factors support this view (Breit, 1969; 
Feather, 1959a, 1959b, 1967; Rotter, 1966). 
However, there is still considerable question 
concerning the relationship between individual 
differences in locus of control and success and 
failure affect. One suggestion by Feather (1967) 
is that the degree of internal control attribution 
should act as a proportionality constant (C)
on the incentive value–probability of success
relationship (Is = C[1 − Ps] and If =
−CPs). Thus, persons high in internal control
beliefs would be predicted to have higher Vs
and Vf slopes than persons low in internal
control (high in external control beliefs).
However, Feather (1967) found no support for 
this hypothesis, again, possibly a result of the 
hypothetical context in which the ratings were 
obtained. It was presumed that greater per- 
sonal involvement in the performance task, as 
in the present study, would increase the chances 
of demonstrating this relationship. 


METHOD 
Subjects 


The subjects consisted of male undergraduates at 
Eastern Michigan University selected from the intro- 
ductory psychology subject pool. They participated in 
experimentation as one alternative in partial fulfillment 
of their course requirement, the other consisting of a 
journal article synopsis. Results are based on 61 sub- 
jects who completed all phases of the experiment. 


Assessment of Individual Differences 


Individual differences in Ms were assessed using the
TAT measure of achievement motivation designed by 
McClelland, Atkinson, Clark, and Lowell (1953). The 
protocols were elicited by a set of four pictures that were 
presented in the following order: (a) Inventors; (b) 
Reporter at desk; (c) Father and Son; (d) Operation 
scene. (Refer to Atkinson, 1958, Appendix IL). The 
subjects were given 4 minutes to write a story about 
each picture after seeing it projected on a large screen 
for 20 seconds. 

The TAT protocols were scored according to the 
manual for n Ach presented in Atkinson (1958) by the 
author who had previously attained expert scoring 
proficiency according to the criteria given in the scoring 
manual.

Individual differences in Mf were assessed using the
Test Anxiety Questionnaire (TAQ) devised by Mandler
and Sarason (1952), a measure of anxiety experienced
in testing situations. Subjects’ responses, marked on a
5-inch continuum, were scored using a 5-point scale
for each question and keyed so that high scores are
indicative of increasing anxiety.

An index that has been used to summarize the re- 
sultant approach achievement motive (Atkinson & 
Feather, 1966) is derived by subtracting a subject's
TAQ z scores from their TAT z scores, which provides
a resultant z score. It can be seen that this index increases
with increasing TAT scores and decreases with
increasing TAQ scores. As such, higher resultant z scores
should be associated with steeper Vs slopes and lower
resultant z scores with steeper Vf slopes.
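
The resultant index described above can be sketched as follows. The raw scores are hypothetical, and the use of the sample standard deviation for standardizing is an assumption, since the text does not say which form was used.

```python
# Sketch of the resultant z-score index: TAT z score minus TAQ z score.
# Raw scores below are hypothetical; the sample SD is an assumption.
from statistics import mean, stdev


def z_scores(values):
    """Standardize values within the group (mean 0, sample SD 1)."""
    m = mean(values)
    s = stdev(values)
    return [(v - m) / s for v in values]


def resultant_z(tat_scores, taq_scores):
    """Resultant approach motive: z(TAT) - z(TAQ), subject by subject."""
    return [zt - zq for zt, zq in zip(z_scores(tat_scores), z_scores(taq_scores))]


if __name__ == "__main__":
    tat = [2, 4, 6]      # hypothetical TAT n Ach scores
    taq = [30, 20, 10]   # hypothetical TAQ scores
    print(resultant_z(tat, taq))
```

A subject who is high on the TAT and low on the TAQ receives the largest resultant score, and the reverse pattern gives the smallest.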

An additional scale purporting to objectively measure 
achievement motivation was included. Entitled the 
Student Opinion and Attitude Questionnaire (SOAQ), 
constructed by Atkinson (1969), the scale consists of 
multiple-choice items that were keyed using an empirical
procedure. The item analysis is based on performance
and intermediate risk-taking criteria. Since the measure
has been shown to be related to both TAT n Ach
and TAQ scores (Atkinson, 1969), it would presumably
be related to slope indexes in the same manner as resultant
z scores; that is, persons high on the SOAQ should
have higher Vs and lower Vf slopes than persons low
on that scale.

Individual differences in outcome attribution were 
measured by the Locus of Control scale (see Rotter, 
1966), which contains forced-choice items attributing a 
series of outcomes of events to either internal or external
sources of control. The scale is keyed so that
high scores indicate increasing external control beliefs, 
while low scores represent increasing internal control 
beliefs. 

The TAT, TAQ, and Locus of Control scales were 
administered, in that order, during a single 1-hour ses- 
sion that was held in a large auditorium. The session 
was conducted by a male graduate student.³ To enable
scheduling for the second session, the subjects placed 
their names on sheets that contained code numbers. 
They were told that this was necessary in order to 
properly credit them with having participated and that 
the code numbers would guarantee anonymity, which 
was the case subsequent to the second experimental 
session. At the end of the session, the subjects were 
thanked for their cooperation and given a short explana- 
tion of the uses of objective and projective personality 
tests but were not told that they would be asked to 
attend a second session. 


Performance Tasks and Ps Ratings


The Vs and Vf ratings were obtained during a 1-hour
session that took place at least 2 weeks after the first 
session. Precautions were taken so as not to relate the 
two sessions, and a different experimenter was present. 
Subjects performed two series of tasks, one of which 
involved the solution of anagrams, the other, the sub- 
stitution of a series of letters for numbers (alphanumeric 
substitutions). By suitable arrangements, five levels of 
task difficulty were presented, ranging from certain 
solution to relative impossibility. Anagram difficulty 
was varied using five different lengths. Substitution 
tasks varied as to the number and arrangement of 
substitutions. 


3 The author wishes to thank Robert A. Boardman 
and Thomas Abercrombie for their assistance in con- 
ducting the assessment and individual experimental 
session, respectively. 

In addition to solving the tasks, the subjects were
asked to rate their chances of success at each of the
alternatives prior to each task selection. These ratings
were made by placing marks on lines scaled at 11 places.
The scale indicated chances out of 100 of succeeding at
the task. Thus, the difficult end of the scale was labeled
with the designation 0, along with the phrase “no
chance at all” and the easy end with 100 and the phrase
“absolutely certain.” Below 50, at the midpoint, was
the phrase “an even chance.”


Vs and Vf Rating Forms


Subjects recorded their valence ratings by indicating
degrees of “satisfaction” and “dissatisfaction” at tasks
that were assigned a given difficulty (Ps) level. The
Vs and Vf were rated on separate forms. Ratings were
made by placing marks on vertical lines. The upper
ends of the lines for Vs were labeled “a great deal of
satisfaction,” and the lower ends, “no satisfaction.”
Labels for the Vf scale were “a great deal of dissatisfaction”
on the top and “no dissatisfaction” on the bottom.
The point equidistant from the ends of the lines was
labeled “midpoint.”

There were 11 vertical lines on each rating form.
Below each line were two designations: a task number
and a Ps level. The Ps levels were presented in terms of
“chances out of 100.” The line that represented the most
difficult task had the designation “Task 1” and 0
chances out of 100 (i.e., Ps = .00). The line used to
represent the easiest task was labeled “Task 11” and
100 chances out of 100 (i.e., Ps = 1.00). The remaining
9 vertical lines were designated Tasks 2-10 with confidence
levels 10-90 chances out of 100 (i.e., Ps = .10
to Ps = .90). Since the 11 vertical continua on each
rating form were arrayed in order of difficulty, the subjects,
in effect, generated a graph of the functional
relationship between Vs and Ps and between Vf and
Ps.


Procedure 


Subjects were run individually during the second 
experimental hour. They were introduced to this ses- 
sion with a set of “ego involving” instructions that told 
them to do the best that they could since their perform- 
ance could be construed as indicative of their ability. 
The subjects were then given instructions on either the 
anagram or substitution tasks and were told that they 
were to work at a series of tasks whose difficulty they 
could select on each trial. In addition, the subjects 
were given instructions on marking the confidence rating 
form on which they indicated their perceived chances 
of success on each of the five alternatives prior to each 
task selection. After completing the rating form, the 
subjects opted to perform at one of the five difficulty 
levels. Since there were no constraints on choice, the 
number of successes and failures was not controlled. 
Subjects completed 20 trials that lasted either 30 
seconds (for anagrams) or 45 seconds (for sub- 
stitutions). 

At the end of the first set of 20 trials, the subjects 
were given instructions for filling out the valence rating 
forms. The following refers to the Vs form after completing
the substitution tasks:

What we would like you to do now is to indicate
how you would feel about succeeding on substitu- 
tion tasks like the ones you just worked on. Please 
look at this scale and examine it. Notice that the 
horizontal axis contains numbers ranging from 
0 to 100. Assume that these numbers indicate what 
you think your chances are of succeeding at each 
of 11 tasks that vary in difficulty. For example, 
assume that you succeed at Task 6 where you 
thought your chances of success were 50 out of 
100. If you would feel no satisfaction having suc- 
ceeded on such a task, then you would place an X 
right on the bottom of the line. If you would feel 
very satisfied having succeeded, your X would 
be placed on top of the scale. Degrees of satisfac- 
tion between these points would be indicated any- 
where along the line that truly reflects your feeling 
of success. Now, we would like you to rate your 
feelings of satisfaction for each of the 11 tasks, 
where, as you can see, Task 1 is extremely difficult, 
while Task 11 is extremely easy. Once again, the 
important thing is that your ratings about success 
be as accurate as you can make them. In other 
words, place your X at a point that indicates how 
you would truly feel about succeeding at these 
various levels of difficulty. 


Instructions for anagrams involved altering the tasks 
referred to. Instructions for Vf were the same as those
above, except that the term “dissatisfaction” replaced 
“satisfaction” and “failure” replaced “success,” where 
appropriate. The order of filling out the valence rating 
forms was determined randomly for each subject. 

After filling out the Vs and Vf forms, the subjects
were given instructions for, and worked at, the second 
series of 20 tasks. This was followed by completion of a 
second set of valence rating forms in the same order 
as after the first task series. Thus, in all, each subject 
completed a total of four valence rating forms in one 
of four possible orders (since the order of task presenta- 
tion was also varied randomly). Following completion 
of all rating forms, the subjects filled out the SOAQ 
and were then given both written and verbal feedback 
on the nature of the experiment. 


Valence Slopes and Levels 


Valence scores were derived by measuring the height 
of each affect rating from the base line using a 100- 
point scale. Higher values thus indicate greater degrees 
of satisfaction or dissatisfaction. In addition to single 
scores for each Ps level, the 11 scores from a given form
were used to generate valence levels and slopes. Valence
level estimates consist of the average of 11 affect ratings
on each form. Valence slopes were obtained by regressing
affect rating scores on Ps. Each linear regression
coefficient was computed with reference to Ps. Thus,
increasing Vs slopes are indicated by more negative
regression coefficients, while increasing Vf slopes are
reflected by more positive coefficients.


RESULTS 


Analysis of Vs and Vf Trends as a Function of
Ps

Fig. 1. Valence of success (Vs) and failure (Vf) (in
arbitrary units) as a function of probability of success
(Ps) for both tasks combined (Part A) and separately
for each task (Part B).


Mean Vs and Vf values for each of the 11
Ps levels for both tasks combined are presented
in Part A of Figure 1. The data clearly depict
the inverse relationships between I and P
assumed by the Atkinson model and are consistent
with previous studies (Brown, 1963;
Feather, 1967; Litwin, 1966). Although a


TABLE 1

VARIANCE ANALYSES OF Vs AND Vf RATINGS
OBTAINED FOLLOWING ANAGRAM AND
SUBSTITUTION TASKS

                     |     |        Vs        |        Vf
Source               | df  | MS    | F        | MS    | F
Subjects (Ss)        | 60  | .393  |          | .350  |
Tasks (A)            | 1   | .425  | 2.789    | .075  | <1
Error (A × Ss)       | 60  | .152  |          | .198  |
Ps level (B)         | 10  | 3.350 | 58.674** | 4.910 | 62.548**
Error (B × Ss)       | 600 | .057  |          | .079  |
A × B                | 10  | .093  | 4.128**  | .048  | 3.083*
Error (A × B × Ss)   | 600 | .023  |          | .016  |

*p < .01.
**p < .001.
TABLE 2

INTERCORRELATIONS BETWEEN LINEAR SLOPE REGRESSION COEFFICIENTS AND MEAN LEVELS
FOR Vs AND Vf RATINGS GENERATED FOLLOWING ANAGRAM AND
SUBSTITUTION TASKS (N = 61)

Variable               | 2     | 3      | 4           | 5   | 6     | 7   | 8   | X̄
Regression coefficient
1. Vs anagram          | 51*** | −43*** | [illegible] | 11  | −02   | −07 | 24  | −.42
2. Vs substitution     |       | −31*   | −56***      | −04 | 14    | 04  | 20  | −.58
3. Vf anagram          |       |        | 68***       | 03  | 25*   | 01  | −03 | .55
4. Vf substitution     |       |        |             | 21  | 29*   | 16  | 08  | .66
Mean level
5. Vs anagram          |       |        |             |     | 45*** | −12 | 10  | .65
6. Vs substitution     |       |        |             |     |       | 01  | 32* | .62
7. Vf anagram          |       |        |             |     |       |     | 32* | .49
8. Vf substitution     |       |        |             |     |       |     |     | .50

Note.—Decimal points have been omitted in correlations.
iled. 


slight curvilinearity is present in the Ps = .00-.50
range, the trends are predominantly linear.
Indeed, the linear trend component accounts
for 96% of the differences in Vs and 99% of
the differences in Vf as a function of Ps level.

Part B of Figure 1 presents the mean valence 
ratings for each task separately. Two-way 
variance analyses of the data in this figure 
(Task × Ps Level) are presented in Table 1.
The Ps level main effects for Vs and Vf are
highly significant. While there are no significant
main effect differences between tasks,
there are significant Task × Ps Level interaction
effects for both Vs (F = 4.13, df = 10/600,
p < .001) and Vf (F = 3.08, df = 10/600,
p < .01). Trend tests indicate that differences
in linear trends between tasks account for
99% of the Vs and 83% of the Vf sums of
squares. No higher order trends are significant.
As seen in Figure 1, the linear valence slopes 
are steeper when based on the substitution 
task than when based on the anagram task 
ratings. 


Relationships among Slope and Level Indexes 


Table 2 presents intercorrelations between 
linear slope regression coefficients and mean 
levels of Vs and Vf ratings for each task.
Consistency of individual differences in valence
ratings is indicated by correlations across
tasks. It can be seen that such consistency
exists for the linear slope estimates both for
Vs (r = .51, p < .001) and for Vf (r = .68,
p < .001). Mean levels are also consistent
across tasks, although at a somewhat lower
level for both Vs (r = .45, p < .001) and Vf
(r = .32, p < .01).

Correlations between Vs and Vf linear slope
regression coefficients indicate that high (more
negative) Vs slopes are associated with high (more
positive) Vf slopes: for the anagram task,
r = −.43, p < .001; for the substitution
task, r = −.56, p < .001. However, the relationships
between Vs and Vf levels are inconsistent.
While levels are significantly correlated
for the substitution task (r = .32, p < .01),
they are not for the anagram task (r = −.12,
ns).

Although all are positive, there are no sig- 
nificant correlations between valence slopes 
and mean valence levels (Variables 1-5, 2-6, 
3-7, 4-8). This indicates that the two are not 
redundant and may possibly reflect different 
sources of variation. 


Relationships between Vs, Vf, and Personality
Variables


Table 3 presents the correlations between 
the assessed individual difference dimensions, 
Vs and Vf linear slope regression coefficients,
and valence level indexes. As can be seen,
there are no significant correlations between
valence slopes and individual differences in
Ms and Mf, contrary to prediction. There are
also no relationships between slopes and the
combined index of approach motive (resultant
z scores). A similar lack of relationships is found
for mean valence levels.

However, as hypothesized, there is a significant
positive correlation between locus of control
scores and Vs slope (r = .39, p < .01) and
a negative correlation between locus of control
scores and Vf slope (r = −.22, p < .05)
when based on data following the anagram 
task. The correlations are not significant fol- 
lowing the substitution task, but they are in 
the predicted directions. 

To further elucidate the relationship be- 
tween locus of control and valence, the locus 
of control distribution was dichotomized at 
the median: persons above the median com- 
prising the group high in external control 
beliefs and those below the median, the group 
high in internal control beliefs. The Vs and Vf
means for these two groups over the 11 Ps
levels are presented in Figure 2 for both tasks
combined. Variance analyses were performed
separately for Vs and Vf ratings. Each analysis
consisted of a three-way design (2 × 11 × 2),
the dimensions consisting of locus of control,
Ps level, and task, respectively. Consistent
with the correlations, there was a significant
Locus of Control × Ps Level interaction for
both Vs (F = 7.93, df = 10/590, p < .001)
and Vf (F = 6.50, df = 10/590, p < .001).
Thus, Vs and Vf slopes vary as a function of
locus of control. Trend analyses show that
95% of the Locus of Control × Ps Level interaction
for Vs is attributable to differences in
linear trends between locus of control groups,
99% for Vf. Thus, as can be seen in Figure 2,


TABLE 3

INTERCORRELATIONS BETWEEN Vs AND Vf LINEAR
SLOPE REGRESSION COEFFICIENTS, MEAN Vs
AND Vf LEVELS, AND PERSONALITY
VARIABLES (N = 61)

Variable               | TAT n Ach | TAQ | z resultant | SOAQ | Locus of control
Regression coefficient
Vs anagram             | 07        | 07  | 00          | 08   | 39**
Vs substitution        | −12       | −10 | −02         | 03   | [illegible]
Vf anagram             | 03        | −07 | 08          | −02  | −22*
Vf substitution        | 13        | 10  | −03         | −01  | −16
Mean level
Vs anagram             | 09        | 07  | 01          | 01   | −01
Vs substitution        | 12        | 20  | −06         | 18   | −10
Vf anagram             | 05        | 01  | 03          | 14   | −10
Vf substitution        | 15        | 15  | −01         | −08  | 15

Note.—Decimal points have been omitted.
* p < .05, one-tailed.
** p < .01, one-tailed.

Fig. 2. Valence of success (Vs) and failure (Vf) (in
arbitrary units) as a function of probability of success
(Ps) for persons high in internal (IC) and those high
in external (EC) locus of control attribution, both tasks
combined.


both the Vs and Vf slopes are steeper for persons
tending to attribute the outcome of their
actions to internal as opposed to external
sources. Furthermore, the difference in slopes
is apparently due to reversals in valence levels
for high-internal-control and high-external-control
groups at opposite ends of the Ps
continuum. While Vs was higher for internal
control than for external control at extremely
difficult tasks (t = 3.615, p < .01 at Ps = .00),
the reverse was true for extremely easy tasks
(t = 2.539, p < .01 at Ps = 1.00). Similarly,
Vf was higher for internal control than for
external control at extremely easy tasks
(t = 3.569, p < .01 at Ps = 1.00) and the
reverse at extremely difficult tasks (t = 2.138,
p < .01 at Ps = .00).

An additional finding is that the magnitude 
of the difference between valence slopes is a 
function of the task. This is indicated by the 
significant triple interaction between the three 
variables in the analysis (Locus of Control
× Ps Level × Task) for Vs (F = 2.44, df
= 10/590, p < .01). Inspection of the means
revealed that the difference between high-internal-control
and high-external-control
groups is greater for the anagram task than
for the substitution task. Although this difference
is in the same direction for Vf, the comparable
triple interaction is not significant
(F < 1).
TABLE 4

FREQUENCY OF INDIVIDUAL VALENCE FUNCTIONS
CONTAINING SIGNIFICANT LINEAR AND HIGHER
ORDER TRENDS (α = .05)

                   |               Degree of trend
Type of rating     | 1       | 2       | 3       | 4      | 5
Vs anagram         | 55(90)  | 30(49)  | 29(48)  | 4(7)   | 1(2)
Vs substitution    | 55(90)  | 33(54)  | 29(48)  | 12(20) | 0(0)
Vf anagram         | 59(97)  | 32(52)  | 28(46)  | 12(20) | 0(0)
Vf substitution    | 55(90)  | 36(59)  | 29(48)  | 10(17) | 1(2)
Total (N = 244)    | 224(92) | 131(54) | 115(47) | 38(16) | 2(1)

Note.—Numbers in parentheses are the percentages of
significant trends.

Individual Differences among Valence Trends 


While trends averaged over individuals
provide general confirmation of the predicted
relationship between P and I, examination
of the functions for individual subjects
reveals a large number of trends departing 
from linearity. Very few of the valence ratings, 
in fact, produced even approximate linearity, 
and many had recognizable quadratic and cubic 
trends. The following analyses were under- 
taken to determine whether these higher order 
trends were statistically reliable and/or whether 
they were related to any of the personality 
dimensions assessed in the present study. 

First, the 11 valence values for each rating 
form were used to generate increasing degree 
polynomial regression equations until no fur- 
ther variance was accounted for by the next 
higher degree or up to fifth-degree polynomials. 
According to the procedure presented by Ostle 
(1954, pp. 138-142), each trend component 
(there could be as many significant trends as 
equations generated) was then tested for 
significance. Table 4 presents the number and 
the percentage of significant trends of each 
order (using α = .05).
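
A trend analysis of this general kind can be sketched for a single rating form. This is an illustration under stated assumptions and not a reproduction of the cited procedure: each trend component is tested against the residual from the fit of that same degree, all five degrees are always examined, the critical values are standard tabled F values for α = .05 with 1 numerator df (valid only for 11 data points), and the ratings are hypothetical.

```python
# Sketch only: orthogonal-polynomial trend components for one rating form.
# The choice of error term and the test sequence are assumptions and may
# differ from the procedure cited in the text.

# Tabled critical F values at alpha = .05 with 1 numerator df, keyed by
# denominator df (the values needed for 11 points and degrees 1 to 5).
F_CRIT_05 = {9: 5.117, 8: 5.318, 7: 5.591, 6: 5.987, 5: 6.608}


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def orthogonal_basis(xs, max_degree):
    """Gram-Schmidt orthogonal polynomials of degree 0..max_degree at xs."""
    basis = []
    for degree in range(max_degree + 1):
        v = [x ** degree for x in xs]
        for q in basis:
            coef = dot(v, q) / dot(q, q)
            v = [a - coef * b for a, b in zip(v, q)]
        basis.append(v)
    return basis


def significant_trends(xs, ys, max_degree=5):
    """Return the polynomial degrees whose trend component exceeds F_CRIT_05."""
    n = len(ys)
    basis = orthogonal_basis(xs, max_degree)
    mean_y = sum(ys) / n
    residual_ss = sum((y - mean_y) ** 2 for y in ys)  # SS about the mean
    significant = []
    for degree in range(1, max_degree + 1):
        q = basis[degree]
        component_ss = dot(ys, q) ** 2 / dot(q, q)
        residual_ss -= component_ss
        residual_df = n - degree - 1
        if residual_ss <= 1e-12:  # exact fit: nothing left to test against
            if component_ss > 1e-12:
                significant.append(degree)
            break
        f_ratio = component_ss / (residual_ss / residual_df)
        if f_ratio > F_CRIT_05[residual_df]:
            significant.append(degree)
    return significant


if __name__ == "__main__":
    ps = [i / 10 for i in range(11)]
    # Hypothetical positively accelerated success ratings plus small noise.
    ratings = [100 * (1 - p) ** 2 + (0.5 if i % 2 == 0 else -0.5)
               for i, p in enumerate(ps)]
    print(significant_trends(ps, ratings))
```

Because the components are orthogonal, each degree's sum of squares is the additional variance accounted for by moving to the next higher polynomial, which is the quantity the stepwise description in the text refers to.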

Consistent with analyses presented above, 
significant linear trends were present for 92% 
of the individual valence functions. Thus, 
almost all subjects on each of the four valence 
forms produced ratings that resulted in non- 
zero linear slopes. However, in addition, 86% 
of the functions with significant linear trends 
also contained significant higher order trends. 
Over half (54%) of the valence functions
possessed significant quadratic, and just under
half (47%), significant cubic trends. Many
fewer fourth-degree and almost no fifth-degree
trends were significant. Thus, analysis of individual
data reveals the presence of nonlinear
trends apparently canceled by averaging. (Recall
that the trends based on overall means
were almost entirely linear, and no higher order
trends were significant.)

To see if the shape of the higher order trend
components was consistent across subjects,
valence functions that contained only significant
second-order or only third-order trends,
in addition to linear trends, were selected.
The valence ratings for each Ps level for these
selected functions were then averaged, and the
results are presented in Figure 3 for both
tasks combined. As can be seen in the figure,
the second-order trends (Figure 3, Part B)
for both Vs and Vf functions show a slight positive
acceleration. This trend was also present
to some extent in the overall averaged data,
but not to the same degree. Third-order trends
appear somewhat S shaped for both Vs and
Vf (Figure 3, Part A), as they are superimposed
upon the linear trends. Such third-order trends
were noticeably absent from the data averaged
over all persons.

Fig. 3. Valence of success (Vs) and failure (Vf) (in
arbitrary units) as a function of probability of success
(Ps) for selected subjects generating data containing
significant cubic (Part A) or quadratic (Part B) trends.

An additional analysis was carried out to 
determine whether the presence of higher order 
trend components was in any way related to 
personality dimensions. This was tested by 
dichotomizing each personality variable at the 
median and tabulating the number of valence 
functions that contained significant higher 
order trends for each group. While there were 
some suggestive directions indicated (a higher 
proportion of second- and third-order trends 
for high- than for low-TAT groups and more 
for high-internal-control than for high-external- 
control subjects), none of the relationships 
were significant (by chi-square analyses). 
Thus, there is no evidence that the presence 
or absence of higher order trends is related to 
the personality dimensions assessed. 


DISCUSSION 


Results of the present study provide sup- 
port for the postulated inverse relationship 
between probability and incentive value as 
inferred from anticipated ratings of affect 
following success and failure. Data averaged 
over subjects support an inverse linear rela- 
tionship between J, and P, and between /; 
and P;, since almost all of the variance in 
valence as a function of P, was attributable to 
linear trend components. In addition, nearly 
all individual valence functions posessed signi- 
ficant linear trends (i.e., nonzero linear slopes). 
However, the averaged valence functions are 
asymmetric. First, they do not intersect at 
Ps = .50. Second, success and failure at ex-
tremely easy tasks were reported to result in
more intermediate degrees of affect than when 
the task was extremely difficult. This is more
pronounced for Vs than for Vf ratings. Thus,
moderate degrees of satisfaction were indicated 
even where the task was assumed to be solvable 
with complete certainty (i.e., Ps = 1.00). Per- 
haps the amount of effort expended, even on 
extremely easy tasks, was sufficient to produce 
some positive affect when persons succeeded. 

The failure to find relationships between 
slope measures and achievement-personality 
dimensions is consistent with previous studies 
(Brown, 1963; Feather, 1967) in failing to
support this aspect of the Atkinson model. It
is possible to argue, of course, that persons in 
the present study made judgments that were 
still somewhat hypothetical. Recall that the 
subjects were to “assume” that they had 
succeeded or failed on tasks similar to the 
ones on which they had just worked when they 
made their ratings. However, each person had 
actually experienced a large number of suc- 
cesses and/or failures at tasks of varying de- 
grees of difficulty prior to making their ratings, 
and they were also familiar with the proba- 
bility-of-success dimension since they had 
rated Ps prior to each task selection (a total
of 200 ratings). Thus, although the ratings 
were actually hypothetical, in that they were 
“anticipatory,” the subjects’ experience in the 
situation was not. It should be noted that the 
study by Brown (1963) that also failed to find 
support for the slope predictions used affect 
ratings obtained immediately after task suc- 
cess and failure. 

Another factor that should be noted is the 
relationship between success and failure slopes; 
that is, the steeper (more negative) the Vs
slope, the steeper (more positive) the Vf slope
(see also Feather, 1967). As slopes are assumed 
to be a function of achievement motive levels, 
this would be interpreted as a positive rela- 
tionship between Ms and My. This is incon- 
sistent with previous assumptions that these 
personality dimensions are independent of each 
other (Atkinson & Feather, 1966). 

The only personality dimension related to
slope is locus of control. This result is in sub-
stantial agreement with Feather’s formulation
of the relationship between locus of control 
and slope (Feather, 1967), although his dis- 
cursive equation would have to be modified 
slightly to account for the manner in which the 
slopes differed in the present study. Specifi- 
cally, the valence functions of the two locus of 
control groups intersected in the intermediate 
difficulty range, rather than diverging from a 
common origin as his equations dictate. Thus, 
while Vs levels were lower for high-external-
control than for high-internal-control subjects
on very difficult tasks (as expected), Vs was
higher for high-external-control than for high-
internal-control subjects on very easy tasks.
Similarly, while Vf was higher for high-inter-
nal-control than for high-external-control sub-
jects on extremely easy tasks (as expected),
the reverse was true on very difficult tasks. 

The nature of the interaction between locus
of control and task difficulty in determining
success and failure affect has implications for 
attempts to integrate the locus of control 
dimension into achievement motivation theory 
(e.g., Weiner & Kukla, 1970). Specifically, 
questions concerning affect and locus of control 
will have to take into consideration the diffi- 
culty level of the task involved. Data in the 
present study indicate that different answers 
would result depending upon whether the task 
involved was perceived by the person as ex-
tremely easy or extremely difficult.

Of some significance is that over half of the 
individual valence functions possessed signifi- 
cant quadratic or cubic trends. It is possible
to conclude that linearity may not be appro-
priate for this proportion of the individuals
tested, which circumscribes support for the 
linear assumption. Such deviations from 
linearity could result in different predictions 
from the Atkinson model, depending on the 
nature and degree of the deviation, in studies 
where Ps is a variable. However, it may also be
possible to utilize the valence functions to 
make individualized risk-taking and perform- 
ance predictions, assuming of course that the 
ratings represent veridical indexes of affect 
and that the scale used has more than ordinal 
properties. 
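
The partitioning of a valence function into linear, quadratic, and cubic trend components can be illustrated with orthogonal polynomial contrasts. The following is only a minimal sketch: it assumes five equally spaced Ps levels and a single rating per level, and the function name, the number of levels, and the example ratings are all hypothetical rather than taken from the study.

```python
# Orthogonal polynomial contrast coefficients for 5 equally spaced levels.
CONTRASTS = {
    "linear": (-2, -1, 0, 1, 2),
    "quadratic": (2, -1, -2, -1, 2),
    "cubic": (-1, 2, 0, -2, 1),
    "quartic": (1, -4, 6, -4, 1),
}


def trend_proportions(valences):
    """Return the share of between-level variation carried by each trend."""
    if len(valences) != 5:
        raise ValueError("this sketch assumes exactly 5 equally spaced levels")
    mean = sum(valences) / 5
    total_ss = sum((v - mean) ** 2 for v in valences)
    if total_ss == 0:
        raise ValueError("valences are constant; nothing to partition")
    shares = {}
    for name, coefs in CONTRASTS.items():
        # Contrast estimate, then its sum of squares (one rating per level).
        estimate = sum(c * v for c, v in zip(coefs, valences))
        contrast_ss = estimate ** 2 / sum(c * c for c in coefs)
        shares[name] = contrast_ss / total_ss
    return shares


# Hypothetical success-valence ratings at Ps = .1, .3, .5, .7, .9.
s_shaped = [9.0, 8.5, 5.0, 1.5, 1.0]
for trend, share in trend_proportions(s_shaped).items():
    print(f"{trend}: {share:.3f}")
```

In this made-up S-shaped example most of the variation is linear, with the remainder falling on the cubic component, which is the pattern described for the Part A subjects. Significance testing of each component would additionally require an error term, which this sketch does not model.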


SIGNALING UNMODIFIABLE SHOCKS:
LIMITS ON HUMAN INFORMATIONAL COGNITIVE CONTROL 1

University of Toronto


The limits of informational cognitive control, which occurs when merely signaling an 
inescapable shock reduces perceived noxiousness, were explored in four experiments 
by observing the rated aversiveness of and preference for signaled versus un-
signaled shocks. All four experiments (N = 24, 80, 48, and 32, respectively) used a
classical conditioning paradigm with conditioned stimulus-unconditioned stimulus 
intervals of either 5 or 8 seconds. In no experiment did signaling reduce rated 
aversiveness, nor was the signaled shock significantly preferred. Contrary to
current views, therefore, the limits on the influence of human informational cogni-
tive control appear to be severe.


Zimbardo (1969) has used the term “cogni-
tive control” to describe cases where a par-
ticular cognitive set influences the affective 
properties of noxious stimuli. Similar to the 
formulations of Zimbardo and his associates 
is the notion of informational cognitive control, 
which suggests that merely signaling the occur- 
rence of an inescapable, unmodifiable shock 
reduces the perceived noxiousness of that 
stimulus. Evidence which apparently supports 
the notion of informational cognitive control 
is available (e.g., Lanzetta & Driscoll, 1966), 
and the mechanism through which such con- 
trol might operate has also been recently 
specified by Perkins (1968). If informational 
cognitive control does occur in humans, two 
hypotheses follow: (a) The subjectively rated 
aversiveness of signaled shock is less than that 
of unsignaled shocks, and (b) signaled shocks 
are preferred to unsignaled shocks. The two 
hypotheses are independent, inasmuch as the 
latter (preference) hypothesis, unlike the for- 
mer (aversiveness) hypothesis, is readily in- 
terpretable in terms other than informational 
cognitive control. Specifically, a formulation 
like Berlyne’s (1960) suggests that the signal 
is reinforcing because it reduces uncertainty,
rather than actually reducing shock aversive-
ness. Such a view would predict a preference
for signaled shock, but would not necessarily 
interpret the preference phenomenon in terms 
of informational cognitive control. However, 
both the preference and the aversiveness hy- 
potheses are implied by the informational 
cognitive control notion; and the purpose of 
this paper is to examine the empirical status of 
human informational cognitive control by re- 
porting some relevant nonautonomic evidence 
which arose from several classical-autonomic
conditioning experiments.3


METHOD 
Subjects 


The 184 subjects in the four experiments were allo- 
cated as shown in the first column of Table 1. In each 
experiment, half of the subjects were male, and all of 
the subjects were either Toronto senior high school 
students paid $1.50 for participating during the summer 
(Experiments II and III) or University of Toronto 
undergraduates who were either paid $1.50 or partici- 
pated to fulfill a course requirement (Experiments I 
and IV). 


Apparatus 


The apparatus used for all of the conditioning experi- 
ments has been described in detail elsewhere (Furedy, 
1970). Briefly, a shock delivered from a constant-
current source to the subject’s right upper arm served
as the unconditioned stimulus (UCS); the two condi-
tioned stimuli (CSs) to be used in a differential-condi-
tioning paradigm were a tone (2,000 Hertz, 73 decibels,
presented through a speaker above the subject’s head)
and a light (room-illumination increase from 2.9 to 4.7
foot lamberts); the subject and the experimenter were
in separate rooms, but were in continuous intercom
contact.


3 The autonomic data associated with Experiments I,
II, and III are presented in detail in Furedy (1970) and
Furedy and Doob (1971a, 1971b).


TABLE 1
RATED-AVERSIVENESS AND PREFERENCE RESULTS

                    Rated aversiveness                    Preference
Experiment   N    Signaled a  Unsignaled a    t     Signaled  Unsignaled  No preference
I           24      42.0         42.5        .34           Not reported
IIa         32      53.4         53.2        .02       13         5           14
IIb         32      56.5         54.6       1.36        9        10           13
IIc         16      46.0         43.6       1.51        3         3           10
III         48      41.5         40.1       2.57       19        14           15
IV          32           Not reported                  10        10           12

a Mean % full-scale lever rotation.


Procedure


The following features were common to all experi- 
ments. The subjects were presented with differential 
conditioning trials, with counterbalanced tone and 
light as the CSs always (CS +) and never (CS —) 
paired with the UCS. In addition to the CS + UCS 
(signaled shock) and CS — alone trials, the subjects 
also received UCS alone (unsignaled shock) trials; the 
intertrial intervals varied unsystematically between 
30, 40, and 50 seconds. Shock aversiveness was rated 
by the subject rotating a lever up to 180° with his 
right hand (Kimmel & Schultz, 1965) after the delivery 
of each shock. Lever rotation was registered on a
voltmeter in the experimenter’s room.
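
The common trial structure can be sketched in a few lines. This is an illustration only: the text says the intertrial intervals varied "unsystematically," and seeded pseudo-random shuffling is used here as an assumed stand-in for whatever ordering the experimenters actually employed. The default of 15 trials per type matches Experiment I; the function names are hypothetical.

```python
import random
from collections import Counter

TRIAL_TYPES = ("CS+ UCS", "UCS alone", "CS- alone")
ITI_SECONDS = (30, 40, 50)


def build_schedule(trials_per_type=15, seed=0):
    """Return a list of (trial_type, intertrial_interval_seconds) pairs."""
    rng = random.Random(seed)
    trials = [t for t in TRIAL_TYPES for _ in range(trials_per_type)]
    rng.shuffle(trials)  # assumption: order approximated by a random shuffle
    return [(trial, rng.choice(ITI_SECONDS)) for trial in trials]


def lever_percent(rotation_degrees, full_scale=180.0):
    """Express a lever rotation as a percentage of the 180-degree full scale."""
    return 100.0 * rotation_degrees / full_scale


schedule = build_schedule()
print(Counter(trial for trial, _ in schedule))
print(lever_percent(90))
```

The second helper mirrors the way rated aversiveness is reported in Table 1, as a mean percentage of full-scale lever rotation.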

Concerning the procedural differences between ex- 
periments, Experiment I, which was originally run in 
order to investigate an implication of Perkins’ (1968) 
theory for autonomic conditioning, has been detailed 
elsewhere (Furedy, 1970). Briefly, the duration of both 
CS and UCS was .3 second, and the CS-UCS interval
was 5 seconds (trace-conditioning paradigm). The in-
tensity of the UCS was varied in an unsystematic order
within subjects from .5 to 2.5 milliamperes in .5-milli-
ampere steps, and there were 15 each of CS + UCS,
UCS alone, and CS — trials. The procedure in Experi- 
ment II was the same as in Experiment I, except that 
UCS intensity was held constant within subjects at 
either 1.0 milliampere (for half of the subjects) or 2.5
milliamperes (for the remaining subjects) and that at
the end of the session, each subject was asked whether 
he preferred the signaled shocks, unsignaled shocks, or 
had no preference. The only difference between the 
subexperiments of Experiment II (see Table 1) was 
that in Experiment IIa, trace conditioning was used,
and the subjects were not informed in advance about 
the CS-UCS relationship; in Experiment IIb, the 
subjects were so informed; in Experiment IIc, the 
subjects were informed, and the delay-conditioning 
paradigm was used with the 5-second CS+ terminating 
at UCS onset. The procedure in Experiment III was the
same as that in Experiment IIc, except that (a) the UCS
was varied unsystematically within subjects both in
intensity (1.5, 2.0, and 2.5 milliamperes) and duration
(.3, .8, and 2.0 seconds); (b) the CS-UCS interval
and CS duration were both extended to 8 seconds;
(c) the subjects were asked to rate the unpleasantness
rather than the intensity of the shocks; (d) there were
18 each of CS + UCS, UCS alone, and CS − trials.
In Experiment IV, the subjects were first presented
with the first two-thirds of the trials used in Experi-
ment IIc (i.e., 10 each of CS + UCS, UCS alone, and
CS − trials), and were then asked to indicate (by
pushing one of three buttons) whether they would now
prefer to receive a series of mostly signaled shocks,
mostly unsignaled shocks, or had no preference.


RESULTS 


The results, summarized in Table 1, are
presented by hypotheses (aversiveness and
preference) rather than by experiments.


Aversiveness Hypothesis 


As shown in Table 1, there was no significant 
difference, within subjects, between the rated 
intensities of signaled and unsignaled shock in 
Experiment I. As detailed elsewhere (Furedy, 
1970), the subjective ratings in this initial 
experiment were shown to be adequately sensi- 
tive, since they were reliably related to physical 
intensity differences. However, the method of 
validating rating sensitivity in Experiment I— 
varying physical shock intensity within sub- 
jects—may have resulted in the subjects’ fail- 
ing to attend to the finer differences between 
signaled and unsignaled shocks; Experiments 
IIa, IIb, and IIc provide data relevant for
checking this possibility. As Table 1 shows, 
however, the results do not confirm the possi- 
bility, since, despite the constant within- 
subjects physical shock intensity in these sub- 
experiments, the ratings of the signaled shocks
were not significantly exceeded by those of the
unsignaled shocks, nor were there significant 
interactions in any of the subexperiments be- 
tween physical shock intensity (varied be- 
tween subjects) and the signaled—unsignaled 
factor. 

Another possible method of saving the aver- 
siveness hypothesis arises from the fact that 
the above results are based on a single index 
of rated aversiveness: judged intensity. Ac- 
cordingly, the data of Experiment III provide 
another index, judged unpleasantness. In addi-
tion, as further evidence for the sensitivity of
this index of aversiveness, Experiment III 
allowed determination of whether the index 
was sensitive to within-subjects duration as 
well as intensity differences. Judged unpleas- 
antness did vary significantly (p < .001) and 
monotonically with both shock duration and 
intensity, but as Table 1 shows, signaling the 
shock again failed to result in a significant 
reduction in rated aversiveness. In this case, 
indeed, there was a marginally significant 
(p < .05) effect in the opposite direction. 


Preference Hypothesis 


The columns under “Preference” in Table 1
show the frequencies of subjects indicating
signaled, unsignaled, and no preference in the
various experiments. In isolation, the outcome
in Experiment IIa may well be interpreted as
confirmatory by a proponent of the preference 
hypothesis, since, with a one-tailed sign test, 
the preference reaches the .05 significance level 
for these 32 subjects. However, the sceptic 
will note from Table 1 that (a) not even such 
one-tailed statistical maneuverings can make 
the effect approach significance in Experiments 
IIb and IIc; (b) when the results are combined 
over the 80 subjects in Experiment II, the 
preference effect again fails to approach sig- 
nificance; and (c) the number of subjects ex- 
pressing no preference approaches those of 
subjects in the other two categories. Similarly, 
for the 48 subjects of Experiment III who 
received shocks of varying intensities and 
durations, the signaled shock was not sig- 
nificantly preferred, and the proportion of no 
preferences was again comparable with those 
in the other two categories. 
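
The sign-test reasoning above can be checked directly from the preference counts in Table 1. The sketch below assumes an exact one-tailed binomial test in which no-preference subjects are dropped, as is conventional for a sign test; the report does not say whether an exact or an approximate version was used, and the function name is hypothetical.

```python
from math import comb


def sign_test_one_tailed(signaled, unsignaled):
    """P(X >= signaled) when each stated preference is a fair coin flip.

    Subjects expressing no preference are excluded before the test.
    """
    n = signaled + unsignaled
    return sum(comb(n, k) for k in range(signaled, n + 1)) / 2 ** n


# (signaled, unsignaled) preference counts read from Table 1.
PREFERENCES = {
    "IIa": (13, 5),
    "IIb": (9, 10),
    "IIc": (3, 3),
    "III": (19, 14),
    "IV": (10, 10),
}
# Pool the three subexperiments of Experiment II (80 subjects in all).
PREFERENCES["II combined"] = (13 + 9 + 3, 5 + 10 + 3)

for name, (sig, unsig) in PREFERENCES.items():
    p = sign_test_one_tailed(sig, unsig)
    print(f"{name}: one-tailed p = {p:.3f}")
```

Under these assumptions only the Experiment IIa counts fall below the .05 level, and pooling the Experiment II subexperiments removes the effect, in line with the argument in the text.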

The above preference results are based only
on questionnaire data rather than actual be-
havior, and people at times seem to do exactly
the opposite of what they say they would
(e.g., Doob & Gross, 1968). Accordingly, the
data of Experiment IV in Table 1 provide
actual, rather than merely self-predicted, pref-
erence results. However, as Table 1 shows, the
actual and self-predicted forms of the prefer-
ence phenomenon did not differ in this case,
there being no significant proportion of sub- 
jects who preferred the signaled shock in 
Experiment IV. 


DISCUSSION 


The results summarized in Table 1 fail to 
provide evidence for either the aversiveness or 
the preference hypothesis. The case for the 
existence of either the aversiveness or prefer- 
ence phenomena is also weakened here by at 
least two considerations, although lack of 
evidence never constitutes disproof. First, the 
range of conditions over which the phenomena 
were not obtained included different (a) signal- 
shock (CS-UCS) intervals of 5 versus 8 seconds, 
(b) types of physical-shock-characteristic vari-
ation within subjects (intensity and duration,
intensity, or neither intensity nor duration
varied), (c) levels of physical shock intensity
between subjects of 1.0 versus 2.5 milli-
amperes in Experiment II, (d) indexes of
rated aversiveness, and (e) subject populations.
Second, there were checks on such possible 
confoundings as (a) the sensitivity of the rated- 
aversiveness index to differences in physical 
intensity and duration and (b) the possibly 
important distinction between expressed and 
actual preference. 

The present disconfirmations of the aver- 
siveness hypothesis, inasmuch as signaling 
failed to significantly reduce the rated aver-
siveness of the shock, do not appear to conflict
directly with other experimental reports. To 
our knowledge, the comparison of the rated 
aversiveness of signaled and unsignaled shocks 
has been made only by Kimmel (1967), and 
her results, on this point, were equivocal. One 
possible source of protection for the aversive- 
ness hypothesis from both the present and 
Kimmel’s results can be obtained from a 
methodological objection raised by Obrist, 
Webb, and Sutterer (1969, pp. 720-721), 
wherein it is suggested that having each sub- 
ject rate each shock in a series of signaled and
unsignaled shocks focuses abnormally high at-
tention on those stimuli, and thus attenuates
the beneficial effects of the signal. However,
Furedy and Chan (1971), in two experiments
designed primarily to check on some appar- 
ently pro-informational-cognitive-control re- 
sults of Lovibond (1968), have also provided 
data which are relevant to the empirical status 
of the objection of Obrist et al. (1969). The 
relevant groups from Furedy and Chan’s study 
were two sets of 30 subjects who received, 
respectively, a series of signaled and unsignaled 
shocks, and who were only asked after the end 
of the shock series to rate the aversiveness 
of the shocks. The ratings, contrary to both 
the aversiveness hypothesis and to Obrist et
al.’s objection, did not differ significantly be-
tween the two groups (Furedy & Chan, 1971).

On the other hand, the present failures to
confirm the preference hypothesis do appear
to conflict directly with the literature, wherein 
the preference-for-signaled-shock phenomenon 
seems to be commonly accepted to occur both 
in animals (e.g., Perkins, Seymann, Levis, & 
Spence, 1966) and humans (e.g., Lanzetta & 
Driscoll, 1966). The animal data, however, are 
based mostly on arrangements where the shock 
was not only delivered through shock grids 
rather than fixed electrodes, but these grids 
were not even scrambled (Lockard, 1963; 
Perkins et al., 1966). It has recently been 
shown that signaling the shock is reinforcing 
for rats only if the shock is delivered through 
unscrambled grids (Biederman & Furedy, 
1970), which suggests that the signaled shocks 
preferred by animals in the previously reported 
studies were preferred only because the shocks 
were not completely inescapable. 

This source of confounding through un- 
authorized escape or avoidance responses does 
not apply to the human-based report of pref- 
erence for signaled shock by Lanzetta and 
Driscoll (1966), where fixed electrodes were 
used for shock delivery. In summarizing their 
findings, however, these investigators noted 
that the preference phenomenon that they 
obtained exhibited considerable individual dif- 
ferences. Moreover, their experiment was not 
optimally designed to test for a preference for 
signaled over unsignaled shock. Subjects were 
instructed to indicate whether or not they
would like information as to whether or not a
10-second interval would be followed by an
unavoidable shock and were forced (by threat-
ening them with electric shock) to push a
button labeled “information” or one labeled
“no information.” It is conceivable that given
such an alternative, the subjects tended to
push the information button slightly more
than the no-information button simply because
they thought it to be the more appropriate
response in the situation.

All this is not to question the validity of the
findings that noxious and unmodifiable events
presented immediately after a signal are both
preferred to, and rated as less aversive than,
the same events presented after some, usually
variable, delay (e.g., Maltzman & Wolff, 1970).
The reasons for the discrepancy between these
findings and those relating to the signaled-
unsignaled difference are not clear. What is
clear, however, is that in the signaled-unsig-
naled context, there are indeed severe limits
on the influence of informational cognitive
control, at least as regards the group mani-
festation of that effect.

There remains the possibility that informa- 
tional cognitive control, as measured by the 
reduction in signaled-shock aversiveness, oc- 
curs for certain classes of subjects, and the 
same can be said about the preference-for- 
signaled-shock phenomenon. It is also possible 
that in subjects for whom the aversiveness 
phenomenon occurs, so does the preference
phenomenon. This, however, is not supported
by the present findings. In Experiments II and
III (see Table 1), where both rated-aversive- 
ness and preference information were available 
from all subjects, the signaled—unsignaled 
rated-aversiveness difference was not signifi- 
cantly related to expressed preference between 
signaled and unsignaled shocks. Signaling the 
shock, then, may be important, but the data 
presented here strongly suggest the need for 
information concerning, rather than faith in, 
the general importance of information about 
noxious, unmodifiable outcomes.


REACTIONS TO GENEROSITY OR STINGINESS FROM AN
INTELLIGENT OR STUPID WORK PARTNER:
A TEST OF EQUITY THEORY IN A DIRECT
EXCHANGE RELATIONSHIP 1


ARNOLD KAHN 2
Iowa State University


The study was an attempt to apply equity theory to a direct exchange situation.
One hundred and twenty subjects performed two tasks. On the first task, the partner
always divided the group earnings, while on the second task the subject was re-
sponsible for allocating rewards. A 3 × 2 × 2 factorial design was employed with 3
levels of reward received relative to partner's rewards, determined by the partner's
division of rewards on the first task, 2 levels of qualifications of partner (inputs),
and sex of dyad members comprising the independent variables. On the main de-
pendent measure, division of the rewards by the subject on the second task, the
equity theory prediction was confirmed in that previously overpaid subjects took
less money for themselves, and previously underpaid subjects took more money
for themselves than equally paid subjects; however, a preference for equality over
equity was noted in both the actual dividing of the rewards and questionnaire re-
sponses. Subjects’ divisions of the rewards were less extreme than equity theory pre-
dicted, and the subjects preferred equal division situations. Furthermore, partner’s
inputs had only minor effects on subjects’ behavior. The data are discussed in terms
of a desire to maintain equal status in a direct exchange, which can most easily be
assured by ignoring differences in inputs and allocating outcomes equally.


Equity theory (Adams, 1963, 1965) main- 
tains that individuals in a work situation com- 
pare their profits with others. More specifi- 
cally, a state of equity exists when a person 
(P) perceives his ratio of inputs to outcomes
as proportional to the ratio of a comparison per-
son (O). Inequity results when the ratios are
not perceived as proportional. Inequity is con- 
ceived as an aversive motivational state that 
P will attempt to reduce by altering his own 
or O’s inputs or outcomes, or by distorting his 
perceptions. In most exchange relationships, 
inputs refer to experience, training, and effort 
directed at the task, and outcomes include 
rewards gained such as salary and status. 
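
The equity condition amounts to comparing two ratios. The sketch below is a simplification under stated assumptions: it treats inputs and outcomes as single positive numbers on a common scale, whereas the theory concerns P's perceptions of them. The function name, labels, and tolerance are hypothetical.

```python
def equity_state(p_inputs, p_outcomes, o_inputs, o_outcomes, tol=1e-9):
    """Classify P's position by comparing outcome/input ratios of P and O."""
    if p_inputs <= 0 or o_inputs <= 0:
        raise ValueError("inputs must be positive in this sketch")
    # Cross-multiply so that p_outcomes/p_inputs is compared with
    # o_outcomes/o_inputs without dividing.
    diff = p_outcomes * o_inputs - o_outcomes * p_inputs
    if abs(diff) <= tol:
        return "equity"
    return "P overrewarded" if diff > 0 else "P underrewarded"


print(equity_state(1, 45, 1, 45))  # equal inputs, equal pay
print(equity_state(2, 60, 1, 30))  # double inputs, double pay
print(equity_state(1, 30, 1, 60))  # equal inputs, P paid less
```

Note that equal pay is equitable only when inputs are equal; with unequal inputs, proportional rather than equal outcomes satisfy the condition.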

Most research testing equity theory has
focused on the employer-employee relationship,
where the employer hires the employee, P, to
work at either a piece rate or hourly wage for
which P is underqualified, qualified, or over-
qualified (Adams, 1965; Lawler, 1968; Pritch-
ard, 1969). The typical dependent measure
(inequity reduction) has been job performance.
The nature of the comparison person, O, has
not been clearly specified. Usually, O is a
hypothetical person who is equitably paid, or
P, himself, in an assumed past job situation
in which he had been equitably paid.


1 The research reported in this study was partially
supported by an Office of Education Grant OEG-6-
70-0017 (509), “Conditions Leading to the Use of
Retaliation,” and a faculty initiation research grant
from Iowa State University. The author wishes to

2 Requests for reprints should be sent to Arnold Kahn,
Department of Psychology, Iowa State University,
Ames, Iowa 50010.


The present study was designed to test 
equity theory predictions in a situation in 
which P and O work together and where both 
of them have control over the division of the 
outcomes. This kind of situation is termed a
direct exchange, and it includes many types of 
situations from bargaining over the price of 
goods or services to be exchanged to the ex- 
change of gifts or favors between lovers. In a 
direct exchange situation, the nature of O is 
less ambiguous, being the other person with 
whom the exchange is taking place, and in- 
equity reduction can occur not only through 
task performance, but also by manipulating 
P or O’s outcomes. Behaviors such as reci- 
procity and retaliation can also be examined 
in this context (Adams, 1965, p. 278). 

A series of studies by Leventhal and his
colleagues (Leventhal, Allen, & Kemelgor,
1969; Leventhal & Bergman, 1969) has shown 
that people do allocate rewards to one another 
in accordance with the equity formulation: 
when inputs were equal, P increased his out- 
comes if he was underpaid and decreased them 
if he was overpaid. Furthermore, when over-
reward occurred intentionally by O, P was
more likely to decrease his outcomes to a 
greater extent than if the overreward occurred 
by chance, although O’s intentions did not 
affect reallocation of outcomes when P was 
underrewarded (Leventhal, Weiss, & Long, 
1969). Leventhal and Michaels (1969) found 
that when P worked longer than O and pro- 
duced the same amount as O, P took less than 
half of the reward; but when O worked longer
than P and produced the same amount as P, 
P took more than half of the reward. 

A number of studies dealing with helping 
behavior in a direct exchange, while not de- 
rived from equity theory, have found results 
consistent with the theory—greater help is 
given to O the more P has been previously 
helped by O (e.g., Frisch & Greenberg, 1968; 
Goranson & Berkowitz, 1966; Greenglass, 
1969; Pruitt, 1968). 

The present study was designed to be a 
more direct test of Adams’ model in a direct 
exchange situation. Three payment conditions 
were created—overpaid by O (generosity), 
equally paid by O, and underpaid by O 
(stinginess)—after performance on Task 1. 
Subjects then worked on a second task for 
which they were responsible for dividing the 
rewards between O and themselves. Since task 
performance was manipulated so that both P 
and O performed equally well, equity theory 
would predict that subjects would keep the 
most money for themselves when they had 
previously been underpaid by a stingy O, and 
keep the least amount for themselves when they 
had been previously overpaid by a generous 
O. In addition, half of the subjects were led 
to believe that O was highly qualified for the 
job and the other half that O was poorly 
qualified. It was predicted that the subjects 
would keep more money for themselves in all 
payment conditions when O was poorly quali- 
fied. The third variable, sex of subjects, was 
included since past research (Kahn, Hottes, &
Davis, 1971; Markwell, Ratcliff, & Schmitt,
1969; Vinacke, 1959) has indicated that males
are more concerned with financial gain, while
females are more desirous of maintaining a
harmonious relationship. It was therefore pre-
dicted that males would take more for them-
selves than females with both generous and 
stingy Os. 
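
The direction of the payment-condition prediction can be made concrete with a small calculation. This is only one possible reading: it assumes that, with equal inputs, equity is restored when P's and O's cumulative earnings across both individual tasks are equal. The amounts in cents are invented for illustration, the function name is hypothetical, and the predicted effects of O's qualifications and of sex are not modeled.

```python
def equity_restoring_share(own_first, partner_first, second_total):
    """Amount P keeps on Task 2 so that cumulative outcomes end up equal.

    Assumes equal inputs from P and O, so equity reduces to equal totals.
    The result is clamped to what is actually available on Task 2.
    """
    grand_total = own_first + partner_first + second_total
    target = grand_total / 2  # each member's equitable cumulative share
    return min(max(target - own_first, 0), second_total)


# Hypothetical amounts (cents): Task 1 division, then 90 to divide on Task 2.
print(equity_restoring_share(30, 60, 90))  # previously underpaid by a stingy O
print(equity_restoring_share(45, 45, 90))  # previously equally paid
print(equity_restoring_share(60, 30, 90))  # previously overpaid by a generous O
```

On this reading the underpaid subject keeps the most and the overpaid subject the least, which is the ordering the prediction describes; how far actual divisions fall short of such full restoration is an empirical question.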


METHOD 3


Pairs of subjects participated in an “individual versus 
group performance” study. Subjects first worked as a 
group on a proofreading task and then worked indivi- 
dually on two additional proofreading tasks. On the 
first individual task, O always divided the group earn- 
ings, while on the second individual task the subjects 
had an opportunity to distribute the earnings. The 
division of money on the second individual task was 
the main dependent variable. The independent vari- 
ables—amount kept by O on the first individual task, 
qualifications of O, and sex of the subjects—composed a 
3 × 2 × 2 factorial design. A total of 120 under-
graduates (60 males and 60 females) participated in
the study in like-sex pairs. Subjects received both credit 
toward their introductory psychology grade and money 
for their participation. 


Procedure 


As each subject arrived, he was placed in a separate 
room and given a “past background in English” form 
to complete. When both subjects had completed the 
forms, they were brought together in one room and 
introduced. If subjects indicated prior acquaintance 
with one another, the experiment was terminated. They 
were informed that the study involved determining 
whether people worked better in groups or as individuals 
and whether this varied with the ability of the in- 
dividuals. It was further explained that the tasks in- 
volved ability in English composition, specifically 
proofreading paragraphs, and that the past background 
in English form they had completed was a measure of 
that ability. Subjects were told that they would first 
work together in a group and then separately as in- 
dividuals, and that in order to motivate them to work 
hard, they would be paid on the basis of how well they 
performed as a group. It was explained that both quality
and quantity were important. Instructions for the 
group task were then distributed. After they had read 
the instructions, a panel of lights in front of the sub- 
jects was brought to their attention. They were told 
that when the blue light came on, a timer would be
activated and they were to begin, and that when they 
were finished they were to flip a switch which would 
stop the timer. The group task was then distributed. 
The experimenter entered a control room, activated the 


3 Generous thanks are due to Al Ryen and Steve
Hinds who served as experimenters and assisted with
the data analysis.

blue light, and waited for the signal that they had com-
pleted the task. When the subjects signaled that they
were finished, the experimenter collected the task and 
quickly scored it. He announced they had detected a 
total of ___ errors in ___ minutes and ___ seconds. He
continued: “For this level of performance you have
earned 90¢. Since you worked together, I’ll just give
you each 45¢. Ok?” Subjects were then paid. The 
experimenter then said it was time for them to work as 
individuals, that they would be working in separate 
rooms on the same tasks, and they would again be 
paid as a group. The experimenter explained that there 
would be two individual sessions and that a coin toss 
would determine who would be first to divide the earn- 
ings. The experimenter then escorted one subject to 
another room which also contained a panel of lights, 
passed out the second task, and activated the blue light. 
When both subjects indicated that they had completed 
the task, the experimenter collected them and told each 
subject that he would be back as soon as he had scored 
the paragraphs. At this point, the experimenter also 
handed each subject a copy of his partner’s (O’s) past 
background in English form stating: “While these are 
being scored I thought you might be interested in seeing 
the kind of background in English your partner had.” 
This constituted the manipulation of O’s qualifications 
and is explained below. The experimenter “scored” 
the paragraphs, went to one subject and collected his 
partner’s past background in English form, and flipped 
a coin, asking the subject to “call it.” The subject al- 
ways lost the flip and was told: “I guess you'll get to 
divide the money the next time. I’ll be back as soon as 
your partner has indicated how he’ll divide the money.” 
The experimenter then went to the other subject and 
announced that the first subject had won the coin toss 
and handed him a form allegedly showing how his 
partner had divided the earnings. The experimenter 
paid the subject, returned to the first subject and 
handed him a similar form showing how his partner 
had divided the earnings, and paid him. Thus, each 
subject was led to believe that O had divided the earn- 
ings on the first individual task, and this constituted 
the payment-conditions manipulation which is fully 
explained below. Subjects then completed the second 
individual task. The experimenter gave each subject 
their “scores” for the second task and stated that to- 
gether they had earned $1.12. The performance report 


TABLE 1 

PERCEPTION OF FAIRNESS OF THE DIVISION OF 
REWARD WHEN OTHER DIVIDED MONEY 

                    Payment condition 
Group      Overpaid   Equally paid   Underpaid     X̄ 
Males       1.80a        4.35b         7.50d      4.55 
Females     1.95a        4.70b         6.25c      4.30 
X̄           1.87         4.52          6.87 

Note.—Cells with the same subscript are not significantly 
different at the .05 level by Newman-Keuls. The higher the 
value, the more the subject felt that O got too much; the lower 
the value, the more the subject felt he got too much. 
indicated that both subjects had performed equally 
well—one subject always detected a few more errors, 
but took a few more seconds to do so. After each subject 
indicated how he wanted to divide the money, he was 
paid and given a questionnaire to complete. After 
completion of the questionnaire, both subjects were 
brought together, questioned, and then informed 
about the nature of the experiment. During the ques- 
tioning, no subject indicated that he thought the study 
involved equity, retaliation, reciprocation, etc., nor did 
any subject indicate awareness that he had been de- 
ceived. Earnings were slightly redistributed so that each 
subject went away with $1.50, and the subjects were 
requested not to discuss the study until the end of the 
quarter. 


Task 


Each proofreading task consisted of a typed page 
containing approximately 125 words of text. Subjects 
were instructed to circle each error they detected. A 
portion of one of the tasks is as follows: “A comunity 
must solve the problem of the family by revizing cerian 
established practic. Thats absolutely enevitible.” Each 
task contained approximately 50 errors of spelling or 
punctuation. Subjects generally took between 3 and 4 
minutes to complete each task, and the time between 
(1) the partner’s qualifications manipulation, which 
occurred prior to the first individual task, and (2) the 
opportunity to retaliate or reciprocate, which occurred 
after the second individual task, was no more than [illegible] 
minutes. 


Manipulation of Independent Variables 


Qualifications of O were manipulated by the bogus 
past background in English forms passed out prior to 
the earnings division on the first individual task. In 
the highly qualified condition, O “indicated” he was a 
21-year-old, junior, English major with a 3.62 grade 
point average. He had taken nine English courses in 
which he had received seven A’s and two B’s, and was 
currently enrolled in two English courses. He indicated 
that he read from 7 to 9 books a quarter that were not 
assigned and considered himself “much better” than 
others his age in grammar ability. In the poorly qualified 
condition, O “indicated” he was an 18-year-old, fresh- 
man, physical education major with a 1.42 grade point 
average. He had taken two English courses in which he 
had received D’s. He indicated that he read from one to 
three unassigned books per quarter and considered 
himself “a little worse” than others in his grammar 
ability. 

Payment conditions were manipulated by the division 
of the earnings by O on the first individual task. As on 
the second individual task, the subjects were informed 
that they had performed at about the same level as 
their partner. In the overpay condition, the subjects 
were informed that as a team they earned 98¢ and that 
O had given them 75¢ and had kept 23¢ for himself, 
and each subject was given the 75¢. In the equal-pay 
condition, the subjects were informed that O had divided 
the money equally, and each subject was paid 49¢. 
In the underpay condition, the subjects were informed 
that O had given them 23¢ and kept for himself 75¢, 
and each subject was given 23¢. 


RESULTS 
Success of Manipulations 


Two questions, answered on 9-point rating 
scales, were used to assess the effectiveness of 
the qualifications-of-partner manipulation. On 
both the first question—“How good do you 
think the other person is at proofreading?”— 
and the second question—“Who do you think 
was better at proofreading?”—there were 
sizable main effects for qualifications in the 
expected direction (F = 65.62 and 44.10, 
respectively, df = 1/108, p < .001). 

Perception of equity was assessed by the 
question: “How fair was the division of the 
reward when the other person divided the 
money?” Ratings were on a 9-point scale from 
1, I got too much, to 9, he got too much. A 
significant difference emerged for payment 
conditions (F = 168.02, df = 2/108, p < .001), 
and the means are presented in Table 1. A 
Newman-Keuls analysis showed each payment 
condition to be significantly different from one 
another at greater than the .01 level for both 
males and females. A significant interaction of 
Sex X Payment conditions occurred (F = 
5.10, df = 2/108, p < .01) and appears due to 
the underpay condition where males felt 
significantly more strongly that O got too 
much money than did females (see Table 1). 
No main effects or interactions were found for 
O’s qualifications. 


Amount of Money taken for Self 


It was predicted that subjects would keep 
the most money for themselves in the underpay 
condition and least for themselves in the over- 
pay condition, and that males would keep more 
for themselves than females. The data pertinent 
to these predictions are presented in Table 2. 
The equity theory prediction was strongly 
supported (F = 74.82, df = 2/108, p < .001), 
with each payment condition significantly 
different from one another at the .05 level for 
each sex. The overall sex effect was not signifi- 
cant (F = 2.50, df = 1/108, p < .20); how- 
ever, a Newman-Keuls analysis showed that 
males kept more for themselves in the under- 
pay condition than females (p < .01). 

A third prediction was that subjects would 
keep more money for themselves when O 
was poorly qualified than when he was highly 
qualified. No support was found for this pre- 
diction. The main effect for qualifications 
yielded an F ratio of less than 1.0, and in- 
spection of means showed no difference due 
to O’s qualifications for any of the three pay- 
ment conditions. The three-way interaction, 
however, was significant (F = 4.31, df = 2/ 
108, p < .05) and is illustrated in Figure 1. 
Males were more likely to reciprocate the 
generosity of the poorly qualified O and to 
retaliate his stinginess; females tended to 
reciprocate the generosity of the highly quali- 
fied O and retaliate her stinginess. However, 
individual comparisons revealed no significant 
differences due to O’s qualifications for either 
sex within any payment condition. 


Other Questionnaire Data 


The postexperimental questionnaire con- 
tained 11 questions. Three of these have al- 
ready been discussed under “Success of the 
Manipulations.” Responses to the questions 
“How hard do you think the other person 
worked on the individual tasks?” and “How 
much do you like the other person?” yielded 
nearly identical findings. For both measures, 
the only statistically significant finding was the 
three-way interaction (F = 3.89, df = 2/108, 
p < .05; and F = 3.73, df = 2/108, p < .05). 
Males liked O more and felt that he worked 
harder when he divided the money equally 
on the first task. When overpaid, males liked 
the poorly qualified O better and felt he worked 
harder; when underpaid, males felt the highly 
qualified O worked harder, but showed no 
difference in liking. Females, on the other hand, 


TABLE 2 

AMOUNT OF REWARD KEPT FOR SELF ON TASK 2 

                    Payment condition 
Group      Overpaid   Equally paid   Underpaid     X̄ 
Males       39.75a       56.00b        73.25d     56.33 
Females     39.05a       55.70b        64.85c     53.20 
X̄           39.40        55.85         69.05 

Note.—Cells with the same subscript are not significantly 
different at the .05 level by Newman-Keuls. 
FIG. 1. Amount of money kept for self on Trial 2 as a function of 
payment condition, sex, and partner’s qualifications. (Abscissa: payment by partner on Task 1.) 


showed nearly the opposite reaction. When 
overpaid, they liked the highly qualified O 
more, but showed preference for the unqualified 
O in the equal and underpay conditions. This 
pattern was approximated on perception of 
how hard O worked. They felt that the highly 
qualified O worked harder when she was gen- 
erous and that the unqualified O worked harder 
in the equal-pay condition, and they perceived no 
difference in the underpay condition. 

For the question: “How fair was the division 
of the reward when you divided the money?” 
a significant main effect emerged for payment 
conditions (F = 16.23, df = 2/108, p < .01), 
showing that initially overpaid subjects felt 
they received too little and initially under- 
paid subjects felt they received too much. The 
question: “How hard did you work on the 
individual tasks?” yielded no significant 
effects. 

Three questions concerned perceptions of 
performance on the group task (How hard did 
you work? How hard do you think the other 
person worked? and How fair was the division 
of the reward?). Responses to these questions 
essentially mirrored those for the perceptions 
of performance on the individual tasks pre- 
sented above. 


A final question, “How concerned were you 
with the financial aspects of this project?” 
revealed males more concerned than females 
(F = 5.76, df = 1/108, p < .05). 


Discussion 


Although the basic prediction from equity 
theory was supported—a generous O’s gen- 
erosity was reciprocated, and a stingy O’s 
stinginess was retaliated—the data produced 
two deviations from the theory. The effects of 
O’s qualifications had only minor consequences 
for the subject’s behaviors, and reciprocation 
and retaliation were less than predicted by the 
theory. 

Adams (1965) stated that inputs refer to a 
person’s “education, intelligence, experience, 
training, skill, seniority, age, sex, ethnic 
background, social status, and... effort he 
expends on the job [pp. 276-277].” The manip- 
ulation of inputs in the present study in- 
cluded five of these (education, intelligence, 
training, skill, and age). That these were per- 
ceived by the subjects is attested to by the 
significant differences found for the questions 
assessing how good O was at the task—the 
qualified O was seen as a better proofreader 
than the unqualified O. According to equity 
theory, the greater O’s inputs, the greater O’s 
outcomes should be, and therefore, the sub- 
jects should have kept less money for them- 
selves when working with a highly qualified O 
than an unqualified O in each of the three 
payment conditions. No such effect was found 
for any of the payment conditions. When the 
data were further broken down by both sex 
and payment conditions, the predicted effect 
still did not obtain for any of the six com- 
parisons. 

It might be that since O performed at ap- 
proximately the same level as the subject on 
both individual tasks, he was seen as not put- 
ting out as much effort in the qualified condi- 
tion, and working very hard in the unqualified 
condition. However, on the question assessing 
how hard subjects thought O worked, no ef- 
fects due to O’s qualifications were found. 
Likewise, the question assessing perception of 
the fairness of the reward division when O 
divided the money yielded no effects for 
qualifications. 

Qualifications may have had less of an im- 
pact than predicted because at the time of 
reward distribution, information concerning 
task performance was before the subjects but 
qualification information was not. This, com- 
bined with the fact that the subjects were 
probably preoccupied with getting even with 
O, probably reduced the salience of qualifica- 
tions. The fact that the experimenter divided 
the rewards on the group task on the basis of 
performance and not past background in 
English may also have contributed to the failure 
to obtain the predicted results. 

For whatever reason, it seems clear that the 
qualifications manipulation was perceived by 
the subjects, but not taken into account when 
allocating outcomes. Hence, it was not con- 
sidered relevant to the exchange. Given this 
state of affairs, the subjects should prefer an 
equal division of the rewards, and this is what 
was found. Subjects liked O more when he 
divided the rewards equally than when he was 
generous (t = 1.69, df = 38, p < .10) or when 
he was stingy (t = 2.36, df = 38, p < .05). 
Furthermore, when O divided the money 
equally, 39 of the 40 subjects also divided the 
money equally on the second task. 

A closer look at the data suggests an even 
greater preference for equality than needed 
to maintain equity. On the first task, the gen- 
erous O kept 24% of the reward for himself, 
so that to maintain equity the subjects should 
have kept only 27¢ for themselves on the 
second task. Subjects actually kept 39¢ for 
themselves, an amount about halfway between 
equity (27¢) and equality (56¢). The stingy 
O kept 76% of the reward for himself, and to 
maintain equity, the subjects should have 
kept 85¢ for themselves on the second task. 
In fact, these subjects kept 69¢, again an 
amount halfway between equity (85¢) and 
equality (56¢). These deviations toward 
equality may represent some form of constant 
error in the subjects’ judgments; however, 
taken with the liking for partner data, it might 
also suggest the outcome of a conflict between 
an equal division, which they prefer, and re- 
ciprocation or retaliation, which is equitable. 
This is substantiated by the responses to the 
question about the fairness of the reward 
division on the second task, where the sub- 
jects indicated greatest fairness when they 
divided the money equally. When they kept 
more for themselves in the O-stingy condition, 
they indicated they got too much for themselves 
on the second task, even though they retaliated 
less than predicted by equity theory. Parallel 
results were obtained in the O-generous con- 
dition, where the subjects felt O got too much 
on the second task. 
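
The equity and equality benchmarks in the preceding paragraph can be checked with a short calculation. The sketch below is illustrative only and is not part of the original analysis. It assumes that "maintaining equity" means keeping the same proportion of the second-task total ($1.12) that O kept on the first task (24% or 76%, as stated in the text), and it takes the amounts actually kept from the column means of Table 2. The function and variable names are hypothetical.

```python
# Illustrative check of the equity/equality arithmetic described in the text.
# Assumption: equity = keeping the same share of the second-task total
# that O kept on the first task (24% when generous, 76% when stingy).

SECOND_TASK_TOTAL = 112.0  # cents; joint earnings reported for the second task


def equity_amount(share_o_kept, total=SECOND_TASK_TOTAL):
    """Cents the subject would keep to mirror O's first-task share."""
    return share_o_kept * total


def position_toward_equality(kept, equity, equality):
    """Return 0.0 if `kept` sits at the equity point, 1.0 if at equality."""
    return (kept - equity) / (equality - equity)


equality = SECOND_TASK_TOTAL / 2  # an even split of the second-task total

# (share O kept on task 1, mean cents subjects kept on task 2 from Table 2)
conditions = {
    "O generous": (0.24, 39.40),
    "O stingy": (0.76, 69.05),
}

results = {}
for label, (share, kept) in conditions.items():
    equity = equity_amount(share)
    position = position_toward_equality(kept, equity, equality)
    results[label] = (equity, position)
    print(f"{label}: equity = {equity:.0f} cents, kept = {kept:.2f} cents, "
          f"position toward equality = {position:.2f}")
```

Under these assumptions the equity points round to 27¢ and 85¢, and the amounts kept fall a little under and a little over the midpoint between equity and equality, which is consistent with the description "about halfway."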

A similar preference for equality, or at least 
a diminished tendency to achieve equity, can 
be found in the research of Morgan and 
Sawyer (1967) and Leventhal and Michaels 
(1969). Morgan and Sawyer found that when 
high school students had to achieve an agree- 
ment as to reward division when the reward 
potential of one student was five times higher 
than the potential for the other student, 
equal-division agreements were reached rather 
than equitable ones, even when the boys were 
not friends, and when the person who had the 
higher grades had the higher reward potential. 
Leventhal and Michaels varied the duration 
and quantity of subjects’ performance in- 
dependently, such that some subjects found 
they worked either 35% or 65% of the time 
and contributed either 35% or 65% of the 
quantity compared to partner. While subjects 
took more of the total reward for themselves 
when they worked 35% of the time and con- 
tributed 65% of the quantity (relative to the 
partner who worked 65% of the time and con- 
tributed 35% of the quantity), the amount 
taken was far less than equity theory would 
predict. If the subject and partner had worked 
an equal amount of time and the subject con- 
tributed 65% and the partner 35% to the task, 
then the subject should take 65% of the reward 
to achieve equity. The fact that the subject 
worked a much shorter time and still contri- 
buted 65% implies that he should have kept 
an even higher proportion for himself. Leven- 
thal and Michaels’ subjects, however, kept 
only 60.62% of the reward for themselves in 
this condition. Similarly, when the subjects 
worked for 65% of the time but contributed 
only 35% to the task, they took far more 
(39.06%) than would be predicted by equity 
theory. 

The major difference between the present 
study and the studies by Morgan and Sawyer 
(1967) and Leventhal and Michaels (1969), 
on the one hand, and the majority of equity 
studies, on the other, appears to be the nature of the ex- 
change relationship. In the 3 aforementioned 
studies, the exchange relationship between P 
and O was direct, while in each of the 13 studies 
reviewed by Pritchard (1969), the exchange 
between P and O was mediated by an employer. 
None of the studies reviewed by Pritchard 
suggests either a tendency to disregard input 
differences between P and O or a preference 
for equality over equity. The notion that 
equality might be preferred to equity is, in 
fact, suggested by Adams (1965) in discussing 
exchanges other than with an employer. He 
stated: “in a vast array of social relations re- 
ciprocity is a functional element of the relation. 
What is in fact referred to by reciprocity is 
equality of exchange [p. 278, italics added].” 
However, Adams does not discuss why equality 
should be the rule in direct exchanges. 

One reason for this preference for equality 
in direct exchanges may be a desire to maintain 
a relatively equal-status relationship and as- 
sure continuation of a profitable exchange. 
Thibaut and Kelley (1959) have shown that 
when outcomes of group members are public, 
status comparisons can easily be made. One 
consequence of status distinctions is to hinder 
communication between persons of different 
statuses and to lead to cliques and subgroup- 
ings within status levels (Hurwitz, Zander, & 
Hymovitch, 1960; Kelley, 1951; Strodtbeck, 
Simon, & Hawkins, 1965). Thus, status dis- 
tinctions can potentially disrupt an exchange. 
If two persons are in a direct exchange, and 
they find their outcomes better than those in 
alternative relationships (what Thibaut and 
Kelley, 1959, call comparison level for alterna- 
tives), their main concern is to keep the re- 
lationship going. To focus on differences in 
inputs and to demand equity would make the 
status discrepancy more salient and might 
threaten the existence of the relationship. 

Pritchard has also made a distinction as 
regards the amount of psychological distance 
in the exchange relationship. Two persons 
working for each other (direct exchange) are 
closer, psychologically, than in the typical 
equity study in which a person compares with 
a hypothetical other and where the outcomes 
are controlled by a third person, the employer. 
Pritchard (1969) predicts that differences in 
inputs and outcomes are easier to perceive 
and more salient the more intimate the 
relationship. 


Assuming this greater sensitivity to differences in 
ratios as the relationship between the two people is 
closer to the intimate end, and assuming that there is 
less direct correspondence between the two people’s 
ratios as the relationship moves toward the impersonal 
end, it would seem less likely that person will experi- 
ence inequity relative to the other person in the relation- 
ship the closer the relationship is to the impersonal 
end of the continuum [p. 206]. 


The arguments of Pritchard and those ad- 
vanced in the present study are obviously at 
odds. Pritchard argues that the ease of com- 
parison in a direct exchange leads to heightened 
sensitivity to differences and greater feelings of 
inequity, while the present author proposes 
that desire to maintain a profitable relation- 
ship in a direct exchange will lead to discount- 
ing of differences to preserve equal status. The 
data from the present study cannot be taken 
as support for the point of view presented here, 
since the data preceded the theory. What is 
needed are studies in which the “psychological 
closeness” or the “directness” of the exchange 
is systematically varied, together with differ- 
ences in the inputs between P and O. 

EFFECTS OF LOCUS OF CONTROL AND TYPE OF FEEDBACK ON THE 
TASK PERFORMANCE OF LOWER-CLASS BLACK CHILDREN¹ 

Sixty fifth-grade black males from two inner-city elementary schools participated 
in an experiment designed to examine the effects of variations in locus of control— 
internal versus external—on the efficiency of intrinsic and extrinsic types of feed- 
back. The major point of divergence between feedback conditions was whether 
subjects’ knowledge of correctness was acquired by self-discovery of successful task 
performance (intrinsic reinforcement) or whether the subjects had to depend on the 
experimenter’s personal praise in lieu of being able to directly observe the success of 
their task performances (extrinsic reinforcement). A combined condition con- 
sisting of both intrinsic and extrinsic feedback was also used. Results indicated 
that, as hypothesized, internally oriented subjects made significantly more “cor- 
rect” responses in the condition of intrinsic reinforcement than externally oriented 
subjects, while the reverse was true in the condition of extrinsic reinforcement; 
that is, the performances of externals were superior to those of internals. It was also 
found that locus of control did not differentiate performance effects under the com- 
bined condition. The implications of these findings for elucidating possible dif- 
ferential motivational bases for the performance of internals and externals are 
discussed. 


The major purpose of the present study was 
to investigate if the introduction of information 
as to whether children perceive themselves as 
internally or externally controlled for success 
and failure in the academic domain will affect 
the efficacy of intrinsic and extrinsic types of 
reinforcers. There is considerable evidence, 
both direct and indirect, that appears to sup- 
port the proposition that extrinsic rewards 
(e.g., tangible rewards, global social approval) 
produce more effective performance than in- 
trinsic rewards (e.g., abstract, correctness- 
oriented feedback) with lower-class children, 
while the reverse appears to be true with mid- 
dle-class children (see Havighurst, 1970; Zigler 
& Child, 1969; for recent reviews of relevant 
studies). 

One possible interpretation of these findings 
is that there are class and possibly race differ- 
ences in the incentive value of intrinsic and ex- 
trinsic reinforcers (Zigler & Child, 1969). From 
a developmental perspective, Zigler and Child 
(1969) suggested that middle-class children 


1 The writing of this report was supported by Na- 
tional Science Foundation Grant GS-2372 to the first 
author. The report is based in part on a master’s thesis 
by R. L. Ganz, who carried out the experiment de- 
scribed in this study. The writers are indebted to G. 
Cooke for his comments on the conduct of the research. 

2 Requests for reprints should be sent to Reuben M. 
Baron, Department of Psychology, Wayne State Uni- 
versity, Detroit, Michigan 48202. 


attain at an earlier age a level of development 
where a “motive of effectiveness” becomes 
central and that there is a shift away from a 
focus on reinforcement by others toward rein- 
forcement by self. 

Given the far-ranging implication of Zigler’s 
developmental motivational formulation (Zig- 
ler & Child, 1969; Zigler & Kanzer, 1962), it 
is particularly unfortunate that the findings of 
his seminal study—Zigler and Kanzer (1962)— 
have not been replicated in three recent stud- 
ies. Rosenhan and Greenwald (1965) and Mc- 
Grade (1966), using child samples similar in 
age to the Zigler and Kanzer study, failed to 
replicate their major findings; no class differ- 
ences were found in the relative efficacy of two 
types of verbal reinforcers—personal praise 
and knowledge of correctness. The Rosenhan 
and Greenwald study is of particular interest 
since it added a lower-class black sample and 
still failed to find inferior performance with a 
“correctness” type verbal reinforcer. Further- 
more, Baron, Bass, and Vietze (in press), us- 
ing a sample of lower-class black females aged 
16-21, also failed to find significant differences 
in the efficacy of correctness-oriented and per- 
sonally oriented verbal praise. 

Since these investigations cast doubt on both 
the strength and generality of Zigler’s formu- 
lation, it is of considerable interest that there 
is one recent study that provides at least par- 
tial support for Zigler’s position. Young and 
Beller (1966) reported an investigation, using 
lower-class black children of kindergarten age, 
that replicates Zigler and Kanzer’s (1962) find- 
ing that lower-class children perform more 
effectively under conditions where personal 
praise rather than correctness is stressed. 

Instead of using two kinds of verbal rein- 
forcers, one focused on personal praise and the 
other on correctness, the Young and Beller 
(1966) study contrasts an extrinsic reward con- 
dition, where the child’s knowledge of success 
is mediated through the experimenter’s verbal 
praise, with an intrinsic reward condition, 
where knowledge of success is available through 
self-observation (i.e., the child is allowed to dis- 
cover for himself whether he has correctly 
solved a form-discrimination problem). In ad- 
dition to the basic finding that for their lower- 
class black sample extrinsic praise is more effec- 
tive than self-discovery, Young and Beller also 
found that for the condition of intrinsic reward, 
children with prior nursery school experience 
performed better than those without such ex- 
perience. Variations in prior nursery school 
experience made no difference for the condition 
of extrinsic reward (these effects emerged on 
the terminal trial blocks). 

The Young and Beller (1966) study raises 
the possibility that previous failures to con- 
firm Zigler’s hypothesis may be due to the fact 
that there is not sufficient psychological dis- 
tance between being told you are correct and 
being told you are good. For example, it is diffi- 
cult to know whether subjects who are told that 
they are correct will be primarily motivated 
to be correct for “the sheer sake of correct- 
ness” or whether they are also motivated by a 
desire to influence the experimenter’s evalua- 
tion of them (Baron, 1970; Jones, 1964). 

The present investigation seeks to build upon 
the Young and Beller (1966) study by refining 
and extending their procedures for differentiat- 
ing between opportunities for intrinsic and ex- 
trinsic reward, while at the same time adding an 
individual difference variable—locus of con- 
trol—which has considerable conceptual and 
empirical power.³ The locus of control construct 


3 It may be noted that the extrinsic reward condition 
in the Young and Beller (1966) study did not clearly 
eliminate the possibility of the subject receiving addi- 
tional intrinsic feedback, since subjects were able to see 
the “correct” box in addition to being praised by the 




as used by Rotter (1966) refers to a generalized 
expectancy relating to whether or not one feels 
he has power or control over what happens to 
him. The particular measure of this construct 
deemed most suitable for the present purposes, 
Virginia Crandall’s Intellectual Achievement 
Responsibility (IAR) scale, focuses on whether 
children perceive that they are responsible for 
their intellectual academic successes and fail- 
ures or that external factors (e.g., luck or 
other people) are responsible for them (Cran- 
dall, Katkovsky, & Crandall, 1965). 


Type of Feedback and Locus of Control 


While subjects’ generalized expectancies of 
control are not likely to differ within the two 
feedback conditions—internals will remain in- 
ternals, externals will remain externals—it is 
suggested that certain motivational correlates 
of being internal or external will be differen- 
tially affected by intrinsic and extrinsic feed- 
back. Evidence is rapidly accumulating that 
internals have a higher level of achievement 
motivation than externals (Weiner & Kukla, 
1970) and are better able to regulate their be- 
havior by self-reinforcement (Heilbrun & 
Norbert, 1970; Weiner, Frieze, Kukla, Reed, 
Rest & Rosenbaum, 1971). On the other hand, 
externals appear to be more responsive to the 
demands of other people than internals. Thus, 
externals are highly responsive to external 
social reinforcement, both vicarious and direct 
(Heilbrun, 1970) and are more influenced by 
conformity pressures in social influence situa- 
tions than are internals (Crowne & Liverant, 
1963; Odell, 1959). 

A close analysis of the situational contexts 
used to demonstrate these effects suggests that 
these differences may not be transsituational, but 
rather may reflect Trait X Situation interactions. 
Research reviewed in Weiner et al. (1971) 
makes it clear that the motivational advan- 
tages that internals possess over externals are 
most clearly manifested in skill as opposed to 
chance situations. On the other hand, the situ- 
ations that find externals more responsive than 
internals involve ambiguous task structures 


experimenter. Thus, the possibility is raised that the 
subjects in this condition were exposed to both intrinsic 
and extrinsic sources of reward. In the present study, 
this possible source of confounding is eliminated by 
placing a screen over the boxes immediately after the 
subject makes his judgment (see Method section). 
(chance rather than skill settings?) and maxi- 
mize the importance of pleasing others. Self- 
evaluation of adequacy of performance is usu- 
ally made difficult or impossible by such ex- 
perimental arrangements. 

In the present context, the self-discovery 
treatment should provide setting conditions 
where success will cue off a constellation of 
pride in accomplishment and self-reinforce- 
ment, factors that should strongly motivate 
internals but not externals. That is, the intrin- 
sic reward condition is likely to support 
achievement-, as opposed to social-approval- 
type motives. The present extrinsic reward 
condition, because it involves physically 
screening the subjects from the problem boxes 
immediately after they make a judgment, 
creates a situation where subjects are unable to 
directly verify the accuracy of their responses. 
This restriction may change the meaning of 
the task from a skill to a more chancelike situ- 
ation. A white experimenter’s unverifiable 
verbal praise may be seen not as a veridical
assessment of performance, but rather as an
attempt to indiscriminately encourage sub- 
jects, irrespective of the quality of their per- 
formance. In the extreme case, attitudes of 
condescension or manipulative intent may be 
inferred and reactance motivation aroused 
(Brehm, 1966). 

Externals, being low in achievement motiva- 
tion, are not likely to be adversely affected by 
being put in a more chancelike task situation. 
What is important for externals is that with the 
introduction of verbal praise, they have available
an external source of motivation. Externals
are now likely to expend more effort,
not because they are concerned with pride of 
accomplishment, but because it will please a 
powerful other.* Thus, unverifiable verbal 
praise is likely to weaken the motivational basis 
of an internal’s performance while strengthen- 
ing the motivational basis of performance by 
externals. 

If the above pattern of motivational assumptions
is correct, accuracy of problem solving
should reflect an interaction between locus
of control and type of feedback. Specifically, 
it was hypothesized that (a) when success feed- 
back is conveyed by self-discovery, internals 

* Differences in accuracy of judgment on the present 
task are assumed to be largely a product of factors that 


influence performance as opposed to learning processes 
(see Discussion section). 
will perform better than externals; (b) when
success is conveyed solely through an experimenter’s
verbal praise, externals will perform
better than internals. A combined condition
involving both types of feedback is also used
on an exploratory basis, in order to help interpret
the other conditions. No clear prediction
is offered as to the differential behavior of inter- 
nals and externals in this condition. 


METHOD 
Subjects 


The subjects were 60 black males in the fifth grade of 
two inner-city elementary schools in Detroit. In addition
to being homogeneous as to race and sex, all of the
subjects in the final sample came from families that 


Procedure 


In order to determine if the subjects were internally
or externally controlled, the IAR questionnaire was
administered individually to each child. The questionnaire
contains 34 items assessing children’s beliefs that
they, rather than other people, are responsible for their
intellectual and academic successes and failures. The
experimenter read each question to each subject, and
the subject then checked the choice that seemed best to
him.

Seventeen items referred to success situations, and 
17 items referred to failure situations. Each item con- 
sisted of two alternatives, one referring to internal con- 
trol and the other referring to external control. The 
child received 1 point for each answer in the internal


calculated, one for success and one for failure. The actual
range in the present experiment for the success
items was from 3 to 16. For failure items, it was from


cutting points for both success and failure were 9). It 
was decided, for the purposes of the present experiment, 
to use only subjects who were either internally or ex- 
ternally controlled for both success and failure; subjects 
who were classified as internal for success and external
for failure, or vice versa, were eliminated from the 
study.
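
The selection rule described above can be sketched in a few lines. This is only an illustration: the function name classify_subject is hypothetical, and the sketch assumes that a subscale score at or above the reported cutting point of 9 counts as internal, a detail the text does not state.

```python
from typing import Optional

CUTTING_POINT = 9  # cutting point reported for both the success and failure subscales


def classify_subject(success_score: int, failure_score: int,
                     cut: int = CUTTING_POINT) -> Optional[str]:
    """Return 'internal', 'external', or None for a mixed (excluded) subject.

    Assumption: a score at or above the cutting point is treated as internal;
    the original text does not say how scores exactly at the cut were handled.
    """
    success_internal = success_score >= cut
    failure_internal = failure_score >= cut
    if success_internal and failure_internal:
        return "internal"   # consistent internal: retained
    if not success_internal and not failure_internal:
        return "external"   # consistent external: retained
    return None             # mixed type: eliminated from the study


# Hypothetical scores, for illustration only.
print(classify_subject(12, 11))  # consistent internal
print(classify_subject(5, 6))    # consistent external
print(classify_subject(5, 12))   # mixed type, excluded
```

Under this rule a subject is kept only when both subscales agree, which is why more children had to be screened than were finally tested.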
In order to further explicate the selection procedure, 
the following should be noted. On the average, in each 
class, 50% of the subjects could be classified as external 
for success and failure, 25% as consistent internals, and 
25% were mixed types. (The mixed types were pri- 
marily internal for failure and external for success.) In 
the present case, this meant that many more subjects 
had to be tested in order to achieve the same number of 
consistent internals as externals. (We aimed for a final 
sample of 30 in each group.)
Although this finding is consistent with previous research,
which indicates that a lower-class black sample
is likely to be more externally than internally oriented
([...]urt, 1966), it does raise the possibility that because
the internals are more highly selected, they differ
from the externals on dimensions other than locus of
control. While it is not possible to demonstrate that
this is not the case for some individual difference factor,
it was deemed important to demonstrate that IQ differences
were not confounded with the difference in locus
of control. To this end, average IQ scores based on the 
California Test of Mental Maturity were compared for 
the consistent internals and externals. No significant 
differences were found (t < 1).


Experimental Task 


The experimental task required the subjects to learn 
a simple form discrimination in order to find a hidden 
object. Three black boxes, differing only in that each
had either a red square, a red triangle, or a red circle on
its top surface, were presented to each subject. For each
subject, the object was always hidden under the same box;
however, it was different for different subjects. For
some it was the square, for some the triangle, and for
some the circle. For example, one correct pattern of
responses would be the hiding of the object under the red
circle. Prior to each trial, the three boxes were rear- 
ranged in a standardized randomly predetermined 
order, hidden from the child’s view by a screen. For 
each experimental trial the screen was removed, and 
the subjects were instructed to search for the hidden 
object. All subjects received 30 experimental trials. 
Three conditions of reinforcement were employed. 
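
A "standardized randomly predetermined order" can be produced by drawing the box arrangements once from a seeded random generator and reusing the same list for every subject. The sketch below is an assumption about how such a schedule might be built; the seed, the function name, and the use of a software generator are illustrative and are not taken from the original procedure.

```python
import random

SHAPES = ("square", "triangle", "circle")


def predetermined_orders(n_trials: int = 30, seed: int = 0):
    """Return one left-to-right box arrangement per trial.

    A fixed seed makes the random order identical on every call, so
    every subject would see the same sequence of arrangements.
    """
    rng = random.Random(seed)
    orders = []
    for _ in range(n_trials):
        arrangement = list(SHAPES)
        rng.shuffle(arrangement)      # random position for each box
        orders.append(tuple(arrangement))
    return orders


schedule = predetermined_orders()
print(len(schedule))   # one arrangement for each of the 30 trials
print(schedule[0])     # arrangement shown on the first trial
```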


Conditions of Reinforcement 


The major point of divergence between reinforcement 
conditions was whether the subject acquired knowledge 
of his performance by his sighting or not sighting the 
hidden object when the box was lifted (intrinsic rein- 
forcement) or whether he had to depend on the experi- 
menter’s reactions in lieu of observing the boxes being 
lifted (extrinsic reinforcement). 

Extrinsic reinforcement. In the condition of extrinsic 
reinforcement, the subjects received no information 
from their own observations. The experimenter re- 
placed the screen immediately after the child pointed 
to the box that he thought the object was hidden under.
The experimenter then either said “very good” if he was 
correct, or “sorry, try again,” if he was incorrect. The 
subjects in this condition never saw the boxes lifted nor 
were they able to observe the box they chose at the 
time that the verbal feedback was being given by the 
experimenter.

Intrinsic reinforcement. After each trial in this condi- 
tion, the experimenter lifted the box chosen by the sub- 
ject as correct. He said and did nothing in response to 
the subject’s performance, being in effect a null
instrument. The subject’s only feedback was his observation,
based on the presence or absence of the hidden object,
that he was correct or incorrect. In this condition,
the source of reinforcement was assumed to be the
subject’s intrinsic satisfaction at discovering that he had
made the correct response when he sighted the hidden
object.

Combined condition. In this condition, the subject
received both intrinsic and extrinsic feedback. After the
subject pointed to the box under which he thought the
object was hidden, the experimenter raised the box to
let the subject see the results and then said either “very
good” or “sorry, try again.” The screen was then
replaced for each additional trial.

The experimenter for all conditions was a white male 
graduate student. It should also be noted that following 
the 30 experimental trials, all subjects received 10 
additional trials during which the object was hidden ran- 
domly under each of the boxes. This procedure was 
adopted to confuse the subjects who would be tempted 
to communicate a solution pattern to subjects yet to 
be tested. 


RESULTS⁵
Number of Successes 


To test the first hypothesis, stating that 
“locus of control and type of reinforcement 
interact,” locus of control scores were first de- 
termined for each subject according to his 
performance on Crandall’s IAR scale. Then, 
according to which reinforcement condition he 
was in, he received a score for the number of 
times he correctly guessed under which box 
the object was hidden. A two-way analysis of 
variance was used with 10 subjects per cell.
Variable A, locus of control, has two levels— 
internal and external control; Variable B, the 
type of reinforcement, has three levels—intrin- 
sic reinforcement, extrinsic reinforcement, and 
a combination of intrinsic and extrinsic rein- 
forcement. 

The effects of the experimental conditions 
on the mean number of correct responses were 
evaluated by a 2 × 3 factorial design, the results
of which are summarized in Table 1.

The analysis of variance failed to yield 
significant main effects for either locus of con- 
trol or type of reward. There was, however, 
strong support for our major experimental hy- 
pothesis: A significant interaction was found 
between type of reinforcement and locus of 
control (see Table 1). 
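
As a rough check on the pattern behind this interaction, the between-cell sums of squares of a balanced 2 × 3 design can be recomputed from cell means alone. The sketch below uses the rounded cell means reported in Table 2, the cell size of 10, and the within-cell mean square reported in Table 1. The function name and dictionary layout are illustrative assumptions, and values derived from rounded means need not agree with the printed table to the last digit.

```python
# Cell means as reported in Table 2 (rows: locus of control; columns: feedback).
CELL_MEANS = {
    "internal": {"intrinsic": 21.9, "extrinsic": 12.2, "combined": 19.9},
    "external": {"intrinsic": 15.9, "extrinsic": 20.1, "combined": 21.2},
}
N_PER_CELL = 10      # subjects per cell
MS_WITHIN = 45.38    # within-cell mean square from Table 1 (df = 54)


def two_way_from_cell_means(cell_means, n, ms_within):
    """Main-effect and interaction MS and F for a balanced two-way design."""
    rows = list(cell_means)
    cols = list(next(iter(cell_means.values())))
    grand = sum(cell_means[r][c] for r in rows for c in cols) / (len(rows) * len(cols))
    row_mean = {r: sum(cell_means[r][c] for c in cols) / len(cols) for r in rows}
    col_mean = {c: sum(cell_means[r][c] for r in rows) / len(rows) for c in cols}

    ss_a = n * len(cols) * sum((row_mean[r] - grand) ** 2 for r in rows)
    ss_b = n * len(rows) * sum((col_mean[c] - grand) ** 2 for c in cols)
    ss_cells = n * sum((cell_means[r][c] - grand) ** 2 for r in rows for c in cols)
    ss_ab = ss_cells - ss_a - ss_b   # interaction is what remains between cells

    df_a, df_b = len(rows) - 1, len(cols) - 1
    table = {}
    for name, ss, df in (("A", ss_a, df_a), ("B", ss_b, df_b), ("AxB", ss_ab, df_a * df_b)):
        ms = ss / df
        table[name] = {"df": df, "MS": ms, "F": ms / ms_within}
    return table


for source, row in two_way_from_cell_means(CELL_MEANS, N_PER_CELL, MS_WITHIN).items():
    print(source, row["df"], round(row["MS"], 2), round(row["F"], 2))
```

With these inputs the mean square for locus of control comes out at about 17.07, the value printed in Table 1, and the interaction term is the largest of the three effects.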

Because we predicted the specific pattern- 
ing of means that would characterize the inter- 
action, the mean differences were evaluated by 
orthogonal comparisons. The means for the 
experimental conditions are presented in 
Table 2.⁶


5 The findings for the two inner-city schools are com- 
bined in the present analyses because a preliminary 
analysis revealed no significant difference between the 
schools for any of the conditions of our factorial design. 

6 The significance of the orthogonal comparisons,
based on df = 1/54, is evaluated by a one-tailed test 
of significance. A one-tailed test was used because the 
TABLE 1

ANALYSIS OF VARIANCE FOR NUMBER OF CORRECT CHOICES ON THE TASK

Source                       df     MS       F
Locus of control (A)          1    17.07     .37
Type of reinforcement (B)     2    95.57    2.19
A × B                         2    41.00    [illegible]*
Within error                 54    45.38

*p < .01.


The application of a planned-comparisons 
test to this data indicated that, as hypothe- 
sized, internally controlled subjects given 
intrinsic reinforcement performed significantly 
better than externally controlled subjects given 
intrinsic reinforcement (F = 3.97, p < .03). 
The results also indicated that, as hypothe- 
sized, externally controlled subjects given ex- 
trinsic reinforcement performed significantly 
better than internally controlled subjects given 
extrinsic reinforcement (F = 6.88, p < .005). 
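
For two cells of equal size n, the sum of squares for a pairwise contrast is n(M1 − M2)²/2, and dividing it by the within-cell mean square gives an F with 1 and 54 degrees of freedom. The sketch below applies that formula to the cell means in Table 2. The function name is hypothetical, and the calculation is offered as an illustration of how such comparisons are formed, not as the authors' exact procedure.

```python
def pairwise_contrast_f(mean_1, mean_2, n_per_cell, ms_within):
    """F ratio (df = 1, error df) for the difference between two equal-n cell means."""
    ss_contrast = n_per_cell * (mean_1 - mean_2) ** 2 / 2
    return ss_contrast / ms_within


MS_WITHIN = 45.38   # within-cell mean square from Table 1
N = 10              # subjects per cell

# Intrinsic feedback: internals (21.9) versus externals (15.9).
print(round(pairwise_contrast_f(21.9, 15.9, N, MS_WITHIN), 2))  # 3.97
# Extrinsic feedback: externals (20.1) versus internals (12.2).
print(round(pairwise_contrast_f(20.1, 12.2, N, MS_WITHIN), 2))  # 6.88
```

Both values agree with the F ratios reported in the text for the intrinsic and extrinsic comparisons.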

An additional orthogonal comparison found
that the combined condition failed to differentiate
significantly between internals and externals.
It may be noted, however, that while
internally controlled subjects failed to do as
well in the combined conditions as they did
with intrinsic reward alone, externally oriented
subjects performed better under the combined
condition than when just given extrinsic
reinforcements.

In conclusion, locus of control significantly
differentiates between intrinsic and extrinsic 
type of reward when these conditions are 
presented alone, but not when they are pre- 
sented together. 


DISCUSSION


The major findings in the present study were 
as predicted: under a condition of extrinsic 
reward, externals performed better than in- 
ternals, while the reverse was true under a 
condition of intrinsic reward—internals per- 
formed better than externals. These results 
support our contention that self-discovery and 
unverifiable verbal praise provide differential 
motivational bases for the behavior of internals 
and externals. Consistent with this analysis, 
Heilbrun (Heilbrun, 1970; Heilbrun & Norbert,
1970) has recently found that antecedent
conditions that optimize achievement motivation
and effective self-reinforcement
(noncontrolling, nurturant mothers) differ
dramatically from the antecedent conditions
associated with high responsiveness to external
reinforcement (highly controlling, nonnurturant
mothers).


present model permitted directional hypotheses on an
a priori basis.


Ancillary Comparisons 


Further insight into the motivational basis
of our data is provided by comparisons
involving the combined condition. First, it is of
interest that the extrinsic-alone condition
elicits a significantly lower level of performance
for internals than either the intrinsic or
combined conditions. (A simple-effects test within
internal locus of control differentiates the
extrinsic condition, p < .01, two-tailed.) While
further research is needed to specify the precise
basis of this effect, these data clearly suggest
that it is not the presence of interpersonal
feedback per se, but the internal’s inability to
directly verify the accuracy of feedback by his
own observations that depresses performance.
In contrast, externals perform at a low level 
when the only source of feedback that they 
have is their own observations. This pattern of 
effects suggests that externals require external 
support in order to perform effectively. Because 
the experimenter is physically present, albeit 
nonresponsive, in the intrinsic reward condi- 
tion (as he was in Young & Beller’s, 1966, treat- 
ment), it is possible that externals, being 
sensitive to interpersonal cues, interpret his 
nonresponsiveness as indicating disapproval. 
Katz, Henchy, and Allen (1968) have demon- 
strated that disapproval from a white examiner 
has a debilitating effect on the performance of 
interpersonally oriented black subjects. While
the assumption of this kind of negative motivational
force is not necessary to derive the
present results, it would be desirable to control
for such effects by automating our task in
the intrinsic condition so as to remove the need
for the experimenter’s presence.


TABLE 2

MEAN NUMBER OF CORRECT RESPONSES ACCORDING TO LOCUS OF CONTROL AND TYPE OF REINFORCEMENT

                          Type of reinforcement
Locus of control    Intrinsic    Extrinsic    Intrinsic and extrinsic
Internal            21.9 (a)     12.2 (b)     19.9 (c)
External            15.9 (d)     20.1 (e)     21.2 (f)

Note. Cell n = 10. The results of the planned comparisons
are presented in the text; the comparisons made were a versus
d, b versus e, and c versus f (df = 1/54 for each of these
comparisons).


Task Parameters 


Given what appeared to be a simple form- 
discrimination task, it is somewhat surprising 
that subjects in all conditions performed at 
well below the ceiling level. One possible inter- 
pretation is that our subjects perceived the 
task as being more complex than it actually 
was. That is, the subjects may have assumed 
that there was more than one correct solution 
pattern and therefore switched solution strate- 
gies even after initial success. Empirically, it 
may be noted that all groups initially did con- 
siderable switching before sticking to the 
correct solution. Indeed, one way to sum- 
marize the present findings, which are based 
on means collapsed over trials, is to say that 
these data primarily reflect differences in the 
point at which different treatment groups 
stabilize their performance around the correct 
solution pattern; for instance, internals stay 
with the correct solution pattern sooner than 
externals do under the condition of intrinsic 
reward. Such findings may be indicative of
differences in the conditions under which 
different groups establish trust in the stability 
of the solution structure and the intentions 
of the experimenter. 

It is also possible, given our restricted range 
of IQs, that the overall poor performance on 
the present abstract task is due to a cognitive 
deficit phenomenon (cf. Dreger & Miller, 
1968). The work of Katz (cf. Katz, 1970) sug- 
gests a motivational interpretation; a high 
level of evaluative anxiety and/or increased 
expectancies for failure serve to depress per- 
formance when a white examiner is present. 

Given the above range of possible explana- 
tions, it would be highly desirable to in- 
troduce variations in such task dimensions 
as abstractness, difficulty, and academic relevance.
Possible interference effects from evaluative
anxiety should also be controlled or made
the object of specific study in future investigations.
For example, black as well as white
examiners should be crossed with black and 
white subjects. 
Comparison with Previous Findings


The present findings for the intrinsic reward 
treatment fit nicely with Young and Beller’s 
(1966) finding that children with prior nursery 
school experience perform better than those 
without such experiences. It may be argued 
that children with such experience are more 
likely to have an internal than external con- 
trol orientation. Specifically, there is probably 
a good deal of self-selection, such that lower- 
class black children who have attended nursery 
school come from parents who have attitudes 
that are more favorable to the development 
of an internal locus of control; for instance, 
they are more likely to use and value control 
strategies that foster the development of inde- 
pendent problem solving than the average 
lower-class mother (Hess & Shipman, 1965). 
Furthermore, the nursery school experience 
itself may be assumed to increase the child’s 
opportunities for autonomous play and prob- 
lem-solving activities. 

Young and Beller’s (1966) finding that 
variations in prior nursery school experience 
did not affect the efficacy of their extrinsic 
treatment when compared to our strong dif- 
ferential effects of locus of control in this 
condition can be viewed in two ways: (a) Their 
extrinsic treatment, because it did not prevent 
self-observation of the “correct” box, may 
not have weakened the internal’s achievement 
motivation as our treatment apparently did; 
(b) they used black children of kindergarten 
age, while the present black sample ranged in 
age from 10 to 11 years.⁷ It may be speculated
that younger internals are less likely to be 
disturbed by unverifiable verbal praise; for 
instance, the distinction between chance and 
skill tasks may not be salient. 


Conclusions and Implications 


The present results suggest that by early 
adolescence, the efficacy of intrinsic and ex- 
trinsic rewards is moderated by variations in 
locus of control. Preliminary data obtained 
as part of a replication and extension of the 
current paradigm add further strength to this 
generalization: The present Locus of Control 


7 The decision to use Crandall’s IAR scale, which
requires a higher level of verbal comprehension than is
likely to be found with children of kindergarten age, 
necessitated the older sample. 
× Type of Reinforcer interaction was replicated
for two new samples, one black and one white. 
At a methodological level, the present study, 
along with the investigation of Young and 
Beller (1966), suggests the importance of 
moving away from using two sets of verbal 
reinforcers when comparing the efficacy of in- 
trinsic and extrinsic sources of reinforcement. 

Finally, it should be noted that Young and 
Beller’s (1966) study, the present findings, 
and a recent investigation by Gruen, Ottinger, 
and Zigler (1970) all point to the value of 
focusing on genotypical psychological variables 
such as level of aspiration (Gruen et al., 1970) 
or locus of control, which cut across pheno- 
typical categories such as race and/or social 
class. Such cross-cutting variables both pro- 
vide clearer links to theory (cf. the work of 
Weiner et al., 1971) and suggest intervention 
programs more readily than when explanations 
are built around the apparently intractable 
causal categories of race or class.⁸


8 Although Weiner's recent work became available to 
us after the present research had been completed, our 
findings may be seen as providing behavioral support for 
some of the linkages between the antecedents and con- 
sequences of locus of control implied by Weiner's at- 
tempts to relate attribution theory, achievement moti- 
vation, locus of control, and self-reinforcement (cf. 
Weiner & Kukla, 1970; Weiner et al., 1971). At the
applied level, the present findings suggest that we ask 
questions such as how we might best modify locus of 
control or how we might vary present learning contexts 
to compensate for variations in locus of control. 
EFFECTS OF ADULT SOCIAL EXPERIENCE ON PATTERNS OF
REPRODUCTIVE ACTIVITY OF SOCIALLY DEPRIVED MALE 
RHESUS MONKEYS (MACACA MULATTA) 


ELIZABETH A. MISSAKIAN¹
Laboratory of Perinatal Physiology, San Juan, Puerto Rico 


The problem under investigation involved the extent to which social group ex- 
perience as an adult could modify and/or reverse atypical behaviors produced by 
rearing under conditions of social deprivation. Three socially deprived adult male 
rhesus monkeys were provided with 3-6 months experience in social groups con- 
taining emotional rather than chronological peers. The group experience modified 
the social behavior of two of the three subjects. The nature of the changes involved 
an increase in social grooming, a decrease in physical aggression, and an increase in 
mounting activity. The increase in these behaviors was accompanied by decreases 
both in stereotyped locomotion and self-aggression. The failure of this social ex- 
perience to produce change in one of the subjects provided valuable information 
about possible factors involved in the behavioral change. Perhaps the most import- 
ant difference was the failure of this male to engage in social play while in the group. 
Data from this study indicate a direction for future research regarding the question 
of reversibility of the effects of social deprivation. The type and duration of social
experience afforded these monkeys represents the only instance of behavior modification
in socially deprived rhesus monkeys.


Several investigators have demonstrated 
that male rhesus monkeys reared under con- 
ditions of social deprivation exhibit aberrances 
in sexual behavior as adults (Harlow & Harlow, 
1965; Mason, 1960; Missakian, 1969). Ex- 
amples of atypical behaviors include the total 
absence of mounting activity and the execu- 
tion of inappropriately oriented mounts. In 
the studies cited above, no socially deprived 
male rhesus monkeys have been observed to 
execute species-typical sequences of sexual 
activity. 

On the basis of current evidence, it seems 
clear that social deprivation alters the de- 
velopment of appropriate sexual behavior in 
some yet unknown way. In the attempt to 
specify critical variables involved in the de- 
velopment of such behavior, two approaches 
have emerged: first, a systematic investigation 
of developmental variables in relation to social 
behavior. Studies of this nature have typically 
involved parameters related to length of social 
isolation, age at onset of social isolation, and 
peer-group rearing. Harlow and Harlow pro- 
vided a comprehensive summary of the be- 
havioral results of these studies. 


1 Requests for reprints should be sent to Elizabeth A.
Missakian, who is now at Rockefeller University, New 
York, New York 10021. 


The second approach to the study of varia- 
bles influencing the development of species- 
typical sexual behavior has involved the at- 
tempt to reverse the effects of such deprivation 
through various types of social exposure. 
Harlow and Harlow (1965) reported little 
evidence of behavior change under individual 
cage conditions, despite repeated attempts to 
expose both male and female socially deprived 
monkeys to wild-reared animals of the op- 
posite sex. Further efforts to reverse the 
effects of rearing under conditions of social 
deprivation involved placing rhesus monkeys 
onto a zoo island in Madison, Wisconsin. 
Harlow (1965) reported that while in this 
social situation, some of the monkeys exhibited 
modifications of atypical or asocial behaviors. 
However, these modifications were slight and 
reverted to former patterns of aberrant be- 
havior upon reintroduction into the individual 
cage condition. 

Missakian (1969) examined the influence of 
adult social experience and found that socially
deprived males with a history of periodic exposure
to sexually receptive females did not
show any improvement in terms of social responses.
At no time were any of these males
observed to execute an appropriately oriented
mount.
TABLE 1

NUMBER, YEAR OF BIRTH, AND NUMBER OF MATINGS WITH SEXUALLY RECEPTIVE FEMALES DURING 1966 AND 1967 PER SUBJECT

Subject’s number    Year of birth    Matings, 1966    Matings, 1967
49                  1959             26               15
218                 1961             0                15
219                 1961             0                15

The failure to reverse the effects of social 
deprivation in the case of rhesus monkeys is 
in sharp contrast to evidence available for 
socially deprived chimpanzees. Rogers and 
Davenport (1969) demonstrated that recovery 
from behavioral injury occurs in socially deprived
chimpanzees who have been accorded
a variety of social experience. In the case of
macaques, previous attempts to reverse the
effects of social deprivation do not provide 
conclusive evidence regarding the possibility 
of behavior modification. This is true for at 
least two reasons. First, previous research has 
involved exposure of such monkeys to animals 
reared under conditions of social deprivation 
rather than to wild-reared animals. Second, a 
relatively short-term approach has been em- 
ployed. Typically, animals have been exposed 
to monkeys reared under laboratory conditions 
and/or exposed to wild-reared monkeys but 
on a relatively short-term basis. It seems clear 
that any attempt to reverse atypical patterns 
of behavior must include long-term exposure 
to animals whose behavior patterns are typical 
of the species under study. 

The purpose of this study was to introduce 
socially deprived male rhesus monkeys into 
groups consisting of wild-reared monkeys in 
order to determine the extent to which such 
exposure could modify and/or reverse atypical 
patterns of social behavior. 


METHOD 
Subjects


Subjects in this study were three male rhesus mon- 
keys (Macaca mulatta) reared under conditions of 
social deprivation at the Laboratory of Perinatal 
Physiology, San Juan, Puerto Rico. Each male was 
separated from his mother within the initial 12 hours 
after birth and placed in an individual cage in the 
nursery. Details of nursery care at the laboratory are 


available elsewhere (Fleischman, 1963). At the age of
100 days, each male was transferred to the juvenile
colony and maintained again in an individual cage.
Upon attaining the age of reproductive maturity, each
male was transferred to an individual cage in the
breeding colony and was periodically exposed to
sexually receptive females. Until their transfer to the
breeding colony, the males had had both visual and
auditory, but no physical or manipulative, contact
with other monkeys. The males had been used as
subjects in a study cited previously concerning the
effects of social deprivation on patterns of adult
reproductive activity (Missakian, 1969).


Procedure 


Presocial group experience. Table 1 contains information
relevant to the number, age, and number of
matings with sexually receptive females during 1966
and 1967 for each of the three subjects. Each mating
represents 24-48 hours of exposure to wild-reared
sexually receptive females chosen at random from among
the females in the breeding program at the laboratory.
The basic procedure outlined by Ponce de Lugo (1964)
for determination of estrus in the rhesus female was
followed throughout this study. The procedure involved
daily examination of females in the colony to determine
onset and cessation of menstrual flow. Females were
then exposed to males 12-15 days after onset of menses.


social groups were
formed in separate outdoor “gang cages” measuring
20 feet × 8 feet × 6 feet. Each group consisted of
three wild-reared females ranging in estimated age from
3 to 4 years, two wild-reared males estimated at 3
years of age, and one socially deprived male. Prior
experience indicated that adult wild-reared males were
aggressive toward the socially deprived males; therefore,
in order to prevent serious wounding and enhance
survival of the socially deprived males, young
wild-reared males were selected in the formation of the
groups. A second, but at the time less important,
reason for selecting young animals for inclusion into
the group was the hope that young animals would
facilitate social play in the groups and provide situations
whereby close spatial contact between the socially
deprived males and other animals in the group would
result. Thus, emotional peer groups rather than
chronological peer groups were established.




Animals were released into the gang cages simultaneously.
None of the monkeys had had prior experience
with one another. Male 218 was maintained in
the group from October 1967 through April 1968, a
total of 24 weeks. Male 49 was maintained in the
group from January 1968 through April 1968, a total
of 16 weeks. Male 219 was in the group from December
1967 through May 1968, a total of 24 weeks. The social
groups were observed for 1 hour daily per group during
the first month of their formation and 1 hour per group
per week for the remainder of the time the males were
in the group.


Behavior Categories 


The following behavior categories were employed to 
record the behavior of the male-female pairs during 
the initial 2 hours of the presocial group experience 
and also during the period of time each male was in 
the social group: grooming, contact aggression, gestural 
threat, self-aggression, stereotyped locomotion, mastur- 
bation, mount, and mount attempt. Both frequency 
and duration measures for the above behavior cate- 
gories were recorded. In order to assess the reliability 
of these measures, a double-observer procedure was 
employed, and product-moment correlations of data 
from each behavior category were computed. The 
range of these correlations was .96-1.00, thus indicating 
that the behavior categories in question were very 
reliable. 
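
The double-observer check amounts to correlating the two observers' scores for a behavior category across observation sessions. The sketch below computes a product-moment (Pearson) correlation in plain Python. The two score lists are invented for illustration and are not data from the study, and the function name is hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical grooming frequencies recorded by two observers in six sessions.
observer_a = [3, 0, 5, 2, 7, 1]
observer_b = [3, 1, 5, 2, 6, 1]
print(round(pearson_r(observer_a, observer_b), 2))
```

A coefficient would be computed in this way for each behavior category, and the lowest and highest coefficients give the reported range.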


RESULTS 
Presocial Group Experience 


None of the males were observed to execute 
a mount or mount attempt during the pre- 
social group experience. In addition, none of 
the vaginal lavages examined contained spermatozoa.
A detailed description of the behavior
of these males is contained elsewhere (Missa- 
kian, 1969). Following the 15 exposures to 
sexually receptive females, each male was in- 
troduced into a social group. 


Social Group Experience 


Frequency and duration measures for nine 
behavior categories for Male 49 in the social 
group revealed that the only instance of sexual 
activity exhibited by this monkey was mastur- 
bation. Masturbation occurred during the sole 
recorded instance of self-grooming. Generally, 
this monkey did not maintain close spatial 
relations with the other animals in the group. 
No instances of play behavior or wrestling 
with other members of the group were recorded. 

Data relevant to frequency and duration 
measures for nine behavior categories for Male 
218 in the social group revealed several
important points. On Day 13 of social group
experience, Male 218 engaged in the first ob- 
served sequence of play and wrestling behavior 
with another animal in the gang cage. The 
following behavior description is typical of the 
early play behavior of this male: 


December 12, 1967: Male 218 moves toward, then 
sit-touches Female 461, who turns and clasps Male 218,
mouthing his right forearm. Male 218 clasps Female 
461 as she is mouthing him, and he begins to mouth 
her shoulder and middorsal regions. Reciprocal clasping 
and mouthing as Male 218 and Female 461 roll on 
floor of cage. Following approximately 16 seconds of 
above sequence, Male 218 initiates bout of self-aggres- 
sion consisting of biting of right forearm and slapping 
forehead. Duration of self-aggression was approximately 
6 seconds and is followed by rapid galloping across the 
cage. Female 461 follows Male 218, clasps him with her 
arms and wrestling continued. Duration of phase and 
wrestling following self-aggression was 8 minutes 45 
seconds. 


It was not until mid-January that Male 218 
was seen playing with members of the group 
other than Female 461. 

The total number of attempted mounts of 
Male 218 numbered nine. All of these mount 
attempts were inappropriately oriented, the 
male assuming a lateral position with respect 
to the female. All mount attempts occurred 
on days containing high scores of reciprocal 
grooming, self-grooming, and play. 

The behavioral pattern of Male 219 was 
similar to that described above for Male 218. 
On Day 20 of social group experience, Male 
219 engaged in the first observed sequence of 
play and wrestling behavior with another 
animal in the gang cage. The following be- 
havioral description is typical of the play 
behavior of this male during the time he 
remained in the social group: 


April 2, 1968: Male 219 is sitting alone and is ap- 
proached by Male 476 who leans over Male 219’s neck 
region and softly bites Male 219. The bite does not 
appear to puncture the skin. Male 219 immediately 
grabs Male 476, and the two begin wrestling in a 
sequence lasting 29 seconds. The play bout terminates 
as Male 476 moves away from Male 219 and feeds. 


The total number of mount attempts in the 
case of Male 219 was six. All but one of these 
was inappropriately oriented, the male as- 
suming either a lateral or frontal orientation 
with respect to the female. In the case of the
appropriately oriented mount, no pelvic thrusts 
were executed. As in the case of Male 218, all
mount attempts occurred on days in which
there was a relatively high frequency of re- 
ciprocal grooming, self-grooming, and play. 


DISCUSSION 


The three male subjects represented ex- 
amples of severely disturbed animals as wit- 
nessed by rates of stereotyped behavior (self- 
aggression, stereotyped locomotion), which 
were higher than those of other socially de- 
prived males tested at the laboratory. Further- 
more, none of the subjects had ever exhibited 
any evidence, in the form of appropriate or 
inappropriate orientation of mounting activity, 
of species-typical sexual behavior. These monkeys
provided an excellent opportunity
to study the effects, if any, of social
group experience.

The problem under investigation involved 
the extent to which social group experience as 
an adult could modify and/or reverse atypical 
behaviors produced by rearing under conditions
of social deprivation. It seemed clear
from the evidence reported above that the 
group experience afforded in this study did 
modify the social behavior of two of the three 
male subjects. The failure of this social ex- 
perience to produce change in the case of Male 
49 provided valuable information concerning 
factors possibly involved in the behavioral 
changes observed in Males 218 and 219. The 
males differed in at least four ways: (a) 
Male 49 was 2 years older than Males 218 
and 219; (b) Male 49 had had adult social 
experience prior to the group experience; (c) 
Males 218 and 219 remained in the social 
group a total of 8 weeks longer than Male 49; 
and (d) Male 49 was not observed to engage 
in social play while a member of the group. 

One might speculate that the last factor, 
failure to engage in social play, might be a 
product of the first two factors. That is, that 
age and previous social history somehow pro- 
duce an animal unable to engage in social 
interactions of a play nature. Regardless of 
the reason, which can be determined only 
through further investigation, it is clear that 
Males 218 and 219 engaged in social play and 
Male 49 did not. Furthermore, social play in 
the case of Males 218 and 219 accompanied 
execution of other social behaviors such as
reciprocal grooming and mount attempt, which
occurred in higher frequency during observation
periods during which there was a high degree
of social play. The increase in these behaviors
was accompanied by a decrease in both
stereotyped locomotion and self-aggression.

Data from this study do not provide an 
answer to the question of the specific manner 
in which play facilitates development of social 
behaviors or produces reversals in aberrant 
behaviors of socially deprived males. Instead,
the data indicate a direction for future research,
one which may provide interesting and
fruitful results relating to the question of
reversibility of the effects of social deprivation.
The therapeutic effect of emotional as opposed
to chronological peer groups has interesting
implications for future research.

Males 49, 218, and 219, with several
additional socially deprived males at the laboratory,
were to be subjects in further investigations
concerning this question. However, all
of the males were sacrificed due to an outbreak
of tuberculosis in the colony. The information
available at present is for two monkeys. How-
ever, the type and duration of social experience 
afforded these males represents the only in- 
stance of behavior reversal in socially deprived 
rhesus monkeys.


STRUCTURAL BALANCE IN DYADS


DENNIS GELLER 


Logic of Computers Group, 
University of Michigan 


HAROLD MILLER
University of Waterloo 


Balance theory provides a means of predicting whether or not a given set of 
interpersonal relationships will change, provided the relationships are those of 
like and dislike. The present study attempted to empirically determine which 
interpersonal relationships are perceived to be stable (i.e., balanced) when the 
relationships include neutrality and ambivalence, as well as like and dislike.
The balance principle (that unbalanced relationships tend toward balance) was
also tested. Subjects were presented with a number of hypothetical situations 
describing relationships between two fictitious people and were asked to pre- 
dict the relationship at some future time. In general, the subjects perceived 
the relationships to be stable only if both members of the dyad shared the 
same attitude toward the other, that is, mutual liking, disliking, neutrality, or 
ambivalence. The balance principle was supported. Perceived tension in the 
dyad was also measured to determine if tension is inversely related to the 
degree of balance. This hypothesis was not supported. 


Balance theory, as proposed by Heider 
(1946) and formalized by Cartwright and 
Harary (1956), deals with the interpersonal 
relations of like and dislike. Abelson and 
Rosenberg (1958) proposed their theory of 
symbolic psycho-logic to handle the ambivalence
and neutral relations as well. Although
there has been considerable empirical research 
investigating the predictions made by balance 
theory for three or more entities (see the 
survey of the literature by Zajonc, 1968, p. 
348), the theory has yet to be tested on 
dyads. Also, while many articles have dis-
cussed the theory of symbolic psycho-logic 
(Giese, 1967; Lambert, 1966; Peizer, 1969), 
there has been no empirical research on the 
ambivalence and neutral relations. The pri- 
mary purpose of this study was to investigate 
all four relations in the simplest social group, 
the dyad, and to determine which sets of 
relationships are perceived to be stable (i.e., 
balanced). 

In particular, subjects were presented with 
hypothetical situations in which each member 
of the dyad either likes, dislikes, is neutral
toward, or is ambivalent toward the other.
The subject's task was to predict how each 
member will feel toward the other at some 
future time. This procedure also permitted 
testing the central hypothesis of balance 
theory, namely, that unbalanced relationships 
will tend to change to balanced relationships 
(the balance principle). Heider (1946) stated 
that there would be less tension in balanced 
relationships than in unbalanced relationships. 
In order to test this hypothesis, perceived 
tension in the dyad was also measured. This 
procedure is taken from Morrissette’s (1958) 
study of balance in triads and tetrads. With
four relations, there are 10 different dyads 
that can be studied. 

Let L, D, N, A represent the relations like, 
dislike, neutral, and ambivalence, respectively. 
Thus, by LD we mean the dyad in which 
Person P likes Person Q and Person Q dis- 
likes Person P. Using this notation, the 10 
dyads are: LL, LD, LN, LA, DD, DN, DA, 
NN, NA, and AA.
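The count of 10 follows from treating a dyad as an unordered pair of relations in which repetition is allowed. The sketch below enumerates such pairs; the function name is a hypothetical illustration, and the only inputs taken from the text are the four relation letters.

```python
from itertools import combinations_with_replacement

# The four relations named in the text: like, dislike, neutral, ambivalent.
RELATIONS = ["L", "D", "N", "A"]


def enumerate_dyads(relations):
    """Return every unordered pair of relations, repeats allowed."""
    return ["".join(pair) for pair in combinations_with_replacement(relations, 2)]


dyads = enumerate_dyads(RELATIONS)
```

With the relations given in the order L, D, N, A, the enumeration comes out in the same order as the list in the text.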

Three of the dyads involve only like and 
dislike and thus fall within the scope of 
balance theory. Thus, theoretically, LL and 
DD are balanced, and LD is unbalanced. Con- 
sequently, according to balance theory, the 
subjects should predict that no change will 
occur in the LL or DD dyads, and that the 
LD dyad will tend toward balance, although 
the theory does not predict what the final
balanced structure will be. According to
Cartwright and Harary (1956), all of the 
dyads involving a neutral relation are also 
theoretically balanced, although they use 
the special term “vacuously balanced” in 
this case. 

As a test of the tension hypothesis, subjects 
should perceive greater tension in the LD 
dyad than in either the LL or the DD dyad. 

No predictions can be made concerning the 
dyads in which the ambivalence relation is 
involved since the theory of symbolic psycho- 
logic requires at least three entities and re- 
quires that the relations between each pair 
of entities be identical. 


METHOD 


The subjects were 58 University of Michigan male 
and female undergraduates. Each subject was asked 
to complete the following questionnaire: 


Jim and Bill [or Jill and Betty, for female 
subjects] are two fictional characters. Suppose, to 
simplify matters, that each can have only one of
the following attitudes toward the other: one likes 
the other; one dislikes the other; one neither likes 
nor dislikes the other (i.e. one is neutral toward 
the other); one both likes and dislikes the other 
(i.e. one is ambivalent toward the other). You
will be told how Jim and Bill feel toward each 
other, and, in each case, your task will be to pre- 
dict how they will feel toward each other 3 
months from now. Assume that Jim and Bill 
have only recently become acquainted, and that 
each is aware of the other’s attitude. 

Each situation will be presented in the following
format:

PRESENT ATTITUDES: J L B; B A J
PREDICTED ATTITUDES: J __ B; B __ J


That is, from the above you are told that Jim 
likes Bill (J L B) and that Bill is ambivalent 
toward Jim (B A J). You are to indicate your
predictions by placing one of the following letters
in each of the blanks provided: L for “like,” D
for “dislike,” N for “neutral toward,” and A for 
“ambivalent toward.” 

It might be helpful if you try placing yourself 
in the situation by alternatively assuming the roles 
of Jim and Bill. If, for any situation, you are
uncertain of your prediction, put down your best 
guess. The situations can be considered in any 
order. Work at your own pace. 


The next page contained all 10 dyads (in a ran- 
domized order). Following completion of this part, 
the subjects were required to answer the following 
question for each of the 10 dyads in order to get 
a measure of perceived tension: 


Suppose Jim and Bill possess the attitudes origi- 
nally given. What do you think the chances are 
that serious difficulties between the two will occur? 
(Circle one)


RESULTS AND DISCUSSION


The predictions made by the subjects are
represented in Table 1, which is a matrix 
where each entry is the proportion of sub- 
jects who, when presented with the dyad rep- 
resented by the corresponding row, predicted 
the dyad represented by the corresponding 
column. 

As predicted by balance theory, LL, DD,
and NN are perceived as balanced by the
majority of subjects. The LD dyad is unbalanced,
and there is considerable disagreement
concerning the predicted dyad. The LN
and DN dyads are clearly unbalanced (2%
and 17% predicted no change, respectively).
In general, balance theory predicts correctly
for the balanced structures, but the concept
of a vacuously balanced structure does not
appear to be valid (except for NN).

TABLE 1
PROPORTION OF SUBJECTS PREDICTING EACH DYAD FOR EACH DYAD PRESENTED

Presented                     Predicted dyad
dyad        LL   DD   LD   LN   DN   NN   AA   LA   DA   NA
LL         .91  .02    0    0    0  .02  .03  .02    0    0
DD           0  .79    0    0  .02  .14  .02    0  .02  .02
LD         .05  .16  .07  .05  .03  .09  .24  .05  .17  .09
LN         .59  .05    0  .02  .02  .03    0  .17  .03  .09
DN         .03  .62    0    0  .17  .09  .02  .03  .03    0
NN         .12  .02    0    0    0  .66  .14    0  .02  .05
AA         .14  .02    0    0  .02  .14  .69    0    0    0
LA         .57  .02  .03  .03  .03  .03  .12  .12  .02  .02
DA         .02  .64  .02  .03  .05  .02  .09    0  .09  .05
NA         .09  .03  .02    0  .03  .28  .34  .07  .07  .07

Among the dyads involving ambivalence, 
only AA is balanced. In general, the dyads 
LL, DD, NN, and AA are the only dyads 
that the great majority of subjects perceived 
to be balanced. Assume that these 4 dyads 
are the only balanced dyads among the 10 
dyads presented. Then for each dyad pre- 
sented, the majority of subjects should predict 
that one of these four will result. Table 2 
shows, for each of the dyads presented, the 
proportion of subjects who predicted either 
the LL, DD, NN, or AA dyad. 

TABLE 2
PROPORTION OF SUBJECTS WHO PREDICTED EITHER LL, DD, NN, OR AA

Presented dyad   LL   DD   LD   LN   DN   NN   AA   LA   DA   NA
Proportion      .98  .95  .54  .67  .76  .94  .99  .74  .77  .74

The balance principle appears to be supported,
since the large majority of subjects
predict that the dyad presented to them will
change to a balanced dyad, except perhaps
for the LD dyad. In the case of the LD dyad, 
subjects are clearly in doubt as to what the 
relationship will be in 3 months' time. How-
ever, by making certain assumptions, it is 
possible to determine what dyad will likely 
result after a much longer period of time, on 
the basis of the data in Table 1. Since the 
balance principle refers to a tendency toward 
balance, it is quite likely that several 3-month 
time periods will be necessary before balance 
is achieved. Therefore, the long-term proba- 
bilities of a dyad eventually changing to an- 
other dyad were computed and are presented 
in Table 3. The technique is described in 
Kemeny, Mirkil, Snell, and Thompson 
(1959). In brief, the matrix of Table 1 is
assumed to represent the transition matrix 
of a finite Markov chain. The long-term 
probabilities are determined by raising this 
matrix to successively larger and larger
powers until the entries stabilize. Note that
the long-term probability of a particular dyad
resulting is independent of what the dyad was
initially. For example, there is a 55% chance
that the LD dyad (or any other dyad) will
eventually become an LL dyad. In general,
from Table 3, there is a 96% chance that one
of the four balanced dyads will result,
strongly supporting the balance principle.

TABLE 3
LONG-TERM PROBABILITIES OF ANY ONE DYAD CHANGING INTO EACH OF THE 10 DYADS

Dyad   LL   DD   LD   LN   DN   NN   AA   LA   DA   NA
(entries illegible in the source scan)

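The power-raising procedure described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the authors' own computation. The matrix is transcribed from Table 1, with four cells that are illegible in the scan (DD to DD, LD to DA, DN to DN, LA to AA) inferred from the Table 2 totals and the text. Rows are rescaled to sum to 1 because the published proportions are rounded. All function and variable names are hypothetical.

```python
DYADS = ["LL", "DD", "LD", "LN", "DN", "NN", "AA", "LA", "DA", "NA"]

# Rows = presented dyad, columns = predicted dyad (same order as DYADS).
# Assumption: .79 (DD->DD), .17 (LD->DA), .17 (DN->DN), and .12 (LA->AA)
# are inferred values, since those cells are garbled in the scan.
TABLE_1 = [
    [.91, .02, 0, 0, 0, .02, .03, .02, 0, 0],            # LL
    [0, .79, 0, 0, .02, .14, .02, 0, .02, .02],          # DD
    [.05, .16, .07, .05, .03, .09, .24, .05, .17, .09],  # LD
    [.59, .05, 0, .02, .02, .03, 0, .17, .03, .09],      # LN
    [.03, .62, 0, 0, .17, .09, .02, .03, .03, 0],        # DN
    [.12, .02, 0, 0, 0, .66, .14, 0, .02, .05],          # NN
    [.14, .02, 0, 0, .02, .14, .69, 0, 0, 0],            # AA
    [.57, .02, .03, .03, .03, .03, .12, .12, .02, .02],  # LA
    [.02, .64, .02, .03, .05, .02, .09, 0, .09, .05],    # DA
    [.09, .03, .02, 0, .03, .28, .34, .07, .07, .07],    # NA
]


def normalize_rows(matrix):
    """Rescale each row to sum to 1 (the published values are rounded)."""
    return [[value / sum(row) for value in row] for row in matrix]


def matmul(a, b):
    """Plain matrix product of two square lists-of-lists."""
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def long_term_matrix(transition, tol=1e-12, max_squarings=64):
    """Square the transition matrix repeatedly until its entries stabilize."""
    current = normalize_rows(transition)
    for _ in range(max_squarings):
        squared = matmul(current, current)
        change = max(abs(squared[i][j] - current[i][j])
                     for i in range(len(current)) for j in range(len(current)))
        current = squared
        if change < tol:
            break
    return current


limit = long_term_matrix(TABLE_1)
# Every row of the limit is the same, so any row gives the long-term probabilities.
long_term = dict(zip(DYADS, limit[0]))
balanced_total = sum(long_term[d] for d in ("LL", "DD", "NN", "AA"))
```

Because all rows of the limiting matrix coincide, the starting dyad does not matter, which is the independence property noted in the text. Squaring doubles the exponent at each step, so only a handful of multiplications are needed.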
Table 4 shows the perceived probability of 
serious difficulties occurring in the presented 
dyad, averaged over all 58 subjects. There- 
fore, the lower the probability, the lower the 
perceived tension. Balance theory correctly 
predicts that there will be less tension in the 
LL dyad than in the LD dyad (t = 98,
df = 57, p < .001). However, while the
theory also predicts that there will be less 
tension in the DD dyad than in the LD dyad, 
the difference was actually found to be in the 
opposite direction (but not significantly so). 
If we use the proportion of subjects who 
predict no change as an empirical measure 
of the degree of balance for each dyad, then 
a test of the tension hypothesis for all four 
relations would involve correlating the tension 
measure with this balance measure, and 
would predict a negative correlation (i.e., 
the greater the tension in a dyad, the lower 
the degree of balance). The correlation
(r = -.21, df = 8, p > .10), although negative,
is not significant. In general, the tension
hypothesis is not supported. In general, from
Table 4, the dyads with the greatest amount
of tension involve the dislike relation, followed
by the ambivalence relation, the neutral
relation, and the like relation. Therefore, it
would seem that tension is more a function
of the nature of the attitudes than of the
degree of balance.

TABLE 4
AVERAGE PERCEIVED TENSION FOR EACH OF THE PRESENTED DYADS

Presented dyad    LL    DD    LD    LA    LN    NN    DA    DN
Tension         2.05  6.60  6.24  4.57  3.40  2.30  5.34  4.59

The two remaining means, 3.86 and 3.36, belong to the AA and NA dyads;
their order is not recoverable from the scan.
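The nonsignificance of the tension-balance correlation reported above (r = -.21, df = 8) can be checked by converting r to a t statistic. The sketch below uses the standard conversion t = r * sqrt(df / (1 - r^2)) and the tabled two-tailed .10 critical value for 8 degrees of freedom. The function and constant names are hypothetical, and this is an illustration rather than the authors' stated procedure.

```python
from math import sqrt


def t_from_r(r, df):
    """Convert a correlation coefficient to a t statistic with df = n - 2."""
    return r * sqrt(df / (1.0 - r * r))


# Values reported in the text.
r_tension_balance = -0.21
degrees_of_freedom = 8

t_value = t_from_r(r_tension_balance, degrees_of_freedom)

# Tabled critical value of t for a two-tailed test at the .10 level, df = 8.
CRITICAL_T_TWO_TAILED_10 = 1.860
is_significant = abs(t_value) > CRITICAL_T_TWO_TAILED_10
```

The absolute value of t falls well short of the critical value, which agrees with the reported p > .10.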
IMMEDIATE AND DEFERRED EFFECTS OF MODEL
OBSERVATION AND GUIDED AND UNGUIDED
REHEARSAL ON DONATING AND STEALING


GLENN M. WHITE


University of Arizona 


A miniature bowling game served as the medium providing 210 fourth- and
fifth-grade children of both sexes with several 5¢ gift certificates. Near the
game was an open charity box. The short- and long-term effects on private
charitable behavior and stealing of model observation, observation plus unguided
rehearsal, and guided rehearsal were assessed. Observation alone was
not an effective eliciting agent. Providing girls who had observed the model
with an opportunity to engage in unrewarded social rehearsal facilitated subsequent
altruism and inhibited stealing. These effects persisted across a span of
several days. The guided rehearsal procedure occasioned substantial giving initially;
however, sharp decrements in giving and increments in stealing suggest
that this training method resulted in some negative long-term effects.


The problem of specifying the processes 
through which the child comes to behave in 
ways consistent with or discrepant from the 
admonitions of its socializing agents, even in 
the absence of immediate stimulus antecedents 
or reinforcement consequences, is the issue 
which underlies the present research. One 
category of such apparently self-regulated be- 
havior which appears to be of considerable 
theoretical import is altruism. 

Numerous theoretical accounts of altruism 
have appeared in the literature (Aronfreed, 
1968; Lewin, 1954; Mead, 1934; Piaget, 
1932; Rosenhan, 1969; Sorokin, 1962); 
however, the empirical research which the
topic has generated is not easily subsumed
under any of the extant positions. This is 
partly due to the fact that clear distinctions 
between private charitable acts and socially 
induced cooperation, between sharing and 
trading, and between expressed attitudes and 
observed behavior have not been systemati- 
cally made. This deficit makes it very dif- 
ficult to derive general principles from the 
corpus of empirical studies. Bryan and Lon- 
don (1970), in a recent review of altruism 
studies, concluded that the antecedents of 
altruistic behavior remain somewhat ambigu- 
ous, but that altruism appears to be norm 
regulated, subject to modeling influences, and 
probably antedated by certain types of family 
circumstances. 

In an experiment designed to study anony- 
mous altruistic behavior as a function of 
model observation and the valence of model- 
subject interaction (Rosenhan & White, 
1967), it was found that exposing a child 
to an altruistic model facilitated subsequent 
altruism, at least when an opportunity for 
unrewarded social rehearsal accompanied the 
observational procedure. Through the medium 
of a miniature bowling game, the subjects 
were allowed to win some gift certificates. 
During the first game, in which the model 
and the subject took turns, the model won 
four gift certificates, donating two to a char- 
ity box, and the subject won four certificates. 
The option was provided the subject to give or
not, as he chose. No response-contingent
feedback or reinforcement was provided by 
the model. It was then made clear to the sub- 
ject that no further contact with the model 
would obtain, and the subject was left alone 
to play and to win more certificates. Of the 
76 subjects who gave in the model’s presence, 
51 also donated while alone, while of the 44 
who failed to donate in the model’s presence, 
only 6 made subsequent private donations 
(Rosenhan & White, 1967). 

The Rosenhan and White (1967) experi- 
ment did not separate the observational effects 
from the rehearsal effects. The present experi- 
ment was designed to examine the combined 
and separate effects of observing and re- 
hearsing on private charitable behavior. If 
observation alone were effective in eliciting 
private donations, this would add support to 
the points made by Bandura (1965b, 1969) 
regarding the importance of observational pro- 
cedures as training devices. If, on the other 
hand, observation alone failed to elicit dona- 
tions where observation plus uncoerced re- 
hearsal succeeded, this would be a step toward 
establishing some boundary conditions for the
efficacy of observation in changing behavior.

Another issue which the present study was 
concerned with is the effectiveness of a guided 
rehearsal training procedure relative to the 
observation and to the observation-plus- 
unguided-rehearsal methods. Partly to allow 
for a test of the guided rehearsal effects in 
a situation with weaker demand character- 
istics than would obtain immediately after 
the training session and partly to make pos- 
sible a comparison of the effects of the proce- 
dures after a time lapse, a second game session 
was provided approximately 5 days after the 
first session.

Another aspect of the durability of the 
effects of the training procedures was assessed 
by including experimental conditions in which 
the opportunity to make private donations 
was provided for the first time during the 
second session. 

It would clearly not make sense to con- 
sider as altruistic the donations made by the 
subjects in the guided rehearsal condition, 
since a behavioral rule was firmly established 
and initially enforced by the experimenter. In
all other experimental and control conditions,
the private donations do qualify as altruistic,
with altruism being defined as the uncoerced 
private relinquishing to nonpresent needy 
others of a valued and possessed commodity, 
in the absence of any clear prospect of 
donation-contingent social or material reim- 
bursement. It was well established in the 
Rosenhan and White (1967) experiment that 
5¢ gift certificates are highly valued by 
fourth- and fifth-grade children. 

Observation alone was not expected to be
particularly effective in cuing the private 
relinquishing of gift certificates; however, it 
was predicted that providing an occasion for 
social rehearsal would significantly enhance 
the impact of modeling in eliciting private 
donations. It was expected that the model- 
cued social rehearsal of giving would engage 
the subject’s system of self-reinforcement for 
standard-congruent responding and that the 
unrewarded donations would continue to ob- 
tain through both of the model-absent ses- 
sions. Such a process would be analogous to 
what Kelman (1961) calls internalization. 

Several investigators have noted that rela- 
tively mild behavior cuing and sanctioning 
methods may be more effective than harsher 
methods in establishing behavioral standards 
which are maintained in the absence of the 
socializing agent (Aronson & Carlsmith, 
1963; Berkowitz, 1969; Peck & Havighurst,
1960). The instructed rehearsal procedure was
expected to produce compliance (Kelman,
1961), that is, behavior sponsored by concern with
the immediate social consequences. It was
thus predicted that guided rehearsal subjects 
would donate less in the adult’s absence than 
would the observation-plus-unguided-rehearsal 
subjects. It was also predicted that the 
decrement across sessions (in private dona- 
tions) would be greater for the instructed 
rehearsal subjects than for the observation- 
plus-unguided-rehearsal subjects. 


METHOD 
Subjects 


Two hundred and twenty-one fourth- and fifth- 
grade children served as subjects in this experiment. 
Eleven subjects were eliminated from the data analy- 
sis due to failure to appear for the second session, 
mechanical failure of the apparatus, errors made in 
administering instructions, and disrupting intrusions 
during the experimental sessions, leaving an effective
N of 210. All subjects were drawn from two public
schools in Lawrence, New Jersey, a primarily 
middle-class residential area. Of the 210 subjects, 
178 were fourth graders and 32 were fifth graders. 
The sample was equally divided between boys and 
girls. Subjects were sent to the laboratory in a
boy-girl order, determined by the teacher on the 
basis of availability, and were assigned sequentially 
to seven conditions. Each subject participated in 
two experimental sessions which were temporally 
separated by an average of 5 days. (For the vast 
majority of subjects, 3-6 days elapsed between 
sessions; however, a variety of circumstances 
necessitated intervals as short as 2 days and as long 
as 10 days for a few children.) There were 15 boys 
and 15 girls in each of seven experimental and 
control conditions. 


Equipment 


All experimental sessions were conducted inside the 
Educational Testing Center for Psychological Studies’ 
Dodge Mobile Unit, the interior of which was par- 
titioned by paneled walls into three compartments. 
The only windows in the vehicle were in the front 
compartment, and they were kept covered (from 
inside) with heavy black cloth in order to make 
operative the two one-way-vision mirrors which 
divided the front and middle compartments. All 
experimentation took place in the middle compart- 
ment, which contained a miniature bowling game 
and a small table, located near the game, on which 
were kept 13 piles of two 5¢ gift certificates and 
an open box containing five gift certificates and 
displaying a sign depicting obviously underprivileged 
children and bearing the label “Trenton Orphans’
Fund.” Both the game and the table were within
view of a hidden observer in the front compartment.
The bowling game, identical to the one employed 
by Rosenhan and White (1967), 


served as the task through which the subject 
could win the gift certificates... . [It] was 
played by rolling a marble along a three-foot
ramp, the end of which was shielded from the
subject’s view. As the marble rolled off the end 
of the ramp, a switch was triggered which acti- 
vated one of four lights located behind the plastic 
numerals 5, 10, 15, and 20. The numerals were
embedded in a vertical sheet of wood which faced 
the subject at the end of the ramp. A game con- 
sisted of 20 rolls and was automatically terminated 
by a buzzer. The sequence in which the numerals 
were illuminated was predetermined and remained 
constant for all games [p. 425]. 


The gift certificates could be spent in lieu of 
money, upon parental signature, at a popular local 
store designated on the coupons. 


Procedure 


Before the subject entered the mobile unit, an 
assignment was made to an experimental or control 
condition. Each subject participated individually and
was given standard instructions concerning the me-
chanics of the game and the nature and use of the 
gift certificates by a 26-year-old male graduate stu- 
dent who served as experimenter and, where appli- 
cable, as the model. A female research assistant observed
all experimental sessions from behind the
one-way-vision mirror. 

Subjects were told that a score of 20 entitled them 
to two 5¢ gift certificates. The money-surrogate char- 
acter of the certificates was noted, and if the subject 
was unfamiliar with the store which honored the 
certificates, its location and nature were specified. All
subjects were made aware of the charity box and 
were told: “The people who made the game decided 
to give some of these gift certificates to some orphans 
in Trenton.” Subsequent directions varied somewhat 
according to experimental condition.

Guided rehearsal condition. After having been 
made aware of the charity box, the subjects were 
told: “What we would like you to do is to give one 
certificate to the orphans each time you win two.” 
Then the subjects played for five trials in the experi- 
menter’s presence, winning on Trials 3 and 4. If the 
subject failed to donate each time, he was reminded 
to do so. Thus, children in this condition rehearsed 
the charitable behavior twice in the experimenter’s 
presence, albeit not voluntarily. 

Half of the subjects in this condition played the 
game alone immediately after rehearsal as well as 
during Session 2 several days later (played immedi- 
ately), and half played alone only during Session 2 
(played later). After the rehearsal trials, guided- 
rehearsal-played-immediately subjects were informed 
that they could play a game alone and return to 
class upon its completion. They were told that the 
experimenter had work to do in the rear compart- 
ment, whereupon the experimenter departed to the 
rear room, shutting the door firmly and locking it, 
leaving the subject alone. Guided-rehearsal—played- 
later subjects completed the rehearsal trials and then 
were told that the session was finished but that they 
would be able to play again in a few days. 

Only guided rehearsal subjects were given strong 
instructions concerning donating to the charity box. 
All other subjects were told: “If you win some of 
these gift certificates and would like to give some 
of yours to the orphans, you can, but you don’t 
have to.” 

Observation-plus-unguided-rehearsal condition. Sub- 
jects were told that the experimenter would take a 
few turns with them in order to demonstrate the 
game. For 10 trials, they alternated turns, with the 
experimenter winning on Trials 3 and 7 and the sub- 
jects winning on Trials 4 and 10. Upon winning, the 
experimenter picked up two gift certificates and, 
reaching across the subject, dropped one into the 
charity box, commenting as he did so that he would 
give one to the orphans each time he won. While the 
subject played, particularly prior to, during, and after 
his winning trials, the experimenter carefully avoided 
providing cues as to whether the subject should give. 
The experimenter accomplished this by attending to
aspects of the game which precluded his “seeing” the
subject. Care was taken to neither reward nor punish
the subject for either giving or not giving. If the 
subject attempted to call the experimenter’s attention 
to his charitable intentions or acts, he was totally 
ignored. Observation-plus-unguided-rehearsal subjects 
observed the model give twice and had two oppor- 
tunities to act charitably in the model’s presence. 
As with the guided rehearsal condition, one-half 
of the observation-plus-unguided-rehearsal subjects 
played alone only during Session 2 (observation-plus- 
unguided-rehearsal-played later). With observation- 
plus-unguided-rehearsal-played-immediately subjects, 
the experimenter exited after the rehearsal trials as 
described above for guided rehearsal-played im- 
mediately. 

Observation condition. In order to “demonstrate 
the game,” the experimenter played for five trials in 
the subject’s presence, winning on Trials 3 and 4 and 
donating both times as described for the observation- 
plus-unguided-rehearsal condition. Observation sub- 
jects watched a charitable model and then were either 
left to play alone (observation-played immediately) 
or were sent back to class having been told that an 
opportunity would be provided to play in a few days 
(observation-played later). 

Control condition. After instructing the subject 
regarding the presence of the charity box and after 
having made it clear that contributions were volun- 
tary (as described above), the experimenter exited 
to the rear compartment, allowing the subject to 
play alone. 

For all conditions, the game instructions were 
concluded by asking if the subject understood how 
to play. Questions were answered by repeating por- 
tions of the previously stated instructions. Except in 
guided rehearsal conditions, questions relating to 
whether the subject should give were ignored. 

The game was reset prior to each subject’s game 
alone, with each complete game consisting of 20 
trials, 6 of which resulted in scores of 20 (Trials 3, 
4, 7, 10, 13, and 17). 

Session 2. All subjects were given identical instruc- 
tions and subjected to an identical procedure, regard- 
less of which experimental condition they were in 
during Session 1. A female experimenter reviewed the 
game mechanics, casually called the subject’s atten- 
tion to the charity box, and left the child to play 
alone, again making it clear that he should leave 
when the game was over. 


RESULTS 


Of the 120 subjects who played immedi- 
ately after the experimental and control ma- 
nipulations, 83 (69%) donated at least one 
certificate in at least one session. Seventy-five 
(62%) donated in Session 1, and 56 (47%) 
gave in Session 2. A total of 415 certificates 
were donated by the 120 subjects during 
the two sessions. The numbers of certificates
donated in the guided-rehearsal-played-immediately,
observation-plus-unguided-rehearsal-played-immediately,
observation-played-immediately, and control conditions
during Sessions 1 and 2 are shown in Figure 1.
With these four conditions comprising one factor,
the subject’s sex a second, and session number a
third, a three-way analysis of variance design
(4 X 2 X 2), with repeated measures on one factor
(Winer, 1962, pp. 337-344) was applied to the data.
This analysis was undertaken in order to assess the
effects of the treatment conditions on giving, the
role of the subject’s sex, the effects of the interval
between sessions, and their possible interactions.
In Table 1, the data on which the 4 X 2 X 2 analysis
was based are summarized.

FIG. 1. Mean number of certificates donated per
subject in guided-rehearsal-played-immediately,
observation-plus-unguided-rehearsal-played-immediately,
observation-played-immediately, and control
conditions during Sessions 1 and 2.

The homogeneity of variance assumption 
underlying analysis of variance was tested 
with Cochran’s method (Guenther, 1964, pp. 
21-22), and the data failed to satisfy the 
assumption. The variances were, therefore,
stabilized with the Freeman-Tukey square-root
transformation (transformed value =
$\sqrt{X} + \sqrt{X + 1}$; Mosteller & Bush, 1954,
pp. 326-327). Again the Cochran test was applied,
and this time the homogeneity assumption was
satisfied (R = .13, df = 15/16, p < .05). A summary
of the results of the analysis of the transformed
data appears in Table 2.

TABLE 1
SUMMARY OF DONATION DATA FOR SUBJECTS WHO PLAYED IMMEDIATELY

                                        Observation plus
              Guided rehearsal          unguided rehearsal        Observation               Control
              Males        Females      Males        Females      Males        Females      Males        Females
Session 1
  Mean        4.8          4.5          1.6          2.9          [illegible]  1.3          [illegible]  [illegible]
  SD          1.8          2.1          2.0          2.4          .6           1.6          .9           .6
  n           15           15           15           15           15           15           15           15
Session 2
  Mean        [illegible]  [illegible]  .8           2.9          [illegible]  1.0          [illegible]  .4
  SD          3.0          2.6          1.3          2.1          1.4          1.4          1.3          [illegible]
  n           15           15           15           15           15           15           15           15
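As an illustration of the variance-stabilizing step described above, the following Python sketch applies the Freeman-Tukey square-root transformation to donation counts and computes the ratio of the largest cell variance to the sum of all cell variances before and after. The counts are hypothetical, and treating Cochran’s method as this ratio is an assumption; the sketch does not reproduce the study’s data or its reported statistic.

```python
import math
from statistics import variance


def freeman_tukey(x):
    """Freeman-Tukey square-root transform: sqrt(X) + sqrt(X + 1)."""
    return math.sqrt(x) + math.sqrt(x + 1)


def cochran_ratio(groups):
    """Largest sample variance divided by the sum of all sample variances."""
    variances = [variance(g) for g in groups]
    return max(variances) / sum(variances)


# Hypothetical certificate counts for three cells (not the study's data).
cells = [
    [6, 6, 6, 5, 0, 6, 3, 6],   # high-giving cell
    [0, 1, 0, 2, 0, 0, 1, 0],   # low-giving cell
    [2, 0, 3, 1, 0, 4, 1, 2],   # intermediate cell
]

transformed = [[freeman_tukey(x) for x in cell] for cell in cells]

raw_ratio = cochran_ratio(cells)
new_ratio = cochran_ratio(transformed)
print(f"raw: {raw_ratio:.2f}, transformed: {new_ratio:.2f}")
```

A ratio near 1 means a single cell dominates the total variance; the homogeneity assumption is more plausible when the ratio is small relative to its critical value.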

The four conditions considered in this 
analysis were found to result in significantly 
different numbers of donated certificates 
across the two sessions. The guided rehearsal 
condition resulted in the greatest amount of 
giving, followed by observation plus unguided 
rehearsal, observation, and control, in that 
order. The effect of the subject’s sex ap- 
proached but did not reach significance at 
the .05 level, with girls donating slightly more 
than boys. Significantly less altruism occurred 
in Session 2 than in Session 1. A highly sig- 
nificant Treatment X Session interaction ob- 
tained, indicating that the decline in giving 
from Session 1 to Session 2 was differentially 
contributed by the treatment conditions. As 
can be seen in Figure 1, by far the greatest 
decline in giving occurred in the guided re- 
hearsal-played immediately condition. The 
Treatment X Sex interaction approached but 
failed to reach significance. 
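The F ratios in Table 2 can be checked by dividing each mean square by its error term: between-subjects effects are tested against subjects within groups (MS = 2.48), and within-subjects effects against C X subjects within groups (MS read here as .82). A minimal Python sketch, using only the mean squares legible in Table 2; the dictionary layout and names are illustrative:

```python
# Mean squares as reported in Table 2 (transformed data).
MS_ERROR_BETWEEN = 2.48   # subjects within groups
MS_ERROR_WITHIN = 0.82    # C X subjects within groups

between_effects = {"Treatment (A)": 44.82, "Sex (B)": 9.46, "A X B": 5.90}
within_effects = {"Sessions (C)": 18.07, "A X C": 8.15, "B X C": 1.69}


def f_ratios(effects, ms_error):
    """Return each effect's F ratio (effect MS / error MS), rounded to 2 places."""
    return {name: round(ms / ms_error, 2) for name, ms in effects.items()}


f_between = f_ratios(between_effects, MS_ERROR_BETWEEN)
f_within = f_ratios(within_effects, MS_ERROR_WITHIN)

for name, f in {**f_between, **f_within}.items():
    print(f"{name}: F = {f}")
```

The recomputed ratios agree with the tabled F values, which supports reading the within-subjects error mean square as .82.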

The Newman-Keuls method was employed 
to compare all cells within each session with 
each other (Winer, 1962), since orthogonal 
planned comparisons for all of the hypothe- 
ses of interest were not possible. Guided re- 
hearsal-played-immediately males and guided- 
rehearsal-played-immediately females gave 
significantly more than did control and 
observation-played-immediately subjects and 
observation-plus-unguided-rehearsal-played-
immediately males (p < .01). Observation-
plus-unguided-rehearsal-played-immediately


females were significantly more altruistic than 
control females and observation—played-im- 
mediately males (p < .05). 

The Session 2 comparisons yielded only one 
which reached significance beyond the .05 
level; observation-plus-unguided-rehearsal— 
played-immediately females were significantly 
more altruistic than control females. In Ses- 
sion 2, observation-plus-unguided-rehearsal— 
played-immediately females surpassed guided 
rehearsal males and guided rehearsal females 
in amount donated. 


TABLE 2

SUMMARY OF ANALYSIS OF VARIANCE OF ALTRUISM FOR
GUIDED-REHEARSAL, OBSERVATION-PLUS-UNGUIDED-REHEARSAL,
OBSERVATION, AND CONTROL PLAYED-IMMEDIATELY CONDITIONS

Source                          df      MS         F
Between subjects               119
  Treatment (A)                  3     44.82     18.07**
  Sex (B)                        1      9.46      3.81*
  A X B                          3      5.90      2.38*
  Subjects within groups       112      2.48
Within subjects                120
  Sessions (C)                   1     18.07     22.04**
  A X C                          3      8.15      9.94**
  B X C                          1      1.69      2.06
  A X B X C                      3     [illegible]
  C X subjects within groups   112       .82

Note. This analysis was performed on data which had been
transformed by the Freeman-Tukey square-root method
($X' = \sqrt{X} + \sqrt{X + 1}$).
* p < .10.
** p < .001.
The possibility that the large guided rehearsal
Session 1 mean, as compared with all other means,
was primarily responsible for the significant
analysis of variance results was examined. The
guided rehearsal Session 1 mean did differ
significantly (p < .05) from all other means
(extremity of means test; Dixon & Massey, 1957,
p. 145); however, a separate analysis of variance
for observation-plus-unguided-rehearsal,
observation, and control conditions (guided
rehearsal having been excluded) yielded significant
treatment (F = 10.91, df = 2/84, p < .01), sex
(F = 5.43, df = 1/84, p < .05), and Treatment X Sex
(F = 4.44, df = 2/84, p < .05) effects. The fact
that no session or Treatment X Session interaction
effects were noted in this analysis demonstrates
that the extremity of guided rehearsal Session 1
giving was primarily responsible for these effects
in the 4 X 2 X 2 analysis, but differences among
the other conditions contributed to the treatment
and sex effects.


Played Later Conditions 


The mean numbers of certificates donated in the
guided-rehearsal-played-later,
observation-plus-unguided-rehearsal-played-later,
and observation-played-later conditions were
1.97, 1.40, and 1.37, respectively.

Played-later–played-immediately (Session 1)
analysis. Played-later and played-immediately
(Session 1) performances were adjudged comparable,
in that in both cases it was the subject’s first
test of altruistic behavior. The two sets of
conditions differed in the time elapsing between
the experimental manipulations and the test for
altruism, and in the intersubject communications
which doubtless occurred during this interval. An
analysis of variance was undertaken on these
conditions in order to assess the effects of time
on altruistic performance without the confounding
variable of prior altruism intervening (as was the
case in the played-immediately (Session 1)–
played-immediately (Session 2) comparison). The
application of Cochran’s method revealed that the
untransformed data satisfied the homogeneity
assumption. A Treatment X Sex X Played-Immediately
(Session 1)–Played-Later analysis (3 X 2 X 2) was
carried out. (Note that, of necessity, this
analysis is not independent of the analysis
reported above.)

As would be expected on the basis of the 
strong treatment effect noted above, to which 
the played immediately (Session 1) data 
made the larger contribution, a significant 
treatment effect obtained in this analysis 
(F = 19.77, df = 2/168, p < .001). It is also 
clear that as was the case for played immedi- 
ately (Session 2), significantly less altruism 
obtained in the played later conditions 
than in their played immediately (Session 1) 
counterparts (F = 12.67, df = 1/168, p < .001).
No significant sex effects were found. The only
significant interaction effect noted was the
Treatment X Played-Immediately (Session 1)–
Played-Later interaction (F = 10.00, df = 2/168,
p < .005). The guided-rehearsal-played-immediately
(Session 1) subjects donated much more than did
the guided-rehearsal-played-later subjects; the
observation-plus-unguided-rehearsal-played-immediately
(Session 1) subjects donated slightly more than
their observation-plus-unguided-rehearsal-played-later
counterparts;
and observation—played-immediately (Session 
1) subjects donated less than did observation— 
played-later subjects. 

Played-later–played-immediately (Session 2)
analysis. A Session 2 analysis was conducted,
since in both instances several days had elapsed
since the training procedures. This comparison
should indicate the effects of opportunity for
prior rehearsal in the model’s absence as compared
with none, with time lapse held constant. Cochran’s
test indicated that no transformation on the data
was required. A Treatment X Sex X Played-Later–
Played-Immediately (Session 2) analysis of variance
was carried out. Two significant main effects
emerged from this analysis, a treatment effect
(F = 3.47, df = 2/168, p < .05) and a sex effect
(F = 5.79, df = 1/168, p < .025). There were no
significant interactions.

The treatment effect observed in this analysis,
with guided rehearsal subjects giving more than
observation-plus-unguided-rehearsal subjects and
observation subjects giving least, reflects the
impact of the experimental manipulations after a
lapse of several days. The fact that a significant
sex difference obtained in this analysis but not
in the played-immediately (Session 1)–played-later
analysis indicates that the nearly significant sex
difference reported in the 4 X 2 X 2 analysis
(Table 2) was due more to played-immediately
(Session 2) performance than to played-immediately
(Session 1) performance.


Stability of Giving 


In order to assess the degree of relationship
between performance during the two sessions,
Pearson product-moment correlations between
Session 1 and Session 2 altruism scores were
computed for all played-immediately subjects
(r = .55), for males (r = .33) and females
(r = .73), and for guided-rehearsal-played-immediately
(r = .48), observation-plus-unguided-rehearsal-
played-immediately (r = .67), observation-played-
immediately (r = .40), and control (r = .46)
subgroups.

The difference between the correlations for boys
and girls was tested and found to be significant,
with girls being the more stable givers (z = 3.13,
p < .002). No significant differences between
conditions were noted.
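The z reported above is consistent with the standard Fisher r-to-z test for two independent correlations. The sketch below assumes 60 boys and 60 girls (15 per sex in each of the four played-immediately cells, per Table 1) and a two-tailed test; the function name is illustrative, and the source does not state which procedure was actually used.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """z statistic and two-tailed p for the difference between two independent correlations."""
    z1, z2 = math.atanh(r1), math.atanh(r2)          # Fisher r-to-z transforms
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # standard error of z2 - z1
    z = (z2 - z1) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))           # two-tailed normal probability
    return z, p


# Session 1 vs. Session 2 correlations: boys r = .33, girls r = .73.
z, p = fisher_z_difference(0.33, 60, 0.73, 60)
print(f"z = {z:.2f}, p = {p:.4f}")
```

Under these assumptions the statistic rounds to 3.13, matching the reported value.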

The correlations reported above were, of course,
enhanced by those subjects who were consistent
nongivers. In order to allow the reader to assess
the relative contributions of this factor to the
most interesting correlations, it should be noted
that the significant difference between boys’ and
girls’ stability of altruism obtained despite
rather than because of this factor, since 20 boys
versus 17 girls were nongivers both times.
Similarly, observation-plus-unguided-rehearsal-
played-immediately (r = .67) had only eight
nongivers, while observation-played-immediately
(r = .40) and control (r = .46) both had 12.

The scatterplot from which the guided-rehearsal-
played-immediately correlation was computed is of
particular interest. The 13 guided-rehearsal-
played-immediately subjects who gave from 0 to 5
certificates during Session 1 gave either 1
certificate or none in Session 2 (X̄ = .23). On the
other hand, Session 2 performance for the 17
subjects who gave 6 certificates during Session 1
ranged from 0 to 12 (X̄ = 3.7). The guided-rehearsal-
played-immediately data yielded an eta of .61, and
a test for nonlinearity, comparing r with eta,
resulted in a nonsignificant F ratio of 1.41
(df = 6/23).

Of interest with regard to the suspected 
role of rehearsal in the model’s presence in 
facilitating subsequent altruism is the fact 
that 90% (19 out of 21) of the
observation-plus-unguided-rehearsal-played-immediately
subjects who gave in the model’s absence
during Session 1 had previously given in the
model’s presence. Of the 18
observation-plus-unguided-rehearsal-played-immediately
subjects who donated in Session 2, 15 (83%)
had previously given in the model’s presence, 
and the same 15 had also previously given 
in his absence. 


Stealing 


A final set of results which were incidental
but not unrelated to the main purpose of the
study is the pattern of stealing that occurred
in the various conditions. The existence of
extra gift certificates on the table made
stealing a salient alternative (and, sometimes,
adjunct) to altruism. Table 3 presents the
frequency of stealing behavior in both sessions
(where applicable) for the several conditions.

TABLE 3

INCIDENCE OF STEALING

Subjects     GR-PI  OUR-PI  O-PI  C-PI  GR-PL  OUR-PL  O-PL
Session 1
  Boys         0      1      3     3
  Girls        0      0      2     2
  Total        0      1      5     5
Session 2
  Boys         4      2      3     2      3      2      2
  Girls        4      0      1     2      2      0      4
  Total        8      2      4     4      5      2      6

Note. GR = guided rehearsal; PI = played immediately;
OUR = observation plus unguided rehearsal; O = observation;
C = control; PL = played later.

An analysis of variance on the number of 
certificates stolen was not performed, since the 
homogeneity assumption was badly violated 
by the data. 

Note the striking increase in Session 2 over
Session 1 stealing on the part of guided-rehearsal-
played-immediately subjects. McNemar’s significance
of change test (Siegel, 1956) was applied to
guided-rehearsal-played-immediately Session 1–
Session 2 stealing differences, yielding a
chi-square of 6.12 (p < .02, corrected for
continuity).
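The reported chi-square follows from McNemar’s formula with the continuity correction, $\chi^2 = (|b - c| - 1)^2 / (b + c)$, where b and c count the subjects who changed in each direction. The sketch assumes, from the totals in Table 3, that 8 guided-rehearsal-played-immediately subjects went from not stealing in Session 1 to stealing in Session 2 and that none changed the other way (no one in this condition stole in Session 1). The function name is illustrative.

```python
def mcnemar_corrected(b, c):
    """McNemar change-test chi-square (df = 1) with continuity correction."""
    return (abs(b - c) - 1) ** 2 / (b + c)


b = 8  # did not steal in Session 1, stole in Session 2
c = 0  # stole in Session 1, did not steal in Session 2
chi_square = mcnemar_corrected(b, c)
print(chi_square)  # 49 / 8 = 6.125, reported in the text as 6.12
```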

Note also the differences in stealing
between the Session 2 observation-plus-unguided-
rehearsal conditions and the Session 2
guided-rehearsal conditions. Significantly more
guided-rehearsal subjects stole in Session 2
than observation-plus-unguided-rehearsal subjects
(χ² = 4.38, df = 1, p < .05; corrected
for continuity). Of the 13 guided-rehearsal
subjects who failed to donate on each Session 1
winning trial, 6 (46%) stole during
Session 2.
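The treatment comparison above can be approximated with a 2 X 2 chi-square corrected for continuity. The sketch assumes that the test pooled the played-immediately and played-later groups within each treatment (30 + 30 = 60 subjects per treatment, inferred from the cell sizes and error degrees of freedom) and uses the Session 2 totals in Table 3: 8 + 5 = 13 guided-rehearsal stealers and 2 + 2 = 4 observation-plus-unguided-rehearsal stealers. Under those assumptions the statistic comes out close to the reported 4.38.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square (df = 1) with Yates' continuity correction for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Rows: treatment; columns: stole / did not steal in Session 2 (assumed pooling).
gr_stole, gr_total = 8 + 5, 60      # guided rehearsal
our_stole, our_total = 2 + 2, 60    # observation plus unguided rehearsal

chi_square = yates_chi_square(
    gr_stole, gr_total - gr_stole,
    our_stole, our_total - our_stole,
)
print(f"chi-square = {chi_square:.3f}")
```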


Discussion 


Conversations with teachers concerning the 
children’s behavior and overheard comments 
of children on the playground made it clear 
that peer norms were strongly oriented toward 
competition with regard to the experimental 
task. The children seemed concerned to return 
to class with as many certificates as possible, 
which they would then compare with the 
winnings of classmates. Such norms would 
have opposed the tendency to relinquish certi- 
ficates to charity and should have made the 
charitable behavior which did obtain all the 
more altruistic, since the object to be donated 
would have been socially as well as materially 
valued. The possibility that the content of 
these intersubject communications differed 
systematically as a function of experimental 
condition suggests a potential artifactual con- 
taminant with respect to the Session 2 data. 
The played-immediately and played-later 
Session 2 results should be interpreted in 
light of this possible artifact. 

The observer, who monitored all experimental
sessions, reported no evidence that subjects were
suspicious of being observed. The fact that the
experimenter exited through the rear door
doubtlessly helped direct attention away from the
observer’s compartment.


Observation and Imitation in Social Settings 


At least for females, the major hypothesis 
which prompted the present experiment was 
confirmed. The occasion for noncoerced social 
rehearsal of giving, cued by model observa- 
tion, facilitated subsequent altruism, both im- 
mediately after the experimental manipulation 
and after a delay of several days, despite the 
fact that the social rehearsal was neither 
noticed nor rewarded. The evidence for males, 
while in the predicted direction, was not 
statistically significant and was clearly less 
striking than for females. 

The finding that observation alone did not
occasion significantly more altruism than occurred
in the control condition stands in contrast to
studies from which it has been concluded that
observation per se is a powerful elicitor of low
probability behaviors (Bandura & Kupers, 1964;
Bandura & Mischel, 1965; Bandura, Ross, & Ross,
1963). The present research suggests that
observation alone may not be effective if the
observed behavior has a low base-line probability
of occurrence and if it entails moderate expense
to the organism. Under such circumstances, modeling
may be a relatively inefficient way of informing
the subject how he is expected to behave under
similar conditions.

The observation-plus-unguided-rehearsal 
data indicate that observation does serve to 
make giving a salient alternative and perhaps 
to establish a social anchor, such that when it 
is combined with anticipated social approval 
(or fear of disapproval), imitative giving oc- 
curs. Such observation and socially triggered 
compliant behavior in turn increases the prob- 
ability that the subject will behave altru- 
istically while alone. For girls, this externally 
unrewarded rehearsal of observed giving ex- 
erted as powerful an effect after an interval 
of several days as it did immediately following 
the manipulation. 

It seems unlikely that observation-plus- 
unguided-rehearsal altruism can be reasonably 
accounted for by situational demand charac- 
teristics. It is not obvious why such demand 


characteristics should be any greater for
observation-plus-unguided-rehearsal subjects
than for control subjects. If anything,
observation-plus-unguided-rehearsal subjects should
feel less constrained to give in the model’s
absence than control subjects, since most
observation-plus-unguided-rehearsal subjects,
having already donated in the model’s pres- 
ence, could construe themselves as having ful- 
filled any model-imposed obligation to donate. 
Nor is it clear that the observation-plus-un- 
guided-rehearsal manipulation should have 
conveyed more of a rulelike interpretation of 
donating than the control procedure. In fact, 
few of the observation-plus-unguided-rehearsal 
subjects donated each time. 

The observation-plus-unguided-rehearsal 
data seem to be most easily assimilated by 
those theoretical perspectives which assign re- 
inforcing properties to internalized standards, 
whether these are couched in terms of refer- 
ence scales, attitudes, or values. 


Instructed Rehearsal 


The prediction that guided-rehearsal sub- 
jects would donate less in the adult’s absence 
than observation-plus-unguided-rehearsal subjects
was not borne out by the data. In terms of
short-range effects, the statement, “What we would
like for you to do is to donate one certificate
each time you win two,” was a highly efficient
elicitor of private donations. That 57% of the
guided-rehearsal-played-immediately subjects
complied perfectly with a rule which deprived them
of valued commodities, even though they were alone
and could have breached the rule with impunity,
suggests either a respect for rules (Piaget, 1932)
or a fear of social consequences as a
behavioral mediator. The hypothesis that the
guided-rehearsal intersession decrement in 
giving would exceed that of the other experi- 
mental conditions was confirmed. The fact 
that only 17% of the guided-rehearsal—played- 
immediately subjects complied with the rule 
in Session 2 suggests that factors other than 
“respect for rules” were operative. 

It seems significant that virtually no Ses- 
sion 2 giving obtained among guided-rehearsal 
subjects who, in Session 1, failed to meticu- 
lously observe the rule by donating on each 
winning trial. One might speculate that to 
the extent to which these subjects gave, they
were merely complying on grounds of fear of 
possible social consequences and that having 
once discovered that it was possible to safely 
disregard the rule (by failing to donate at 
least once during Session 1), the motive for 
compliance vanished, since no social conse- 
quences obtained. The fact that almost half 
of the guided-rehearsal subjects who gave 
fewer than six certificates in Session 1 not 
only did not give but stole during Session 2 
suggests that the expected negative conse- 
quences of enforced rehearsal did obtain, but 
only after the 1-week delay. 


Further Observations 


Sex differences and cross-sex facilitation ef- 
fects are not uncommon in modeling experi- 
ments (Bandura, 1965a). The observation- 
plus-unguided-rehearsal sex differences and 
the finding that girls gave significantly more 
consistently than boys are not easily inter- 
pretable, since only a male model was em- 
ployed and in light of the nearly significant 
Sex x Treatment interaction among played- 
immediately subjects. The fact that control 
boys were slightly more altruistic in both ses- 
sions than control girls suggests that it is not 
the case that girls are simply more disposed 
to behave altruistically than boys. Social and 
situational factors, as they interacted with 
differential predispositions, were likely respon- 
sible for the sex differences. 

That the level of played later giving did 
not differ significantly from that of played- 
immediately Session 2 subjects suggests that 
the effects of the training conditions, in gen- 
eral, decrease over time and that such a de- 
crease is not altered by an initial opportunity 
to rehearse giving privately. 

In light of its success in maintaining private 
altruistic acts and suppressing stealing (rela- 
tive to the other training methods), the com- 
bination of observation and socially induced 
(but nonsanctioned) rehearsal is suggested as 
a potentially viable socialization technique, 
entailing some advantages over less constrain- 
ing “pure observational” procedures and some 
long-range advantages over more constraining 
instructional methods of instating private com- 
pliance with “expensive” behavioral standards. 
REINFORCEMENT FOR IMITATION AS A SOCIAL LEARNING
DETERMINANT WITH IMPLICATIONS FOR
SEX-ROLE DEVELOPMENT¹


JOAN E. GRUSEC² AND DALE B. BRINKER, JR.


University of Toronto 


Children who were reinforced for imitating one of two adults were able to 
recall more of the subsequent behavior of that adult. Films of their eye 
movements revealed that, in spite of these recall differences, all subjects looked 
at each adult for the same amount of time. Nor did they tend to initially 
recall more of one adult’s behavior, thereby forgetting more of the other’s 
because of a greater delay between its presentation and recall. An explanation 
of the acquisition finding in terms of differential coding of the two models’ 
behaviors thus seemed plausible. These findings suggest that children, whom 
we generally reinforce for imitating same-sexed models, should learn more 
about the behavior of same-sexed individuals and that this tendency should 
increase with age, that is, with increasing reinforcement. In a second study, 
employing 5- and 7-year-olds, the first hypothesis was confirmed for boys and
for half of the girls, but the predicted change with age was not found.


Variables which influence the imitation of 
observed behaviors have been the subject of 
investigation now for a number of years. 
Research findings suggest that models who 
are powerful, rewarded for their behavior, 
friendly, or similar to an observer are gen- 
erally more likely to be imitated than those 
who are not powerful, rewarded for their 
behavior, friendly, or similar to an observer 
(e.g., Bandura, Grusec, & Menlove, 1967; 
Bandura & Huston, 1961; Bandura, Ross, & 
Ross, 1963; Mischel & Grusec, 1966; Rose- 
krans, 1967). Few studies, however, have 
been concerned with the problem of whether 
these and other social variables influence imi- 
tation through their effect on an observer’s 
willingness to perform modeled behaviors, on 
the degree to which he has learned these 
behaviors, or both. Individuals may, for example,
imitate a powerful model because their willingness
to perform his behavior is increased by certain
expectancies they have about the outcome of
imitation. On the other hand, these consequences
may affect in some way the amount they learn about
the model’s behavior, and behaviors obviously
cannot be imitated if they have not been learned.
Alternatively, increased imitation of a powerful
model may be due to both increased willingness to
perform and a greater amount learned. The present
study is addressed to one aspect of the problem,
that of learning.


1 This investigation was supported by Grant 116
from the Ontario Mental Health Foundation. The
authors are indebted to the officials and teachers of
the Waterloo schools and the Toronto separate
schools who generously cooperated in the carrying
out of these studies.

2 Requests for reprints should be sent to Joan E.
Grusec, Department of Psychology, University of
Toronto, Toronto 181, Canada.

Bandura (1965) has maintained that the
acquisition of matching responses results
primarily from stimulus contiguity and associated
symbolic responses. In accord with this notion,
Bandura, Grusec, and Menlove (1966) found that
children who generated verbal equivalents of a
model’s behavior remembered more of that behavior
than those who simply passively observed the
model. These passive observers, however, recalled
more than those who engaged in competing
symbolization while observing the model. It is
evident, then, that the acquisition of behavior
can be directly affected by the coding processes
of the observer.

Acquisition may be indirectly influenced by
manipulations which affect a subject’s
attentiveness to a model by altering his observing
responses or affecting how he processes observed
material. Thus, Grusec and Mischel (1966) found
that children who interacted with a highly
rewarding and powerful model were later able to
recall more of his behavior when given external
incentives for doing so than were those who
interacted with a model who was low in
rewardingness and power. Presumably, these
learning differences were mediated by the amount
of attention paid to the model. This explanation
is in keeping with a theory of identification like
Maccoby’s (1959). She hypothesized that children
pay more attention to, and engage in more covert
rehearsal of, both the rewarding and punishing
behaviors of a powerful model since both are
relevant to guiding their plans about future
actions. In order for a child to be able to judge
which actions on his part will lead to the
administration of positive and negative resources
by the model, he must be familiar with how that
model has behaved in the past in similar
situations.

Another variable affecting acquisition is
perceived similarity. Thus, Rosekrans (1967)
found that high perceived similarity to a model
facilitated the learning of imitative responses.
Rosekrans suggests that attentiveness to a model
is increased by perceived similarity and thus
facilitates learning of the model’s responses.
Kagan (1967) has hypothesized that an observer
will pay closer attention to someone he sees as
similar to himself since that person is more
distinctive. He is more distinctive because he
violates the commonly held expectancy that we do
not share basic personality traits with most of
the strangers we meet. Thus, more will be learned
about his behavior.
4 So far we have been concerned with the 
' effects of the relationship between model and 
‘observer on the observer’s learning. It may 
also be that consequences the observer experi- 
‘ences for imitating a model subsequently 
affect the amount of attention he pays to that 
model. For example, when an individual is 
consistently rewarded for imitating the be- 
havior of a particular model, we might expect 
him to begin paying closer attention to, and 
hence learning more about, that model. 
Clearly, it would be important for him to 
- know as much as possible about what the 
model did so that he would be in a position 
to maximize the rewards he might later receive
for imitation. In order to test this
hypothesis of differential acquisition, children
in the present study were shown a movie of
two individuals engaging in distinctive be- 
haviors and were reinforced for performing 
the same behaviors as one of them. They were 
then shown a second movie of the same two 
individuals. At the end of the movie, they 
were reinforced for recalling everything they 
could about the behavior of both models in 
this second movie. If reinforcement for imita- 
tion affects learning, it could do so in more 
than one way. It might have its effect at a 
peripheral level, with children simply spend- 
ing more time looking at the model they had 
been reinforced for imitating. In order to see 
if this was so, a movie record was made of
the subjects’ eye movements as they watched
the second movie. It was thus possible to tell
how much time they spent looking at each 
model. 


EXPERIMENT I 


Method 
Subjects 


Subjects were 32 first and second graders from a 
Waterloo, Ontario school. There were 16 boys and 
16 girls, and the same male experimenter was used 
with all children. 


Procedure 


Phase 1. Each child was brought individually by
the experimenter to a three-room mobile laboratory
parked in the school yard. As they entered the first
room the experimenter said:


Today we’re going to play a special kind of game.
If you sit in that chair I’ll show you a movie. In
the movie you’ll see two things happen. You have
to do one of the two things you see happen. If
you do the right one, I’ll give you one of these
chips [the child was shown a chip]. You can
trade these chips in at the end for a prize. The
more of these chips you get, the bigger and better
the prize you’ll get. So the more times you do
the right thing, the bigger and better the prize
you’ll get.


The subject was then seated and shown a movie 
consisting of a series of separate short sequences. In 
each sequence two adult males (Models A and B), one 
on each side of the screen, performed a different be- 
havior. In no case did they interact with each other. 
The following are examples of what the two models 
did: while one pointed to his head, the other pointed 
to his foot; while one rubbed his eyes, the other
rubbed his stomach; while one tied his shoelace, the 
other rolled up his sleeve; while one pasted the 
picture of a fish on a board, the other pasted a 
picture of a bird on a board; and while one read
a book, the other read a magazine. The appearance
of a model on a given side of the screen was randomly
arranged, except that no model appeared on
the same side in more than two consecutive sequences.
After each sequence the movie was stopped,
and the child was asked to reproduce one of the
behaviors he had seen. If props had been used in
the movie, he was offered both of them. (In no
case did one model use a prop, while the other did 
not.) Half of the boys and half of the girls were 
reinforced with a chip and praise if they imitated 
Model A, and the others, if they imitated Model B. 
If the wrong model was imitated, the experimenter 
said firmly, “No, that is the wrong thing to do.” 
The next sequence of the movie was then presented, 
and the procedure repeated. When a criterion of 10
correct responses in succession had been reached, 
Phase 2 of the study began. 
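
The Phase 1 stopping rule can be made concrete with a short sketch. The code below is illustrative only and is not part of the original procedure; the function name `trials_to_criterion` and the coding of each trial as a boolean (True when the child imitated the reinforced model) are assumptions introduced here.

```python
from typing import Optional, Sequence


def trials_to_criterion(correct: Sequence[bool], run_length: int = 10) -> Optional[int]:
    """Return the 1-based trial on which `run_length` consecutive correct
    responses are first completed, or None if the criterion is never met."""
    streak = 0
    for trial, is_correct in enumerate(correct, start=1):
        # An error resets the count of successive correct responses.
        streak = streak + 1 if is_correct else 0
        if streak == run_length:
            return trial
    return None


# Hypothetical child: errors on trials 2 and 5, then ten correct in a row.
example = [True, False, True, True, False] + [True] * 10
print(trials_to_criterion(example))  # prints 15
```

Under this reading, the trials-to-criterion scores reported in the Results count every trial up to and including the tenth successive correct response.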

Phase 2. The child was brought to a second room 
of the trailer and seated in front of a table where 
an eye-marker camera was mounted. This camera
was designed to record eye movements using a
corneal-reflectance method. It was set on a horizontal
bar attached to two vertical bars which were, in
turn, attached to the table. The subject’s head was
placed between the vertical bars with his chin rest- 
ing on an adjustable chin rest. The camera was 
lowered to the top of his head so that an attached 
viewing tube could be positioned in front of his eye. 
The subject looked through the lens at the bottom 
of the tube so that it was possible to film both the 
movie that he was watching and the place on the 
screen at which he was looking. An adjustable 
cushion was firmly secured at the back of the child’s 
head so that his head movements were minimized. 
During the adjustment procedure, every effort was 
made to reduce any apprehension that the subjects 
might have, and a quite common response on their 
part was an eagerness to “see through the periscope” 
of the camera. 

When the apparatus was properly adjusted, the 
child was shown a second movie, and his eye movements
were filmed as he watched. This second movie
was of Models A and B, one on each side of the 
screen, again each displaying a series of different 
behaviors. There were 12 sequences of behavior, half 
displayed by Model A and half by Model B. At the 
end of each sequence, the positions of the models 
on the screen were reversed. Included among these 
sequences were the following: while one model read 
a paper, then rolled it up on a table, put a rubber 
band around it, and shoved it off the table, the other 
model looked at a large hat, dropped it onto the 
floor, put a small hat on his head, and dropped it
onto the floor. Again, while one model placed a dart
board against the wall, then put it on the floor,
moved his arm in a circular motion, and dropped
darts from both hands onto the floor, the other
model put a lamp shade on his head, paced back and
forth, and then twirled a hula hoop around his arm.

Two movies were shown, identical except that the 
behaviors the models performed were reversed. Half 
of the boys and half of the girls reinforced for
imitating a given model were shown one movie, while
the others were shown the second movie. 

Phase 3. Immediately after the second movie, the 
subjects were brought to a third room which contained
all the props that had been used in the
Phase 2 movie. The experimenter then said: 


You see here all the things that were used in the 
movie you just saw. In the first movie I wanted 
you to show me what only one person did. Now 
I want you to show me what both people did with 
all of these things. For everything you show me 
that’s right, I’ll give you another chip toward your 
prize. Now you show me as much as you re- 
member about what both people did with all of 
these things. 


For every correctly recalled behavior, the child was 
rewarded, and the item was recorded on a prepared 
checklist. He was frequently urged to recall every- 
thing that he could and was reminded again that 
the experimenter wanted to know what both models 
had done. When the subject claimed that he could
recall nothing else, the experimenter pointed to each 
prop that had not been used in the course of recall 
and asked him to try and remember what had been 
done with it. After this, the child was given a prize 
in exchange for his chips and was asked not to 
discuss any of the events in the trailer with his 
classmates. 


Results 


The number of trials to reach a criterion of 
10 correct responses in succession in Phase 1 
of the study was 30.1 for boys and 34.0 for 
girls reinforced for imitating Model A and 
28.6 for boys and 22.9 for girls reinforced 
for imitating Model B. Analysis of variance
of these data yielded no significant effect of 
the model reinforced (F = 2.55), sex of sub- 
ject (F < 1), nor was there a significant inter- 
action between the two variables (F = 1.48). 


Recall (Phase 3) 


The mean number of items of each model’s 
behavior recalled by children in all condi- 
tions is given in Table 1. Analysis of variance 
performed on these data revealed that there 
was a significant main effect of movie 
(F = 14.38, df = 1/24, p < .01), with more
items of behavior being recalled from Movie 1 
than from Movie 2. There was no main effect 
of reinforcement for imitation (F = 1.33), sex 
of subject, nor model recalled (F < 1, in both 
cases). The only interaction which reached
statistical significance was that of Reinforcement
for Imitation × Model Recalled
(F = 5.84, df = 1/24, p < .05). All other
interactions yielded an F of less than 1, except
for that between model recalled and
movie (F = 3.74, df = 1/24, p < .10). Further
analysis of the statistically significant
interaction revealed that more items of
Model A’s behavior were recalled than of
Model B’s when the subjects were reinforced
for imitating Model A in Phase 1 (t = 1.92,
df = 15, p < .05, one-tailed). When the subjects
were reinforced for imitating Model B,
there was a tendency, although it was not
statistically significant, for more to be recalled
of Model B’s than Model A’s behavior
(t = 1.51, df = 15, p < .10, one-tailed).


TABLE 1
Mean Number of Items Recalled of Each Model’s Behavior, Mean Number of
Seconds Spent Looking at Each Model, and Mean Proportion of Total
Behavior of Each Model Recalled in First Half of All Items Recalled

                                             Movie 1                             Movie 2
                  Imitation          Boys             Girls              Boys             Girls
Behavior          reinforced    Model A  Model B  Model A  Model B  Model A  Model B  Model A  Model B

Recall            Model A         11.00    11.25    12.00    10.50     9.00     5.50     7.15     5.50
  (Experiment I)  Model B          8.00    11.25     8.00    10.25     6.50     6.50     6.75     6.75
Eye movements     Model A        138.25   114.75   127.00   148.50   160.75   128.50   142.25   122.00
                  Model B        145.75   128.00   138.25   137.00   125.50   115.00   152.50   157.25
Recall            Model A         12.75     9.25    11.50    10.00    11.00     9.50    10.00     7.50
  (Experiment II) Model B          7.50    16.00     8.25    13.25    10.25    13.25    12.25    16.75
Order of recall   Model A            Al       AL      .57       AL      AZT      .52      .44      .52
                  Model B           .49       Ad      .48      .52      .43       SL      .50      .48


Eye Movements (Phase 2) 


The mean number of seconds that subjects 
in each group looked at each model is also 
presented in Table 1. Analysis of variance of 
these data yielded no significant effects, with 
F less than 1 in all cases except for the main 
effect of sex (F = 1.06) and the interaction 
between sex and movie (F = 1.56). Clearly, 
subjects did not spend more time looking at 
one model than at the other, regardless of the 
group to which they had been assigned. 


Discussion 


The results of this experiment suggest, at 
least, that reinforcement for imitation may 
have an effect on subsequent learning. Subjects
were able to recall more of the behavior
of one model when they had been reinforced
earlier for imitating him. Findings for the
other model, although they did not reach an
acceptable level of statistical significance,
were in the predicted direction. These differ- 
ences in acquisition, however, were not paral- 
leled by differences in the amount of time 
children spent looking at a given model. Re- 
gardless of the experience they had had in 
Phase 1, the subjects spent equal amounts 
of time looking at each model. 

The eye-movement data, then, indicate that 
an explanation of the acquisition findings in 
terms of differential observing responses could 
be dismissed. Before an explanation involving
differential coding of the model’s behavior 
could be entertained, however, a second 
experiment was conducted to look at an addi- 
tional possibility. If the subjects during 
Phase 3 had tended to recall first all of the 
behavior that they could remember of the 
model that they had been reinforced for 
imitating, there would have been a shorter 
time delay between presentation of this 
model’s behavior and its recall than between 
the other model’s behavior and its recall. As 
a result, more would have been forgotten of 
the latter’s behavior, simply as a function of 
the passage of time and events occurring 
during it. Increased recall of the behavior of 
the model that the subjects had been reinforced
for imitating, then, would have been
due to the order in which they recalled ma- 
terial—presumably a result of the response 
learned in Phase 1 to perform one model’s 
behavior and not the other’s. If this pos- 
sibility could be ruled out, an explanation in 
terms of differential processing of observed 
material would seem tenable. Thus, behaviors 
displayed by a model whom subjects had 
been reinforced for imitating might have been 
more easily admitted to long-term memory 
storage or more easily retrieved once there. 


EXPERIMENT II 


Method 
Subjects 
Subjects were 32 third graders from a Metropolitan
Toronto separate school. There were 16 boys and 16
girls, and the same female experimenter was used
with all children.3


Procedure 


The procedure was identical to that of the first 
experiment with the following exceptions. New 
movies were made, this time using two adult females 
as models. Films of subjects’ eye movements were not 
obtained, but the order in which the subjects recalled 
the behavior of the two models was recorded. 


Results 


The number of trials to reach the criterion 
of 10 successive correct responses in Phase 1 
was 26.9 for boys and 15.6 for girls rein- 
forced for imitating Model A and 19.9 for 
boys and 15.9 for girls reinforced for imitating 
Model B. Analysis of variance of these 
data yielded a significant main effect of sex 
(F = 8.68, df = 1/28, p < .01). Thus, girls
reached criterion faster than boys. There was 
no significant effect of model reinforced 
(F = 1.70), nor was there a significant inter- 
action between sex and model reinforced 
(F = 1.96). 


Recall (Phase 3) 


The mean recall scores for each model of
children in all conditions are presented in
Table 1. Analysis of variance performed on
these data yielded a significant main effect
of model recalled (F = 4.67, df = 1/24,
p < .05), with more being remembered of
Model B’s behavior than of Model A’s. There
was no main effect of reinforcement for
imitation (F = 3.07), sex of subject (F < 1),
or movie (F < 1). The predicted interaction
of model recalled and reinforcement for
imitation was highly significant (F = 29.18,
df = 1/24, p < .001). None of the other
interactions reached an acceptable level of
statistical significance, with F < 1 for all
but Reinforcement for Imitation × Movie
(F = 2.02), Reinforcement for Imitation ×
Sex of Subject × Movie (F = 1.20), Reinforcement
for Imitation × Model × Movie
(F = 1.59), and the four-way interaction
(F = 2.08).


3 The authors wish to thank Nancy Satterberg,
who acted as experimenter.

Further analysis of the Model Recalled ×
Reinforcement for Imitation interaction indicated
that more was remembered of Model A’s
than Model B’s behavior when the subjects
had been reinforced for imitating Model A
(t = 2.30, df = 15, p < .025, one-tailed),
while more was remembered of Model B’s
behavior when the subjects had been reinforced
for imitating Model B (t = 5.36,
df = 15, p < .001).


Order of Recall (Phase 3) 


For each child, the proportion of all the 
items that he was able to remember of each 
model’s behaviors that he produced in the 
first half of his total recall (i.e., both models 
combined) was calculated. The mean propor- 
tions for each group are presented in Table 1. 
It is clear from these data that there was no 
tendency on the part of subjects to recall 
a greater proportion of the total of one 
model’s behavior than of the other’s in the 
first half of recall. In fact, the subjects ap- 
peared to alternate between the two models 
in their recall, with an average of 6.19 
(SD = 2.14) sets of behaviors, or runs of 
material from a single model, being recalled. 
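
The two order-of-recall measures described above can be sketched as follows. This is a minimal illustration, not the authors’ scoring procedure: the function names, the coding of a child’s recall as a list of model labels, and the treatment of an odd middle item as belonging to the second half are all assumptions made here.

```python
from itertools import groupby
from typing import Dict, List


def first_half_proportions(recall_order: List[str]) -> Dict[str, float]:
    """For each model, the share of that model's recalled items that fell
    in the first half of the child's total recall (both models combined)."""
    half = len(recall_order) // 2  # assumption: an odd middle item counts as second half
    first_half = recall_order[:half]
    return {
        model: first_half.count(model) / recall_order.count(model)
        for model in sorted(set(recall_order))
    }


def count_runs(recall_order: List[str]) -> int:
    """Number of runs, i.e., unbroken stretches of items from a single model."""
    return sum(1 for _ in groupby(recall_order))


# Hypothetical recall protocol: each entry names the model whose behavior was recalled.
order = ["A", "A", "B", "A", "B", "B", "A", "B"]
print(first_half_proportions(order))  # {'A': 0.75, 'B': 0.25}
print(count_runs(order))              # 6
```

A child who recalled all of one model’s behavior before turning to the other would produce exactly two runs and proportions near 1.00 and 0.00, which is the pattern the order-of-recall explanation would require.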


Discussion 


These two experiments support the hy- 
pothesis that the consequences that an ob- 
server experiences for imitating a given model 
will affect the amount he learns about the 
later behavior of that model. The results of
the first study, though not impressively 
strong, were in the predicted direction. Those 
of the second were in the same direction and
quite highly statistically significant. In both
studies children were able to recall more of 
the subsequent behavior of a model that they 
had been reinforced for imitating than of 
one that they had not been reinforced for 
imitating. 

From the first study it is evident that 
consequences for imitation did not affect 
acquisition through their influence on periph- 
eral attending or observing responses. Photo- 
graphs of their eye movements indicated that 
children in all conditions spent equal amounts 
of time looking at each model. The second 
study ruled out the possibility that subjects 
might have initially recalled all the behavior 
of the model that they had been reinforced 
for imitating, thus forgetting, by the time 
they finally began to recall it, some of the 
other model’s behavior. Subjects alternated 
between models in their recall, not appearing 
to immediately focus on one model and ignore 
the other. It seems reasonable, then, to talk 
in terms of reinforcement for imitation 
having led the subjects to differentially proc- 
ess the material they later observed. Thus, a 
child who was reinforced for imitating a given 
model might have had either a greater tend- 
ency to admit representations of that model’s 
subsequent behavior into his long-term mem- 
ory storage, dealt with it in such a way that 
it was less prone to interference once there, or 
organized it better so that it was more easily 
retrieved during the acquisition test. The 
specific mechanism, of course, depends on 
one’s notions about long-term memory. 

Why might this differential processing— 
regardless of its particular form—occur? 
Clearly, the more knowledge one has about 
the way an individual one has been rein- 
forced for imitating behaves, the greater the 
number of rewards it is possible to amass. 
And the children in this study may have 
expected that, since pleasant consequences for 
imitating one of the models had been forth- 
coming in the past, they were likely also to 
occur in the future. Thus, they may have 
attempted to learn as much as possible about 
what that model did in order to maximize 
future rewards. 

The results of the study, then, indicate 
that the effects of reinforcement for imitation
on acquisition have to be considered in
any adequate theory of imitation. This variable
must be added to those of the model’s
social characteristics, like perceived similar- 
ity and one or both of nurturance and 
power, which also affect acquisition indirectly. 
Characteristics of the observer, alone and in 
interaction with the model’s characteristics, 
could also have their effect on acquisition. 
That many or even all of these variables also 
affect willingness to perform is highly probable.
But determinants of the translation of
what has been learned into performance, independent
of external incentive, represent
another area of investigation completely.

In our culture, one area in which children 
receive a great deal of reinforcement for imi- 
tation of designated models is that of sex-role 
learning. Boys are rewarded for imitating 
male models and girls for imitating female 
models. It would seem, on the basis of the 
results of the present studies, that children 
should soon begin to learn more about the
behavior of same-sexed than of opposite-sexed
models. The following study was
designed to test this hypothesis.


EXPERIMENT III: EFFECTS OF SEX OF
MODEL AND OBSERVER ON ACQUISITION


Traditionally, within both the psychoana- 
lytic and the social learning model, the devel- 
opment of appropriate sex-typed behavior has 
been conceptualized as a function of identi- 
fication with a same-sexed model. As well, 
social learning theorists have suggested that 
reinforcement by socializing agents for imi- 
tated sex-typed behaviors is an important 
factor strengthening sex-role development. 
Recent statements of this view have been 
put forth by Kagan (1964) and Mischel 
(1966). Within a social learning framework, 
then, the amount of sex-typed behavior that 
an individual displays should be determined, 
first of all, by the degree of identification 
with a same-sexed model—affected by the 
model’s nurturance to the observer, his con- 
trol of resources, and his similarity to the 
observer as perceived by that observer. (This 
assumes, of course, that the same-sexed model 
displays sex-appropriate behaviors. Clearly, a 
boy who is maximally identified, e.g., with a 
feminine father is not going to display many 
masculine behaviors.) In addition, the rewards,
both positive and negative, that an
observer receives for his imitation should in- 
fluence his display of masculine or feminine 
behaviors. The son who is identified with a 
feminine father may not exhibit many femi- 
nine behaviors if he is continuously punished 
for them. And the son who is weakly identi- 
fied with a masculine father may behave in 
a very masculine fashion if the male be- 
haviors that he occasionally imitates are 
consistently and strongly rewarded. 

The results of the first two studies, how- 
ever, suggest that reward for imitation of a 
model has more than just the simple effect 
of strengthening specific behaviors which an 
observer imitates. It may also increase the 
strength of identification with that model by 
affecting the amount the observer learns 
about him. In the case of sex-role develop- 
ment, it could well be that, when boys are 
continuously rewarded for imitating male 
models and girls for imitating female 
models, they both begin to learn more about 
same-sexed models and correspondingly less 
about opposite-sexed models; that is, they 
become even more identified with them. For
by so doing they would substantially increase 
their chances of gaining rewards. Thus, so- 
cializing agents may have to rely less and less 
on rewards and punishments for sex-appropri- 
ate behavior, with children displaying increas- 
ingly appropriate behaviors not only because 
they expect reward for doing so, but because 
they have a preponderance of these in their 
learned behavior repertoires. While boys at 
an early age may even be equally familiar 
with the behaviors displayed by both their 
mothers and fathers, as they grow older and 
are exposed to more and more direct training 
for masculine behavior, they may come to be 
less familiar with the behaviors that females 
in our culture display. When they grow up 
they may be unskilled at sweeping, dusting, 
and making beds not just because they have 
not had practice in doing these things, but 
because they have never really concentrated 
too closely on how they are done. 

If this reasoning is correct, we should find 
that boys, when presented with an adult male 
and female, learn more about the male’s 
behavior, while girls, who observe the same 
two models, learn more about the female’s
behavior. Maccoby and Wilson (1957), employing
11-year-olds as subjects, found that
boys recalled more of the behavior of male 
models their own age than they did of young 
female models, while the opposite relation- 
ship held for girls. That this finding should 
also hold for children observing adult models 
seems plausible. It should also be possible to
see changes in amounts learned about the 
behavior of male and female models with age. 
Thus, younger children, who have experienced 
less reinforcement for sex-appropriate behav- 
ior, should show a smaller difference in recall 
than older ones who have had more reinforce- 
ment for sex-appropriate behavior. The pres- 
ent study was designed to test the hypothesis 
that children learn more about the behavior
of same-sexed adult models, and that this 
tendency increases with age. Five- and 7- 
year-old boys and girls were shown movies 
of an adult male and female each displaying 
a series of distinctive behaviors. They were
then rewarded for recalling everything they 
could about the behavior of both models. 

All the sequences of behavior in the movie 
which subjects viewed, except for two, could 
be characterized as neither masculine nor 
feminine in content. The two which could be 
involved aggression and nurturance. Boys 
tend to be more aggressive physically than 
girls and girls more oriented than boys 
toward nurturant behavior (Kagan, 1964). 
Presumably this is due, in part at least, to 
differential reinforcement received by the two 
sexes for these behaviors. Therefore, it was 
hypothesized that children, in addition to 
learning more about models they are rein- 
forced for imitating, would also learn more 
about classes of behavior they are reinforced 
for imitating, with girls recalling more of the 
sequence of nurturant behavior and boys 
recalling more of the sequence of aggressive 
behavior. 


Method 
Subjects 


Subjects were seventy-two 5-year-olds and seventy-two
7-year-olds from a Waterloo school. Half
of the children in each age group were boys, and
half were girls. Two experimenters, one male and
one female, were used.4


4 The authors wish to thank Pat Mertl, who acted
as the female experimenter.
Procedure 


Phase 1. Children were brought individually from 
their class to the mobile laboratory. As soon as they 
entered, the experimenter seated the child in front 
of a movie screen and showed him a movie. The 
only instruction given in this first phase was to 
watch the movie. 

The movies used were quite similar to those used 
in Phase 2 of the previous studies, except that they 
starred an adult male and female model instead of 
two same-sexed models. Half of the boys and half 
of the girls at each age level saw one movie while 
the rest of the children saw a second movie. These 
two movies differed only in that the roles played by 
the models in them were reversed. Half of the 
boys and half of the girls at each age level who 
viewed each movie were exposed to the male experi- 
menter and the rest of the children to the female 
experimenter. 
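
The counterbalancing described in this paragraph amounts to a fully crossed design. The sketch below enumerates its cells. It assumes equal cell sizes, which the repeated “half ... half” wording suggests but the report does not state outright, and the variable names are introduced here purely for illustration.

```python
from itertools import product

# Between-subjects factors named in the Method section.
ages = ("5 yr.", "7 yr.")
sexes = ("boy", "girl")
movies = (1, 2)
experimenters = ("male", "female")

total_children = 144  # seventy-two 5-year-olds plus seventy-two 7-year-olds

# Every combination of age, sex of child, movie, and experimenter.
cells = list(product(ages, sexes, movies, experimenters))
per_cell = total_children // len(cells)  # assumes children were split evenly

print(len(cells), per_cell)  # prints: 16 9
```

Sex of model is not listed among the factors because each child saw both the male and the female model, so it varies within subjects.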

All the sequences of behavior included in the 
movie were neutral as far as sex typing was con- 
cerned except for two—a nurturant and an aggres- 
sive one. In the nurturant sequence the female 
model in Movie 1 and the male model in Movie 2
dressed and undressed a teddy bear and threw him
into the air. In the aggressive sequence the male
model in Movie 1 and the female model in Movie 2
placed two books on end, put a third one on top
of these two, and then, with a swift motion,
knocked all three books onto the floor. The nurturant
sequence comprised the fourth in the series of six 
behaviors performed by a given model, while the 
aggressive sequence comprised the fifth in the series 
of six behaviors. 

Phase 2. Immediately after he had seen the movie, 
the child was brought to a second room of the 
trailer which contained all the props used in the 
movie. He was urged to remember everything he 
had seen and to reproduce as much as possible. For 
everything he correctly remembered he was given an 
M&M, as well as verbal praise. After this the experi- 
menter showed the child each prop that had not 
been used during recall and asked him to try and 
remember how it had been used in the movie. After 
the recall phase all subjects were asked not to discuss
the experiment with any of their fellow students.

Each of the subject’s correct responses was
recorded on a checklist, consisting of all items dis- 
played in the movie. In order to estimate interscorer 
reliability, the performance of 10 subjects run by 
each experimenter was observed through a one-way 
mirror by the other experimenter and was inde- 
pendently recorded and scored. The two raters were 
in agreement 96% of the time. 
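
The report does not say how the agreement figure was computed. One common reading is simple item-by-item percent agreement, sketched below; the function name `percent_agreement`, the boolean coding of each checklist item, and the example data are hypothetical.

```python
from typing import Sequence


def percent_agreement(rater_a: Sequence[bool], rater_b: Sequence[bool]) -> float:
    """Percentage of checklist items on which two raters gave the same score."""
    if len(rater_a) != len(rater_b):
        raise ValueError("both raters must score the same items")
    if not rater_a:
        raise ValueError("at least one scored item is required")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)


# Hypothetical scoring of 25 checklist items; the raters differ on the last one.
rater_a = [True] * 20 + [False] * 5
rater_b = [True] * 20 + [False] * 4 + [True]
print(percent_agreement(rater_a, rater_b))  # prints 96.0
```

Percent agreement does not correct for agreement expected by chance, so it tends to run higher than chance-corrected indices computed on the same data.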


Results 
Total Recall 


The mean number of items recalled by 
children in each group is presented in Table 2. 
Analysis of variance of the recall data yielded 
a statistically significant main effect of age 
(F = 23.18, df = 1/128, p < .001), with
7-year-olds recalling more than 5-year-olds.
None of the other main effects were
significant (the Fs ranged from less than 1
to 2.41). There were significant interactions
between sex of model and sex of child
(F = 12.05, df = 1/128, p < .001), sex of
model and movie (F = 4.48, df = 1/128,
p < .05), sex of child and movie (F = [illegible],
df = 1/128, p < .05), and sex of model, sex
of child, and movie (F = 4.24, df = 1/128,
p < .05). Further analysis of the terms involved
in the three-way interaction showed
that boys remembered more of the male than
the female model for both Movies 1 and 2
(t = 2.45 and 2.49, df = 70, p < .01, one-tailed),
while girls remembered more of the
female than the male model only for Movie 2
(t = 3.06, df = 70, p < .005, one-tailed), the
comparison for Movie 1 yielding a t of 1.12.


TABLE 2
Mean Number of Items Recalled of Each Model’s Behavior in All Conditions

                                  Boys                            Girls
                        Male model    Female model      Male model    Female model
Movie  Experimenter    5 yr.   7 yr.   5 yr.   7 yr.   5 yr.   7 yr.   5 yr.   7 yr.

1      Male             3.56    5.33    3.00    2.56    2.67    7.33    4.44    4.78
       Female           4.11    4.57    2.89    4.11    3.58    6.78    3.22    5.67
2      Male             5.33    7.22    4.00    5.44    2.44    3.67    2.78    5.00
       Female           4.00    6.33    2.89    5.68    3.56    5.11    6.33    6.67


Recall of Sex-Typed Behaviors


Table 3 presents the mean number of
aggression and nurturance items remembered
by each group. Results of an analysis of
variance of aggression items yielded only one
significant main effect, that of age of subject
(F = 9.06, df = 1/128, p < .01), with 7-year-olds
remembering more than 5-year-olds.
All other Fs ranged from 1.63 to less
than 1.


TABLE 3
Mean Number of Aggression and Nurturance Items Remembered in Each Condition

                              Aggression                    Nurturance
                      Male model   Female model     Male model   Female model
Experimenter  Age     Boys  Girls   Boys  Girls     Boys  Girls   Boys  Girls

Male          5 yr.    .56    .44    .56    .44     1.00    .67   1.22   1.67
              7 yr.   1.11   1.11    .78    .89     2.00   1.22    .67   1.44
Female        5 yr.    .78    .44    .33   1.33     1.00   1.00   1.44   1.00
              7 yr.   1.11   1.22   1.00   1.11     1.67   1.78   1.22   1.89

Analysis of variance of the nurturance 
items again yielded a main effect of age 
of subject (F = 6.16, df = 1/128, p < .05), 
with 7-year-olds remembering more, as 
well as an interaction between sex of subject 
and sex of model (F = 4.40, p < .05) and
age of subject and sex of model (F = 7.14,
p < .01). All other terms ranged from less
than 1 to 3.64. Further analysis of the significant
interactions indicated that girls remembered
more of the female model’s nurturance
than did boys (t = 1.71, df = 70,
p < .05, one-tailed), while 7-year-olds remembered
more of the male model’s nurturance
than did 5-year-olds (t = 3.57, p <
.001).


Discussion 


The hypothesis that boys would remember 
more of an adult male than an adult female’s 
behavior was supported. The parallel hypothe- 
sis for girls was less clearly supported— 
although girls learned more about the female 
than the male’s behavior in the second movie, 
they learned equally as much about the two 
models in the first movie. There seems to 
be no immediately obvious explanation for 
the behavior of the girls who viewed the 
first movie. Whatever the answer, it is inter- 
esting that it was girls, and not boys, who 
deviated from predicted behavior, for it seems 
to be that girls are less strongly identified
with the female sex role than boys are with
the male sex role (e.g., Brown, 1957). 

The prediction that there would be an inter- 
action between age of subject and differential 
amount learned about the two models was not 
confirmed. It may be that the 5-year-olds 
were not young enough to provide an adequate
test of the hypothesis. Perhaps they
had already had enough experience with rein- 
forcement for sex-appropriate behavior to 
learn that it is generally wiser to pay more 
attention to a same-sexed than an opposite- 
sexed model. It would be of value, then, to 
repeat this study using a younger group of 
subjects. Finally, the predicted difference in 
recall of nurturant and aggressive behaviors 
was not found. 

In summary, the results of the studies pre- 
sented here suggest the following. It is clear 
from the first two studies that when individ- 
uals have been reinforced for imitating some- 
one, they begin to learn more about the way 
that person behaves. The third study showed 
that young boys, who are certainly rewarded 
for imitating males, learn more about the way 
males, rather than females, behave. The op- 
posite holds true, but to a lesser degree, for 
girls. It is not unreasonable to assume that 
this is because in our culture girls are less 
consistently rewarded for imitating females 
than are boys for imitating males. Girls are 
much freer to express interest in activities 
of the opposite sex (Kagan, 1964), and it is 
certainly much less reprehensible for a girl 
to behave like a tomboy than for a boy to 
behave like a sissy. The findings of the third 
study are open to more than one explanation. 
We know, for example, that people learn more 
about the behavior of those they see as similar 
to themselves (Rosekrans, 1967). And cer- 
tainly children may see same-sexed adults as 
more similar to themselves than opposite- 
sexed adults. But it is also quite plausible 
that the relationship demonstrated in the first 
two studies could adequately account for the 
relationships of the last study. 


RHETORICAL ELICITATION OF AGREEMENT IN PERSUASION 


DOLF ZILLMANN ¹ 


University of Pennsylvania 


The persuasive effect of the rhetorical elicitation of agreement responses was 
assessed under various conditions of initial attitude relative to the attitudinal 
position advocated. Additionally, the resulting resistance to counterpersuasion 
was measured. A significant increase in the effectiveness of persuasion was ob- 
tained, especially under conditions of initially opposed attitude. No facilitation 
of effectiveness was found under conditions of initially neutral attitude. Resist- 
ance to counterpersuasion was not differentially affected by treatments. The 
enhancement of cognitive involvement is discussed as an insufficient explana- 
tion. A rationale based mainly on the operant learning of the connotations of 
significance and certainty associated with verbal constructs such as the rhetori- 
cal agreement question was developed and employed to explain the results 
obtained. Also, an alternative explanation was proposed, maintaining that the 
findings might be due to a lowering of the communicatee’s defenses as a 
consequence of changes in source perception brought about by the style of 
language used. 


Since the Sophists it has been felt that, in 
a debate situation, it is advantageous for the 
speaker to get his opponent to express agreement 
with particular arguments advanced. 
Frequent agreement, it was felt, would create 
the impression of superior knowledge and rea- 
soning on the part of the speaker, and lead 
the audience to doubt the opponent’s expertise 
and credibility. Schopenhauer (orig. publ. 
1864), in proposing his 36 tactics of success- 
ful debate, urged speakers to seek the admis- 
sion of agreement by the opponent, but to 
avoid admitting agreement themselves, should 
the opponent employ this strategy. In debate, 
the most apparent means of eliciting the criti- 
cal responses of admission are the use of ques- 
tions to the opponent, directly soliciting the 
expression of agreement. “Isn’t this so and 
so?” may be considered the prototype of such 
an agreement question. This is not to say, 
however, that the elicitation of an agreement 
response is necessarily via questions only. 
Other verbal constructs could be used, and 
nonverbal cues are not inconceivable in this 
function. 


1 The author wishes to thank Percy H. Tannen- 
baum, for valuable comments, and David O. Sears 
and James L. Hoyt, for their help with research 
materials. 

Requests for reprints should be sent to Dolf 
Zillmann, who is now at the University of Indiana, 
Radio-TV Building, Room 216, Bloomington, Indi- 
ana 47401. 


Such speculations concerning debate could 
be extended and applied to persuasive situa- 
tions not immediately involving an opponent. 
A particular opponent may be challenged in 
absentia, or an “imaginary” or “generalized” 
opponent may be created and “argued with.” 
Also, the speaker might behave toward his 
audience, or toward subsections of it, as if 
it were an opponent, able and likely to express 
its stands on issues raised in connection with 
agreement-eliciting cues. In such situations, 
which are characteristic of persuasion in the 
context of mass communication, the provoca- 
tion of an agreement response is rhetorical in 
the sense that an overt response of an 
“opponent” is not possible or, at least, not 
expected by the speaker. 

If the elicitation of agreement responses is 
rhetorical rather than direct, it most prob- 
ably does not result in the overt expression 
of agreement, which is presumed to have the 
suggested critical implications in debate. Ap- 
parently, then, other mechanisms must be 
involved to account for any particular effects 
brought about by the use of this rhetorical 
tactic or strategy, which is so heavily em- 
ployed in all domains of persuasive com- 
munication. To explore the way in which the 
rhetorical elicitation of agreement operates, 
three possible psychological mechanisms are 
developed, and a procedure is devised 
which permits the assessment of the relative 
predictive accuracy of the various models. 

This investigation utilizes the agreement 
question, which is proposed to be the com- 
munication mechanism par excellence to elicit 
an agreement response in the recipient. 
Grammatical transformations to interrogative 
forms are used to operationalize the provo- 
cation of agreement without affecting the 
content of arguments presented. The cru- 
cial experimental variation is thus purely 
syntactical. 

The agreement question is unique in that 
it appears to be suggestive of an agreement 
response. It typically implies agreement with 
the assertion conveyed in it. For example, the 
question “Prices have gone up, haven’t they?” 
seems biased toward the answer “Yes, they 
have.” The elicitation of the objection “No, 
they haven’t” is, in all likelihood, not intended 
or expected. This agreement bias can take 
negative form, grammatically speaking. For 
example, it should be recognized that the 
response “No, it hasn’t” to the question 
“Driving hasn’t become any safer, has it?” 
expresses genuine agreement. In general, any 
assertion, positive or negative, put into an 
agreement question undergoes a double trans- 
formation: declarative to interrogative, and 
reversal of the affirmative-negative mode. 
Thus, the basic forms are “A b.” → “A b v?” 
or “A ~ b?” and “A ~ b.” → “A ~ b a?” or 
“A b?”, where A b denotes an elementary 
assertion, v some negative, assertion-bound 
question marker (e.g., Isn’t it? Haven’t they? 
Don’t we?), and a some positive, assertion-bound 
question marker (e.g., Is it? Have 
they? Do we?). These various forms are not 
necessarily equipotent with respect to the 
elicitation of agreement responses. It is con- 
ceivable that the particular forms meet dif- 
ferential habit strengths as the result of 
preferences in usage. 
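
The polarity reversal described above can be illustrated with a small sketch. The class, the function names, and the two-field representation of an assertion are illustrative assumptions and not part of the original study. The sketch covers only the appended-marker form and shows that the agreeing reply follows the polarity of the assertion, not that of the marker.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assertion:
    """Hypothetical representation of an elementary assertion."""
    text: str       # declarative clause without final punctuation
    negative: bool  # True if the assertion itself is negated


def tag_question(assertion: Assertion, positive_marker: str, negative_marker: str) -> str:
    """Append the marker of opposite polarity to form an agreement question."""
    marker = positive_marker if assertion.negative else negative_marker
    return f"{assertion.text}, {marker}?"


def agreeing_reply(assertion: Assertion) -> str:
    """Agreement echoes the polarity of the assertion itself."""
    return "No" if assertion.negative else "Yes"


prices = Assertion("Prices have gone up", negative=False)
driving = Assertion("Driving hasn't become any safer", negative=True)

print(tag_question(prices, "have they", "haven't they"))
print(tag_question(driving, "has it", "hasn't it"))
print(agreeing_reply(prices), agreeing_reply(driving))
```

The two example sentences are the ones quoted in the text. A reply of "No" to the second question counts as agreement because the assertion it carries is negative.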

A first rationale from which to predict per- 
suasive implications of the rhetorical use of 
agreement questions may be advanced on 
the basis of learning-theoretical considera- 
tions, mainly (but certainly not exclusively) 
involving operant conditioning. It is assumed 
that the mature language user has a condi- 
tioning history of both the encoding and the 
decoding of agreement questions in a social 
context. Typically, the agreement question is 
used to elicit the respondent’s admission of 
the appropriateness and the value of the 
persuader’s argumentation. This “having to 
agree” with some point of view—being 
“nailed”—is characteristically interpreted and 
experienced as the losing, or at least the par- 
tial losing, of an argument. Obviously, the 
assertion conveyed in the agreement question 
must have some minimal power or strength, 
defined as subjectively felt significance and 
certainty, to resist easy refutation. The en- 
coder should thus tend to elicit agreement 
with only the best arguments available to 
him. A poor choice, resulting in disagreement, 
would have undesirable, unpleasant conse- 
quences. It can be assumed that if the ques- 
tion user starts at a theoretical point of non- 
differentiation, the conveyed “good” argument 
will be positively reinforced and develop 
response strength, and the conveyed “poor” 
argument will be negatively reinforced and 
lose response strength. This development may 
be assisted by other mechanisms, such as, for 
example, social learning. The instrumental 
value in a social situation defines the 
development of a pronounced association of the 
agreement question with relatively powerful 
arguments. It is proposed that the association 
established for the agreement question gen- 
eralizes to the rhetorical agreement question. 
As a consequence, the rhetorical agreement 
question gains in persuasive power, relative 
to the assertion in statement form. This power 
increase should be independent of the respon- 
dent’s initial attitude. Thus, the question form, 
in general, should exhibit greater effectiveness. 
Additionally, it might be expected that 
the significance-denoting rhetorical agreement 
question somewhat facilitates resistance to 
counterpersuasion. 

In developing a second, quite independent 
rationale, it is assumed that the mature lan- 
guage user has developed a very pronounced 
tendency to respond with some answer to a 
direct question, since both the inability to 
respond meaningfully and the refusal to re- 
spond usually define noxious states in the 
respondent. Whenever the communication 
situation provides cues which make the overt 
response appear unnecessary and improper, 
it is expected that this response tendency 
generalizes to a covert response. Such situa- 
tions exist in many social exchanges, but they 
are particularly characteristic of mass com- 
munication: the persuader on the air, for 
example, addressing the recipient from the 
screen, asks questions that are rhetorical by 
physical circumstances—if feedback irrele- 
vancy rather than communicator intent is 
accepted as the critical definitional criterion. 
The “rhetoricalness” of questions, not ap- 
parent in the encoding behavior, but rather 
in situational cues, may effect the inhibition 
of the overt components of the verbal re- 
sponse. It may be assumed, however, on the 
basis of the postulated response generaliza- 
tion, that the executed covert response pro- 
duces psychological consequences similar to 
those associated with the entire response— 
social factors aside. 

It may be argued that the assumed covert 
agreement response elicited by a rhetorical 
agreement question, as compared to the rela- 
tively passive decoding of an assertion in 
statement form, raises the individual’s level 
of awareness. It makes him cognizant of his 
position on a particular issue, and it may 
activate issue-related cognitions to consolidate 
and bolster his evaluations, thereby facili- 
tating his involvement with the issue, and 
possibly producing some self-commitment. If 
the position advocated in the persuasive 
assertion coincides with the individual’s initial 
position, the elicitation of agreement responses 
should enhance communication effectiveness. 
Similarly, if the individual has not developed 
an explicit stand with respect to the per- 
suasive issue, and consequently his “position” 
is not dissonant with the one advocated, his 
greater cognitive involvement should also 
enhance effectiveness. However, if the posi- 
tion advocated conflicts with the individual’s 
position, increased awareness and cognitive 
involvement should only facilitate the recogni- 
tion of the discrepancy, and thus yield a more 
vigorous refusal to accept the persuasive asser- 
tion. In this case the individual would respond 
with disagreement to the agreement question, 
and this disagreement would be associated 
with the activation of refutation attempts and 
result in the explicit rejection of the persua- 
sive assertion. Effectiveness of the rhetorical 
agreement question under such conditions 
would therefore be lower rather than higher 
than that of the statement form. 

The assumption of greater cognitive in- 
volvement associated with the covert verbal 
response to a rhetorical agreement question 
also has implications for predictions regarding 
resistance to counterpersuasion (cf. McGuire, 
1964). The greater explicitness in the indi- 
vidual’s cognition of his attitudinal position 
may result in self-commitment, due to his 
motivational tendency to perceive his own 
formed opinions as nonfluctuating and stable. 
This self-commitment should produce more 
pronounced resistance to attitude change. In 
contrast, communicating the persuasive asser- 
tion in statement form should leave the indi- 
vidual relatively unprotected against counter- 
attacks. 

Finally, in proposing a third rationale, it 
may be argued that the use of rhetorical 
questions defines a delivery style of relatively 
high “extraversion” (Bowers, 1965). The relatively 
dynamic presentation, compared to the 
“subdued” presentation of assertions in state- 
ment form, might make the communicator’s 
persuasive attempt more apparent, and re- 
spondents might protect themselves against 
his “rhetorical virtuosity” by discounting his 
arguments somewhat (cf. Dietrich, 1946). As 
a consequence, the statement form should 
have relatively greater effectiveness, independent 
of initial attitude. Similarly, the state- 
ment form should yield more pronounced re- 
sistance, since only the dynamic presentation 
creates some doubts as to the trustworthiness 
of the source. 


METHOD 
Subjects 


Ninety male and female undergraduates from 
introductory psychology classes at the University of 
Wisconsin served as subjects. They participated to 
earn extra points toward their course grade. The 
male/female ratio was approximately 2:3 throughout 
all experimental conditions. 


Design 


A 2 × 3 factorial design was employed, based on 
two levels of grammatical form (statement, rhetorical 
agreement question) of critical arguments in the 
persuasive communication, and on three levels of 
initial attitude (unfavorable, neutral, favorable) 
toward the person constituting the focus of the 
persuasive issue. Fifteen subjects were assigned to 
each of the various conditions. 




Procedure 


Six groups of 15 subjects met with the experi- 
menter for a single session. Each group was randomly 
assigned to one of the two conditions of grammatical 
form. Within each group, 5 subjects were randomly 
assigned to each condition of initial attitude. This 
was unobtrusively accomplished through the distribu- 
tion of manipulatory booklets in prearranged order. 
Subjects were not permitted to interact with one 
another in any way. The seating arrangement eliminated 
face-to-face contacts. All instructions were 
tape-recorded. 
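
The assignment scheme described above can be sketched as follows. This is a minimal illustration, not the authors' procedure: the text states only that groups were randomly assigned to grammatical form and that booklets were handed out in a prearranged order. The split of three groups per form is an assumption inferred from the 15 subjects per cell, and the function name, seed, and shuffling are hypothetical.

```python
import random
from collections import Counter

FORMS = ["statement", "question"]
ATTITUDES = ["unfavorable", "neutral", "favorable"]


def assign(seed: int = 0):
    """Return (group, seat, form, attitude) tuples for 6 groups of 15 subjects."""
    rng = random.Random(seed)
    group_forms = FORMS * 3          # assumption: three groups per grammatical form
    rng.shuffle(group_forms)         # groups are assigned to a form at random
    assignments = []
    for group_id, form in enumerate(group_forms):
        booklets = ATTITUDES * 5     # 5 booklets per attitude condition in each group
        rng.shuffle(booklets)        # stands in for the prearranged booklet order
        for seat, attitude in enumerate(booklets):
            assignments.append((group_id, seat, form, attitude))
    return assignments


cells = Counter((form, attitude) for _, _, form, attitude in assign())
for cell, n in sorted(cells.items()):
    print(cell, n)
```

Under these assumptions every one of the six cells of the 2 × 3 design receives 15 subjects, which matches the cell size reported in the Design section.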

The experimenter introduced the study as a project 
sponsored by the American Bar Association, aimed 
at the improvement of court procedures. The par- 
ticular research interest was said to be the decision 
process within a jury: specifically, how the individual 
jury member forms various attitudes and how these 
attitudes are modified as novel, relevant informa- 
tion is received during the proceedings of further 
testimony and the summations of prosecution and 
defense. Deviations from standard court procedures 
were explained as necessary to facilitate the system- 
atic investigation of such information-dependent 
judgmental adjustments. 

The trial chosen, the case of “The State of New 
York vs. Johnny Myers,” was developed from the 
Burdick case used by Sears (1965). It involves a 
juvenile who was charged with second-degree murder 
in the killing of his father. The defense claimed that 
the killing was manslaughter. 

Subjects received a booklet of background informa- 
tion on the case, giving all relevant personal data 
on the defendant, the deceased, and their family, the 
opening statements of the court, descriptions of 
prosecution and defense motions, the testimony of 
police officers and the coroner, and, in the appropri- 
ate conditions, that of the defendant’s school prin- 
cipal and a social worker. After exposure to this 
preliminary information, the subjects’ attitudes to- 
ward the defendant were assessed. 

Assessment of initial attitude was followed by the 
subjects’ exposure to the critical experimental com- 
munication: the tape-recorded summation of the 
defense attorney. The summation made references to 
some testimony that had not been previously pre- 
sented, and thus was not redundant. After exposure, 
subjects received a second booklet instructing them 
to generate a recommendation of a fair and adequate 
prison term for the defendant. 

Recommendation of the prison term was followed 
by the subjects’ exposure to the countercommunica- 
tion: the tape-recorded summation of the prosecution 
attorney, who also made reference to thus far not 
presented testimony. After exposure, the subjects 
received a third booklet, and again recommended a 
prison term. 

Finally, the actual purpose of the study was fully 
disclosed. Postexperimental interviews indicated that 
the subjects were deeply interested in the case. 
Without exception, they performed their tasks with 
extreme sincerity. 

Manipulation of initial attitude. In the neutral 
condition the testimony of the defendant’s principal 
and a social worker was omitted. The remaining 
testimony was strictly factual and relatively am- 
biguous with respect to evaluations of the defendant. 
The available evidence did not permit the develop- 
ment of strong feelings pro or con, that is, pro- 
nounced positive or negative affect toward the 
defendant. 

The testimony given by the principal and the 
social worker was manipulated to create negative or 
positive affect in the corresponding conditions of 
initial attitude. In the negative condition, the prin- 
cipal described the defendant as being thoroughly 
selfish, unreliable, and “stealing from other boys, 
disrupting classes, passing obscene pictures around, 
and being consistently one of the worst discipline 
problems in the school,” whereas in the positive 
condition, he described the boy as modest, self- 
reliant, helpful, and “behaving in the classroom as 
well as all the other boys.” Similarly, in the negative 
condition, the social worker described him as 
difficult, aggressive, and a gang member involved in 
the robbery of a grocery store, and in the positive 
condition as well-balanced, concerned, and a passive 
gang member not at all involved in the gang’s 
objectionable activities. 

Manipulation of grammatical form. The summation 
by the defense was composed of relatively independent 
parts presenting evidence. Typically, an argument was 
developed and then summarized in a concluding assertion. 
Ten of these argument-condensing, persuasive statements² 
were chosen to be transformed from the statement form 
into the agreement-question form. Four of the transformed 
sentences formed a recapitulation of general arguments. 
The six remaining sentences were placed after the 
build-up of particular arguments, and were interspersed 
throughout the text. Four affirmative statements were 
transformed to the negative-interrogative form (e.g., 
“On this night, and in this condition, Frank Myers was 
a threat to his own daughter.” to “Wasn’t Frank Myers 
a threat to his own daughter?”). Analogously, four 
negative statements were transformed to the 
affirmative-interrogative form (e.g., “But he never used 
his knife as a weapon before.” to “But did he ever use 
his knife as a weapon before?”). The remaining two 
statements were transformed from simple affirmative to 
affirmative with a negative-interrogative 
agreement-eliciting appendix (e.g., “Johnny was a 
peaceful boy.” to “Johnny was a peaceful boy, wasn’t he?”). 

The two versions of the defense speech were identi- 
cal in all other respects. Copies were made from an 
especially prepared tape of the statement version. For 
the question version, only the critical statements 
were cut out and replaced by similarly prepared 
transformed agreement questions. The splicing was 
undetectable. 


Dependent Measures 


Initial attitude toward the defendant was assessed 
on a verbally labeled 7-point scale, ranging from 
“very unfavorable” through “very favorable.” 

TABLE 1 

ANALYSIS OF VARIANCE OF EXPERIMENTALLY MANIPULATED 
INITIAL ATTITUDES BEFORE DEFENSE 
AND PROSECUTION SPEECHES 

Source                   df      MS        F 
Grammatical form (A)      1     .544      .900 
Initial attitude (B)      2   38.633    63.882* 
A × B                     2     .144      .239 
Error                    84     .605 

* p < .001. 


Recommended prison terms, the measure devised 
to assess the impact of the persuasive communications, 
were recorded in years, with .5 year as the 
smallest unit of measurement. Subjects expressed their 
recommendations in the form of a number written 
in an allocated space. The range of permissible 
sentences was from .5 year, the minimum term for 
manslaughter, to 50 years, the maximum term for 
second-degree murder. 

The persuasive impact of the critical experimental 
communication was measured in the first prison-term 
recommendation. Resistance to counterpersuasion was 
measured mainly in the difference in years between 
the first and the second prison-term recommendation. 


RESULTS 
Manipulation of Initial Attitude 


In order to determine whether the manipulation 
of initial attitude toward the defendant 
was successful, an analysis of variance was 
performed on the attitude ratings. As can be 
seen from Tables 1 and 2, the achieved differentiation 
is quite satisfactory. The mean 
scores exhibit the desired differences in the 
attitude conditions, but show only negligible 
differences between the grammatical-form 
groups within any particular attitude condition. 
The scores of the neutral condition appear 
to reflect a tolerable, slight bias toward 
favorableness (4.0 defines a neutral score). 
Both the conditions of unfavorable attitude 
and favorable attitude should be considered 
as only minimally polarized (the scores of 5.0 
and 3.0 were both qualified with “slightly” 
on the attitude scale). This minimal polarization 
has the advantage of leaving adequate 
space for polarizing effects to take, but it 
might constitute a rather weak manifestation 
of the theoretically relevant initial predispositions 
met by the various rhetorical forms 
of a communication. 


TABLE 2 

MEAN INITIAL ATTITUDE SCORES 

                            Initial attitude 
Grammatical form    Unfavorable    Neutral    Favorable 
Statement             5.200a       3.867b      3.007c 
Question              5.200a       3.600b      2.867c 

Note. Scores range from 1, very favorable, through 7, very 
unfavorable. Differences between the means were analyzed by 
the Newman-Keuls method. Means having different letter 
subscripts differ significantly at p < .05. 


TABLE 3 

ANALYSIS OF VARIANCE OF RECOMMENDED PRISON 
TERMS AFTER THE DEFENSE SPEECH 

Source                   df      MS        F 
Grammatical form (A)      1   29.469     5.308* 
Initial attitude (B)      2   96.886    17.452** 
A × B                     2   12.386     2.231 
Error                    84    5.552 

* p < .025. 
** p < .001. 


Impact of the Persuasive Communication 


As can be seen from Tables 3 and 4, the 
agreement-question version of the persuasive 
communication, taken across all conditions of 
initial attitude, yielded a significantly lower 
mean recommended prison term than the 
statement version. Inspection of the mean 
scores of the individual experimental groups 
reveals that the effect is highly significant 
under conditions of initially unfavorable attitude 
toward the defendant (p < .005, by 
multiple t test). Under conditions of initially 
favorable attitude, the effect is quite pronounced, 
but fails to meet conventional statistical 
criteria (p approaches .10 by multiple 
t test). In contrast, the effect under conditions 
of initially neutral attitude is negligible. 


TABLE 4 

MEAN RECOMMENDED PRISON TERMS AFTER 
THE DEFENSE SPEECH 

                            Initial attitude 
Grammatical form    Unfavorable    Neutral     Favorable    Combined 
Statement             6.600a       2.433b,c    2.600b,c     3.878(x) 
Question              4.133b       2.533b,c    1.533c       2.733(y) 

Note. Scores are in years. Differences between means were 
analyzed by the Newman-Keuls method. Means having no 
letter in their subscripts in common differ significantly at 
p < .05. The combined means, denoted by parenthesized 
subscripts, differ significantly at p < .025 (F = 5.308, 
df = 1/84). 
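
The analysis-of-variance entries for the first prison-term recommendation can be cross-checked from the cell means. The following is a minimal sketch, assuming equal cell sizes of 15, an error mean square of 5.552 (Table 3), and the six cell means of Table 4 as entered in the code. The function and variable names are illustrative and do not come from the original analysis.

```python
N_PER_CELL = 15
MS_ERROR = 5.552  # assumed reading of the error mean square in Table 3

# Cell means in years, as read from Table 4.
cell_means = {
    "statement": {"unfavorable": 6.600, "neutral": 2.433, "favorable": 2.600},
    "question": {"unfavorable": 4.133, "neutral": 2.533, "favorable": 1.533},
}
ATTITUDES = ("unfavorable", "neutral", "favorable")


def main_effect_ms(level_means, n_per_level):
    """Mean square of a main effect in a balanced design, from its level means."""
    grand = sum(level_means) / len(level_means)
    sum_of_squares = n_per_level * sum((m - grand) ** 2 for m in level_means)
    return sum_of_squares / (len(level_means) - 1)


# Marginal (combined) means for each factor.
form_means = [sum(row.values()) / len(row) for row in cell_means.values()]
attitude_means = [
    sum(cell_means[form][att] for form in cell_means) / len(cell_means)
    for att in ATTITUDES
]

ms_form = main_effect_ms(form_means, n_per_level=3 * N_PER_CELL)
ms_attitude = main_effect_ms(attitude_means, n_per_level=2 * N_PER_CELL)

print("combined means:", [round(m, 3) for m in form_means])
print("form:     MS =", round(ms_form, 3), " F =", round(ms_form / MS_ERROR, 3))
print("attitude: MS =", round(ms_attitude, 3), " F =", round(ms_attitude / MS_ERROR, 3))
```

Because the cell means are rounded to three decimals, the recomputed mean squares and F ratios should be expected to match the tabled values only to within rounding.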

The comparison across conditions of initial 
attitude shows the difference-reducing impact 
of the persuasive communication: the condi- 
tions of neutral attitude no longer differ sig- 
nificantly from the conditions of favorable 
attitude. The relatively greater change in the 
neutral conditions may be attributed to the 
prior lack of favorable information about the 
defendant. In the favorable-attitude condi- 
tions, this information was highly redundant, 
and, additionally, the recommendation of sen- 
tences may have suffered a floor effect (sub- 
jects were restricted to a minimum of .5 
year—a score chosen by 4 and 5 subjects, 
respectively, out of 15). Both the redundancy 
and the floor effect could account for the 
failure to obtain a significant differentiation 
in the grammatical-form variation in the 
conditions of favorable attitude. The condi- 
tions of unfavorable attitude show a mark- 
edly weaker impact of the persuasive com- 
munication, with the statement version as 
particularly less effective. 


Resistance to Counterpersuasion 


The change brought about by the counter- 
communication was highly consistent through- 
out all experimental conditions. The analysis 
of variance performed on the two prison terms 
as repeated measures yielded a significant 
change, averaging 2.5 years, toward longer 
sentences (F = 81.1, df = 1/84, p< .001). 
This change did not interact with the experi- 
mental variables to any appreciable degree 
(all interactions yielding F < 1). In order to 
test more directly for differential effects in 
the amount of change brought about by 
counterpersuasion, an analysis of variance was 
performed on the change scores, defined as 
the increase in the length of prison terms 
from the first to the second recommendation. 
The experimental variables, grammatical 
form and initial attitude, and their inter- 
action failed to produce notable effects (all 
being associated with F < 1). The observed 
negligible differentiation between the means 
of difference scores must be considered trivial. 
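
The change score used in this analysis is the second recommendation minus the first, so a positive value means a longer sentence after the prosecution speech. A minimal sketch follows. The five pairs of recommendations are hypothetical values chosen for illustration only and are not data from the study, and the function name is likewise an assumption.

```python
from statistics import mean


def change_scores(first, second):
    """Increase in recommended prison term (years) from first to second rating."""
    if len(first) != len(second):
        raise ValueError("each subject needs both recommendations")
    return [after - before for before, after in zip(first, second)]


# Hypothetical recommendations in years for five subjects (not study data).
first_recommendation = [2.0, 0.5, 4.0, 6.5, 3.0]
second_recommendation = [4.5, 3.0, 7.0, 8.5, 5.5]

changes = change_scores(first_recommendation, second_recommendation)
print(changes)
print(mean(changes))
```

An analysis of variance on such scores, with grammatical form and initial attitude as factors, tests whether the shift toward longer sentences differs between conditions.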

Given the significant differentiation between 
experimental conditions as a function of the 
critical persuasive communication, on the one 
hand, and the homogeneous effect of the 
countercommunication throughout all condi- 
tions, on the other, it may be expected that 
the residual impact of the manipulated persuasive 
communication, that is, the impact 
remaining after counterpersuasion, would ex- 
hibit, generally speaking, the significant dif- 
ferentiation found prior to counterpersuasion. 
This expectation was confirmed in analyses 
performed on the second prison-term recom- 
mendations. 


DISCUSSION 


The rationale that agreement questions be- 
come associated with assertions characterized 
by relatively high degrees of subjective sig- 
nificance and certainty, and consequently 
“mark” relatively powerful arguments, best 
accounts for the findings obtained. The only 
inconsistency occurs under conditions of ini- 
tially neutral attitude. This inconsistency, 
however, may be explained by the relatively 
uninformed communicatee’s motivation to at- 
tend carefully to the content of the message 
in order to fulfill his assignment. In the ab- 
sence of conflicting information, the argu- 
ments are acceptable, even when given in the 
potentially weaker statement form. Positive 
connotations due to the greater strength asso- 
ciated with arguments presented in the form 
of agreement questions cannot manifest them- 
selves under these conditions. Given positive 
affect, there also exists the readiness to accept 
consistent assertions. The presentation of 
somewhat redundant assertions in the form 
of agreement questions, under such circum- 
stances, may be more persuasive, not by being 
more readily acceptable, but rather by serv- 
ing more pronouncedly as a reassurance for 
the attitudinal position taken. This reasoning 
on the significance-marking function of the 
agreement question suggests its persuasive ef- 
fect to be most pronounced under conditions 
of initially opposed attitude. Attitudes slightly 
polarized against the position advocated may 
be considered most vulnerable to the implica- 
tion of significance and certainty of argu- 
ments, and the individual holding such a 
position should be swayed to adopt suggested 
opinions more readily than individuals initially 
holding other attitudinal positions. It 
should be kept in mind that the present 
findings were obtained under conditions of 
only moderately opposed initial attitude. The 
persuasive effect of rhetorically elicited agree- 
ment responses under conditions of extremely 
opposed initial attitude remains to be empiri- 
cally explored before more general statements 
can be made. 

In contrast, the results obtained do not bear 
out the expectations derived from considera- 
tions of cognitive involvement. Under condi- 
tions of initially unfavorable attitude, the 
outcome is clearly counter to predictions. Also, 
the negligible effect under conditions of initi- 
ally neutral attitude is inconsistent with ex- 
pectations. The failure to demonstrate reliable 
differences in support of the facilitation of 
resistance to counterpersuasion further calls 
this explanation into question. 

With respect to the third rationale pro- 
posed, the results in no way support the 
argument that the use of agreement questions 
yields an awareness of persuasive intent, 
which, in turn, activates defensive decoding 
behavior. On the contrary, it may be argued, 
consistent with the findings reported, that the 
rhetorical use of questions creates the impres- 
sion of minimal intent to persuade. The com- 
municator does not simply state assertions 
to be adopted, but apparently, by asking 
questions, “leaves it to the people to decide.” 
As a direct consequence, the communicatee 
may feel secure and lower his defenses, be- 
coming more susceptible to persuasion. In 
fact, this reasoning constitutes an alternative 
explanation of the findings. 

Both suggested mechanisms, the signifi- 
cance-marking function of agreement ques- 
tions and the lowering of defenses due to the 
perception of minimal intent to persuade on 
the part of the source, adequately account 
for the results. In this sense, the explanations 
are alternative. This does not mean, however,
that the two suggested mechanisms need be 
mutually exclusive. It is conceivable that both 
operate simultaneously and relatively indepen- 
dently of one another. Most recently, under 
experimental conditions extremely similar to 
those of the present investigation, source per- 
ception as a function of grammatical variation 
has been implicated as a mediating factor 
(Zillmann & Cantor, 1970). A speaker using
questions to rhetorically elicit concession (the 
yielding to the speaker’s argumentation for
lack of immediately available effective coun- 
terarguments) was perceived as exhibiting 
more pronounced intent to persuade. His audi- 
ence experienced him as exerting greater 
pressure to change particular positions held— 
greater than in a control condition involving 
the identical argumentation, but without this 
rhetorical technique. Under conditions of ini- 
tially opposed attitude, the impact of a per- 
suasive communication was found to decrease 
as rhetorical means perceived as “pressuring” 
were involved, and the results may be viewed 
as consistent with expectations derived from 
reactance theory (Brehm, 1966). However, 
since the assumption made in the post hoc 
explanation of the present study is that of 
less rather than more perceived persuasive 
intent associated with the rhetorical elicita- 
tion of agreement, a choice between rationales 
in favor of source mediation would be pre- 
mature. It appears that the direct empirical 
assessment of source perception as affected by 
the rhetorical elicitation of agreement re- 
sponses, and, equally important, the percep- 
tion of the degree of significance of arguments 
conveyed in the form of rhetorical agreement 
questions versus their expression in statement 
form, is needed to aid in making a choice 
between the alternatives, or to help deter- 
mine the degree to which both mechanisms 
contribute to the persuasive effects observed. 


ATTRIBUTIONAL DETERMINANTS OF ACHIEVEMENT-RELATED BEHAVIOR 1

Three experiments are reported which relate achievement motivation to causal
attribution. Experiment I demonstrated that subjects who differ in achievement
level also differ in their manner of accounting for the causes of their outcome 
at a task. High achievers more frequently attributed their successes or failures 
to the degree of effort expended than did either intermediate or low achievers, 
while intermediate achievers ascribed outcomes to luck more often than either 
of the extreme achievement groups. Experiments II and III investigated the 
hypothesis that disparate causal attributions are the determinants of the 
divergent performances of high and low achievers. The attributions typical of 
high and low achievers were induced in two experimental groups, with the 
prediction that these groups would diverge behaviorally in the same way as 
do uninstructed high and low achievers. In both experiments, high-achieving 
subjects performed as predicted, while low-achieving subjects failed to be 
differentially affected by the two attributional instructions. Possible explanations
for this pattern of results are discussed. 


The experiments reported in this study 
continue the investigations undertaken by 
Weiner and Kukla (1970) of the relationship 
between achievement motivation and the at- 
tribution of causality. In particular, this study 
attempts in its two sections (a) to specify 
more exactly the differences in causal attribu- 
tion which characterize individuals who differ 
in achievement motivation and (b) to in- 
vestigate further the hypothesis that these 
differential attributions constitute “the cogni- 
tive links mediating between the level of re- 
sultant achievement needs and the under- 
taking of achievement actions [Weiner & 
Kukla, 1970, p. 9].” 


1 These investigations were initially a portion of
a doctoral dissertation submitted to the University
of California, Los Angeles (1970). The experiments
were financed by the University of California, Los
Angeles, Head Start Evaluation and Research Office,
Carolyn Stern, Director. This funding was provided
by the Office of Economic Opportunity, Project 4117.

2 Requests for reprints should be sent to Andy
Kukla, who is now at the Department of Psychology,
Scarborough College, University of Toronto, West
Hill, Ontario, Canada.


ATTRIBUTION AS A CORRELATE OF
ACHIEVEMENT MOTIVATION


Weiner and Kukla attempted to verify a
conjecture of Rotter’s (1966), that high
achievers attribute the causes of success or
failure internally (e.g., to their own skill or
lack of skill), whereas low achievers consider
their outcomes to be externally determined
(e.g., by chance). The evidence showed
that, although the personality dispositions of
internality-externality and achievement motivation
are definitely related, this relationship
is unlikely to be the simple one-to-one cor- 
respondence hypothesized by Rotter. Weiner 
and Kukla suggested that a more adequate 
statement of the relationship between achieve- 
ment needs and causal ascription might be 
made in terms of the two-dimensional analysis 
of causality offered by Heider (1958). Heider 
distinguished between stable and variable 
causal attributes as well as between internal 
and external attributes, conceiving of these 
two dimensions as orthogonal. In this analysis, 
an outcome which is attributed internally can 
more specifically be ascribed either to rela- 
tively stable properties of the self (the per- 
son’s ability) or to relatively transient proper- 
ties of the self which may or may not obtain 
on a subsequent trial (the amount of effort 
expended). Similarly, an external attribution 
may refer either to relatively stable properties 
of the environment (the difficulty of the task) 
or to fluctuating environmental characteristics 
(luck). 

The following experiment attempts to determine
whether persons who differ in achievement
needs also differ in their disposition to
ascribe outcomes to any of the four Heiderian 
attributional variables of ability, effort, 
difficulty, or luck. 


EXPERIMENT I
Method 


The subjects were 138 male University of Cali- 
fornia, Los Angeles, undergraduates, some of whom 
were paid for participating in the experiment and 
some of whom were volunteers from introductory 
psychology classes. The study was conducted with 
groups ranging in size from 8 to 16. 

Subjects were first given the Revised and Condensed
Achievement Scale for Males (Mehrabian, 1969). This
test is a self-report inventory whose items are
constructed in conformance with Atkinson’s (1957, 1964)
conception of achievement motivation as the resultant
of the motive to succeed (M_s) and the motive to avoid
failure (M_f). The score obtained on this scale is
theoretically a measure of the resultant achievement
tendency (M_s − M_f). Mehrabian (1969) has presented
the validating evidence that this test correlates
positively with standard measures of M_s and negatively
with standard measures of M_f.

Each subject then engaged in a digit-guessing task, 
which was introduced by the following instructions: 



I have in front of me a list of 50 numbers,
either 0 or 1, in an order which is unknown to
you. Your task is to guess whether the next number
on my list is either 0 or 1. You will write down
your guess on the answer sheet which I have
passed out, and then I will tell you what the
number actually was. If your guess is correct, place
a check on the line next to it. You will then be
asked to make your next guess, and so on until
all 50 guesses have been completed.

Now this is a test of your synthetic as opposed
to your analytic ability. By this we mean that
there is no one definite pattern, like 010101, that
you could easily detect and get all the answers
correct from then on. But the list also is not
random. Instead, there are certain general trends
and tendencies in the list—perhaps a greater frequency
of one kind of pattern over another. To
the extent that you can become sensitive to those
tendencies, you can make your score come out
consistently above chance. Of course, your score
also will be heavily influenced by luck. Even if you
learn just exactly as much about the patterns as
we expect, you could get a much higher total score
just by being lucky in your guessing. Similarly,
your score could be much lower just because of
bad luck. To get a really accurate idea of where
you stood, you would have to take the test a
number of times so that the good and bad luck
would average out.


The list of zeros and ones read to the subjects 
was randomly constructed, so that the outcome was 
determined solely by chance. However, the instructions
created an ambiguous situation which allowed
performance to be perceived as attributable to any 
combination of ability, effort, luck, and task diffi- 
culty. Subjects were allowed 15 seconds to make 
each guess, with the correct answer read after 
each trial. 
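
Because the list was randomly constructed, each guess had a probability of .5 of being correct, so the total score over the 50 trials would be expected to follow a binomial distribution. The following minimal Python sketch is offered only as an illustration of that point and is not part of the original procedure; the function names and the threshold of 30 are assumptions introduced here.

```python
import math

N_TRIALS = 50      # number of guesses in the task (from the Method section)
P_CORRECT = 0.5    # chance of a correct guess on a random 0/1 list


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k correct guesses out of n."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def chance_score_summary(n: int = N_TRIALS, p: float = P_CORRECT):
    """Return (mean, standard deviation) of the score under pure chance."""
    mean = n * p
    sd = math.sqrt(n * p * (1 - p))
    return mean, sd


def prob_at_least(threshold: int, n: int = N_TRIALS, p: float = P_CORRECT) -> float:
    """Probability of scoring `threshold` or more by chance alone."""
    return sum(binomial_pmf(k, n, p) for k in range(threshold, n + 1))


if __name__ == "__main__":
    mean, sd = chance_score_summary()
    print(f"chance mean = {mean:.1f}, sd = {sd:.2f}")
    # The threshold of 30 is an arbitrary illustrative value, not from the study.
    print(f"P(score >= 30) = {prob_at_least(30):.3f}")
```

Under these assumptions the chance distribution is centered on 25 correct guesses, which is the baseline against which subjects judged their own outcome as a success or a failure.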

Upon completion of the task, the subjects added 
up their total number of correct guesses. They were 
then instructed to answer a written questionnaire, 
consisting of the following five questions, each of 
which was to be answered by placing an X on a 
10-point Likert-type scale anchored at both extremes 
and at the midpoint: (a) Ability—“How good are
you potentially at this kind of task?” (1 = extremely
low ability, 10 = extremely high ability). (b) Effort—“How
hard did you try to succeed at this
task?” (1 = extremely low effort, 10 = extremely high
effort). (c) Difficulty—“How difficult do you think
this task is?” (1 = extremely difficult, 10 = extremely
easy). (d) Luck—“Try to evaluate how lucky you
were in your guessing.” (1 = extremely unlucky,
10 = extremely lucky). (e) Outcome—“Would you
evaluate your score as a success or as a failure?”
(1 = extreme failure, 10 = extreme success).

Three different random orders of these five ques- 
tions were employed. Subjects were divided into high, 
intermediate, and low achievement groups. The high 
achievement group comprised the highest one-third 
of all subjects in achievement scores, while the low 
achievement group contained the lowest third of the 
distribution of achievement scores. 


Results 


Table 1 gives the correlation coefficients
between reported outcome (Question e of the
questionnaire) and each of the four attributional
variables. All of the motive groups
are seen to covary their estimation of ability
with their outcome. However, only the high
achievement group varied its effort rating
with outcome, while only the intermediate
achievement group varied its estimation of
luck systematically with outcome. Subjects in
the low achievement group showed no consistent
outcome correlates outside of ability.
None of the groups in this experiment exhibited
a significant difficulty-outcome correlation.
The effort-outcome correlation for the
high achievement group was significantly
greater than that for either the intermediate
or the low achievement groups (p < .01);
and the luck-outcome correlation for the
intermediate achievement group was significantly
greater than that for either of the
extreme achievement groups (p < .01).


TABLE 1
CORRELATIONS BETWEEN SUBJECTIVE OUTCOME AND
ATTRIBUTIONAL VARIABLES FOR HIGH, INTERMEDIATE,
AND LOW ACHIEVEMENT GROUPS

                               Achievement group
Attributional     High           Intermediate    Low
variable          (n = 46)       (n = 44)        (n = 48)

Ability           .46*           .43*            .42*
Effort            .44*           .00             -.08
Luck              .13            .58*            .09
Difficulty        [illegible]    .18             .08

* p < .01.

All significance levels remained unchanged 
when correlations were calculated between
attributional variables and objective outcome 
(number of correct guesses) rather than sub- 
jective outcome (the correlation between 
objective and subjective outcome was .44). 
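
The text does not state which test was used to compare correlations across achievement groups. One standard choice for correlations from independent samples is Fisher's r-to-z transformation, sketched below in Python as an illustration only. The function names are hypothetical, and because the original test is unknown, the values this sketch yields need not reproduce the significance levels reported above.

```python
import math


def fisher_z(r: float) -> float:
    """Fisher r-to-z transformation of a correlation coefficient."""
    return math.atanh(r)


def compare_independent_correlations(r1: float, n1: int, r2: float, n2: int):
    """Return (z statistic, two-tailed p) for the difference r1 - r2.

    Assumes the two correlations come from independent samples.
    """
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (fisher_z(r1) - fisher_z(r2)) / se
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_two_tailed


if __name__ == "__main__":
    # Effort-outcome correlations and group sizes as printed in Table 1
    # (high achievement group versus intermediate achievement group).
    z, p = compare_independent_correlations(0.44, 46, 0.00, 44)
    print(f"high vs. intermediate: z = {z:.2f}, two-tailed p = {p:.3f}")
```

The same function can be applied to the luck-outcome correlations by substituting the corresponding entries of Table 1.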

Finally, the mean self-ratings of the three 
achievement groups were essentially identical 
for all items of the questionnaire except the 
ability scale. The average ability rating of 
the high achievement group was 6.58, as compared
with 5.89 for the intermediate achievement
group (t = 1.90, df = 88, p < .10) and
5.78 for the low achievement group (t = 2.38,
df = 92, p < .05).


Discussion 


Table 1 enables us to determine how each 
group attributed its outcome on the experi- 
mental task. A high correlation between out- 
come and the attributional variable of, say, 
ability indicates that the group in question 
perceived itself as relatively more able when 
its outcome was high than when it was low. 
This is taken to mean that the ability dimension
was utilized by this group to account for
whether it succeeded or failed, that is, that
outcome was attributed at least in part to 
ability. Thus, the data of Table 1 indicate 
that high achievers attributed their outcome 
to a combination of ability and effort; that 
intermediate achievers considered ability and 
luck to be determinants of their outcome; and 
that low achievers expected only the ability 
dimension to influence success or failure. 

However, only the differences in attribu- 
tion found among achievement groups can be 
assumed to reveal their attributional dispo- 
sitions. For even though the experimental task 
was intended to be causally ambiguous, it is
likely that its structure and accompanying 
instructions suggested or discouraged some 
attributions to all subjects independently of 
their attributional tendencies. The data thus 
permit only the conclusion that high achievers 
consider effort to be a more salient determi- 
nant of outcome than do intermediate or low 
achievers, and that intermediate achievers 
have a greater tendency to ascribe outcome 
to luck than do the extreme achievement 
groups. 

The fact that intermediate achievers used 
the luck dimension to distinguish between 
success and failure, while neither of the ex- 
treme achievement groups did, was particu- 
larly surprising. It seems that contrary to 
Rotter’s (1966) conjecture, it is this group 
rather than the low achievement group which 
must be considered external. The usefulness 
of the Heiderian analysis of causality is dem- 
onstrated by the fact that Rotter’s concepts 
of internality and externality turn out to be
incapable of distinguishing between the cognitive
dispositions of the extreme achievement
groups in this experiment.

The finding that high achievers rate their
abilities at an ambiguous task absolutely 
higher than either of the other two achieve- 
ment groups may be taken as a confirmation 
of a prior conjecture of McClelland’s (1961). 
He proposed that high achievers, having gen- 
erally been successful in their past achieve- 
ment-related activities, approach new and un- 
familiar tasks with generalized overconfidence. 
The ability difference found here is also con- 
sistent with evidence presented by Atkinson, 
Bastian, Earl, and Litwin (1960), indicating 
that high achievers exceeded low achievers 
in initial estimates of how well they expected 
to do at a novel task. 

Finally, it must be recalled that the cognitive
dispositions discovered here are correlates
of resultant achievement motivation, that
is, of (M_s − M_f). If, as Atkinson maintains,
M_s and M_f are relatively independent systems,
each ought to have its own cognitive
correlates which, when combined, yield
the attributional dispositions found in this
experiment.


ATTRIBUTION AS A DETERMINANT OF
ACHIEVEMENT-RELATED BEHAVIOR


Current conceptions of achievement moti- 
vation have for the most part explicated 
achievement-related behavior in terms of the 
subjects’ affective reactions to task situations 
(Atkinson, 1957, 1964; McClelland, Atkin- 
son, Clark, & Lowell, 1953; Weiner, 1970). 
Persons high in achievement motivation have 
been conceptualized as those subjects who 
experience considerable pride in success and 
relatively little shame at failure, whereas low 
achievers are thought to derive relatively little 
pride from success and great shame from 
failure. It is assumed that precisely these 
differences in affective dispositions account 
for the divergent performances of high and 
low achievement groups (Atkinson, 1964).

However, the existence of systematic attributional
differences among achievement
groups suggests that disparate cognitions of 
the task situation may also play a role in 
the determination of achievement-related be- 
havior. Weiner and Kukla (1970) hypothe- 
sized that the affective reactions to success 
and failure which are usually cited as expla- 
nations of achieving behavior are themselves 
the result of the ways in which subjects 
construe the causes of their outcomes. Thus, 
on this view, the behavioral differences be- 
tween high and low achievers are ultimately 
due to divergent cognitions about causality. 
Weiner and Kukla presented intuitively rea- 
sonable derivations of the main behavioral 
characteristics of high and low achievers from 
these groups’ attributional characteristics
(which were then not as well-known). The
two experiments reported below attempt directly
to verify this hypothesized causal link
between cognition and behavior. 

In principle, this cognitive hypothesis is 
readily testable: if attribution determines be- 
havior, then a change in attribution will 
result in a corresponding change in behavior. 
Thus, if a high achiever acts the way he 
does because of the manner in which he 
typically attributes causality, then any opera- 
tion which increases the likelihood of such an 
attribution will also increase the likelihood 
of the behavior known to be characteristic 
of high achievers. More specifically, it would
be expected that any set of conditions which 
favor the attribution of outcome to effort 
will result in high-achiever-like behavior; conversely,
conditions which discourage the perception
that effort determines outcome will
lead instead to low-achiever-like behavior.

Thus, the indicated experimental procedure 
is to compare the performance of subjects 
who are led to believe that their outcome will 
be determined by effort with that of subjects 
who are instructed that effort will not influ- 
ence their outcome. However, the negative 
instruction not to employ the effort variable, 
by leaving open to the subject an indefinite 
number of ways in which to construe the 
task, is relatively poor in informational con- 
tent. Thus, any behavioral differences be- 
tween an effort-instructed group and a no- 
effort-instructed group could be due to the 
disparate amounts of information each group 
receives, rather than to the specific effects of 
effort attribution. For this reason, it was 
decided to instruct one group that both effort 
and ability would determine outcome, and 
to tell a second group that only ability would 
influence performance. While both these in- 
structions purport to characterize the task 
completely, the attributional difference be- 
tween them explicitly corresponds to that 
obtaining between high and low achievement 
groups. The present hypothesis consequently 
leads to the prediction that these groups will 
diverge behaviorally in the same way that 
high and low achievers do. 


EXPERIMENT II


In this experiment, the cognitive hypothe- 
sis was tested for one of the fundamental 
behavioral parameters which distinguish high 
from low achievers. It is known that in an 
achievement situation, high achievers per- 
form better at a task than low achievers 
(Atkinson & Reitman, 1956; Lowell, 1952). 
It was then predicted that an ability—effort- 
instructed group would perform better than 
an ability-instructed group. 


Method 


Ninety-one University of California, Los Angeles, 
undergraduate volunteers served as subjects in groups 
ranging in size from 11 to 20. All subjects were
first administered the Revised and Condensed
Achievement Scale for Males (Mehrabian, 1969). The
subjects then attempted to unscramble 20 anagrams
of animal names (e.g., DEKNOY = DONKEY). Their
score was the number of correct unscramblings
obtained in 5 minutes.


TABLE 2
MEAN NUMBER OF CORRECTLY UNSCRAMBLED ANAGRAMS,
ACCORDING TO ACHIEVEMENT LEVEL
AND ATTRIBUTIONAL INSTRUCTIONS

                         Achievement group
Instructional group      High        Low

Group A                  10.71       10.71
  n                      28          17
Group AE                 13.94       9.93
  n                      17          29

One group of subjects, Group A, was read the 
following instructions before beginning to work on 
the task: 


The reason we're interested in this task is that 
we've found it to be a very pure measure of 
ability to reorganize material into new patterns. 
It's pure, in the sense that it’s relatively unaffected 
by effort. Some people just seem to have the 
ability to have the correct word leap up at them, 
while others don’t. And, within the time limit
set, whether a person works very hard or takes 
it easy makes little difference in his score. This 
makes the task especially well-suited to studying 
the relation between personality factors and ability 
factors. 


The intent of these instructions is to induce 
attributions solely to ability in this group. 
Group AE (ability-effort) was instead instructed
as follows:


The reason we’re interested in this task is that 
we’ve found that success in such reorganization
of material into new patterns is heavily influenced 
by the amount of effort a person puts into the 
task, that is, the motivation he has to do well. 
Of course, there are differences in ability too; but 
even people with high ability for this kind of task 
do rather poorly if they do not give their full 
attention to searching for words, while people with 
somewhat lower ability can do quite well if they 
just search hard enough. For this reason, it’s 
possible to use this task to determine which 
conditions lead to greater motivation among 
subjects. 


Group AE’s instructions thus encourage ascription
of outcome to effort as well as ability. 

All subjects were further divided at the median 
into high achieving and low achieving groups, on 
the basis of their score on the achievement scale. 


ANDY KUKLA 


Results 


The average number of correct un- 
scramblings for all subjects in Group A (abil- 
ity) was 10.71, while Group AE obtained an 
average of 11.41 items correct. Although the 
difference between these two means is in the 
hypothesized direction, its magnitude does not 
approach statistical significance (t < 1).
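
The overall group means reported above can be recovered from the cell means and cell sizes printed in Table 2 by weighting each cell mean by its n. The short Python sketch below illustrates the arithmetic; the function and constant names are introduced here for illustration and do not come from the original analysis.

```python
def pooled_mean(cells):
    """Weighted mean of cell means; `cells` is a list of (mean, n) pairs."""
    total_n = sum(n for _, n in cells)
    return sum(mean * n for mean, n in cells) / total_n


# Cell means and sizes as printed in Table 2:
# (mean, n) for high achievers, then for low achievers.
GROUP_A = [(10.71, 28), (10.71, 17)]
GROUP_AE = [(13.94, 17), (9.93, 29)]

if __name__ == "__main__":
    print(f"Group A:  {pooled_mean(GROUP_A):.2f}")
    print(f"Group AE: {pooled_mean(GROUP_AE):.2f}")
```

With the tabled values this gives 10.71 for Group A and 11.41 for Group AE, in agreement with the means stated in the text, and the four cell sizes sum to the 91 subjects described in the Method section.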

Table 2 presents the mean scores on the
anagram task with subjects classified according
to achievement level as well as attributional
set. The data indicate that level of
performance increased both with level of
achievement motivation (F = 7.87, df = 1/87,
p < .01) and when the task was introduced
with ability-effort-orienting instructions (F =
2.93, df = 1/87, p < .10), although the latter
main effect only approaches significance. Of
considerable interest is the presence of a
significant Achievement × Orientation interaction
(F = 7.81, df = 1/87, p < .01). The
high achiever’s customary superiority of performance
over low achievers was evidenced
only under the ability-effort orientation (t =
3.89, df = 44, p < .001); both achievement
groups performed identically under the pure
ability orientation (t < 1). In addition, while
low-achieving subjects obtained essentially
the same average score under both attributional
orientations (t < 1), high achievers
solved more anagrams when they were initially
told that effort as well as ability determines
outcome (t = 3.78, df = 43, p < .001).


Discussion 


Contrary to expectation, the attributional 
instructions affected only the high-achieving 
subjects in the manner predicted by the 
cognitive hypothesis. The theoretical import 
of this finding will be dealt with in the 
Discussion section of the closely related 
Experiment III.

Independently of any more general con- 
siderations, however, the data point again to 
the fact that the internality—externality di- 
mension studied by Rotter and his colleagues 
confounds important attributional distinc- 
tions. Both instructional sets used in Experi- 
ment II suggest internal attributions; yet, 
they lead to different behavioral consequences 
depending upon whether the subjects are led 
to ascribe outcome to ability and effort or
to ability alone. 


EXPERIMENT III 


The failure of the low achievers to react 
differentially to the attributional instructions 
in Experiment II could have been due to a 
greater resistance on their part to changing 
their attributional accounts of task situations. 
Such a resistance would presumably be over- 
come if the instructions were made more 
compelling. It was thought that the use of 
a task whose causal determinants appeared 
highly ambiguous would force all of the sub- 
jects to rely more on the instructions in 
determining their causal attributions, and 
that this would result in the low achievers’ 
acting in the hypothesized manner. The 
following experiment employs such an am- 
biguous task to investigate the behavioral 
differences between ability-effort and ability- 
oriented groups. 

A different behavioral parameter was also 
studied. One of the most striking behav- 
ioral differences distinguishing high from low 
achievers concerns their relative preferences 
for risk taking. It is generally found that 
given a choice, high achievers prefer inter- 
mediate risk situations more frequently than 
low achievers, while the latter choose either 
extremely high- or extremely low-risk situa- 
tions more frequently than high achievers 
(Atkinson et al., 1960; Atkinson & Litwin, 
1960; McClelland, 1958). The cognitive hy- 
pothesis consequently predicts that intermedi- 
ate risks will be more often preferred under 
conditions which favor the high achiever’s 
typical attributional pattern. Specifically, it 
was predicted that subjects given a pure 
ability orientation would exhibit less inter- 
mediate-risk preference than subjects who 
are told that effort as well as ability would 
determine their outcome. 


Method 


Forty-eight subjects, all male University of California,
Los Angeles, undergraduate volunteers, participated
in the experiment in groups ranging in size
from 6 to 20. As in the previous two experiments,
the subjects were first administered the Revised and
Condensed Achievement Scale for Males (Mehrabian,
1969). The subjects were then presented with the
task of predicting each of a series of 20 digits. They
were given the option of making any number of
guesses they liked on each trial. Thus, they could
try to get the correct answer the “hard way” by
guessing only a few digits, the “easy way” by guessing
many digits, or they could choose to undertake
an intermediate challenge by guessing neither very
many nor very few digits. The instructions were as
follows:


I will have in front of me a list of 20 numbers, 
each one a digit between 0 and 9. Your task
basically is to guess which digit is next on my list. 
On each turn, you will be allowed to write down 
any number of guesses you want on the answer 
sheet which I have passed out. When I call out 
the correct answer, you will write that answer 
down on the space provided next to your guesses. 
If your guesses include the correct answer, you 
will place a check on the line next to the correct 
answer, so that you can keep track of your score. 
Then you will make your next set of guesses, 
and so on. 

Now the list of digits I will use is neither 
arbitrary, nor does it have a definite pattern like 
2468024680 that you could detect and so get all 
correct from then on. Instead, the list is con- 
structed according to certain general rules so that 
some numbers have a greater tendency to occur at 
some times than at others. Thus, if you are able 
to become sensitive to these tendencies, you can 
make your score come out consistently above 
chance. But you still would not be able to get 
them all right. 


The list of digits read to the subjects was randomly 
constructed, so that outcome was determined solely 
by chance. However, the instructions created an 
ambiguous situation with respect to the task’s causal 
determinants. The information supplied to relieve 
this ambiguity was different for two experimental 
groups. Group A was told: 


We've found on prior tests that the amount of 
ability a person has for this kind of task is the 
most important determinant of how well he does, 
while the degree of effort put into searching for 
patterns makes relatively little difference in the 
final score. Some people just seem to be good at 
the game, while others are not. 


Group AE was read the following instead: 


We’ve found on prior tests that the amount of 
effort a person puts into trying out various pat- 
terns and looking over the previous series of 
correct answers is the most important determinant 
of how well he does. Of course, there are dif- 
ferences in basic ability for this kind of task. But 
even people with high ability do rather poorly if 
they do not give their full attention to the game, 
while people with somewhat lower ability can do 
quite well if they just try hard enough. Effort then 
is crucial.


TABLE 3
MEAN NUMBER OF INTERMEDIATE-RISK GUESSES
CHOSEN, ACCORDING TO ACHIEVEMENT LEVEL
AND ATTRIBUTIONAL INSTRUCTIONS

                         Achievement group
Instructional group      High        Low

Group A                  6.58        6.70
  n                      12          10
Group AE                 14.10       7.07
  n                      10          14


Thus, Group A was led to view the task as purely 
ability determined, while Group AE was told that 
effort as well as ability would determine their 
outcome. 

After these instructions, but before the actual trials 
began, the subjects were asked to predict what guess- 
ing strategy they would use, that is, how many times 
out of 20 they would guess just one number, how 
many times they would guess two numbers, etc. It 
was emphasized that this estimate in no way com- 
mitted them to any particular performance once the 
game was under way. 

The subjects then proceeded to record their 20 
guesses. They were allowed 30 seconds to make each 
guess, after which the correct answer was read. For
subsequent analysis, an intermediate-risk guess was 
considered to be a guess of 4, 5, or 6 digits, while 
a guess of 1, 2, 3, 7, 8, or 9 digits was taken to be 
an extreme-risk guess. 
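
The scoring rule just described can be expressed compactly. The Python sketch below is an illustration under stated assumptions: it treats the digits guessed on a trial as distinct, so that on a randomly constructed list the chance of including the correct digit is the number guessed divided by 10, and it applies the 4-to-6 versus 1-to-3 or 7-to-9 classification from the text. The function names are hypothetical, and a guess of all 10 digits is left unclassified because the text does not mention it.

```python
from collections import Counter

N_DIGITS = 10  # the digits 0 through 9


def chance_of_success(n_guessed: int) -> float:
    """Chance that n distinct guessed digits include the correct one,
    assuming a randomly constructed list."""
    if not 1 <= n_guessed <= N_DIGITS:
        raise ValueError("number of guessed digits must be between 1 and 10")
    return n_guessed / N_DIGITS


def classify_guess(n_guessed: int) -> str:
    """Scoring rule from the text: 4-6 digits is an intermediate-risk guess,
    1-3 or 7-9 digits is an extreme-risk guess."""
    if n_guessed in (4, 5, 6):
        return "intermediate"
    if n_guessed in (1, 2, 3, 7, 8, 9):
        return "extreme"
    raise ValueError("the text does not classify guesses of this size")


def count_intermediate(guess_sizes) -> int:
    """Number of intermediate-risk guesses in a subject's record of trials."""
    return Counter(classify_guess(n) for n in guess_sizes)["intermediate"]
```

A subject's dependent measure is then `count_intermediate` applied to the 20 guess sizes recorded on the answer sheet, giving a score between 0 and 20.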

As in the previous experiment, all subjects were 
divided at the median into high-achieving and low- 
achieving groups. 


Results 


The subjects’ actual risk preferences and 
their predicted risk preferences followed es- 
sentially the same pattern, although signifi- 
cance levels tended to be attenuated in the 
latter data analysis. Only the actual risk 
preference data are discussed below. 

Two subjects did not make the full 20 
guesses while engaged in the task. The data 
for these subjects were eliminated in the fol- 
lowing analysis. 

The mean number of intermediate-risk 
guesses for all subjects in Group A was 6.64, 
whereas Group AE chose an average of 
10.00 intermediate-risk guesses out of 20 
trials. The difference between these two means 
approaches statistical significance in the
hypothesized direction (t = 1.96, df = 44,
p < .10).
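
As a rough consistency check, the two group means quoted above can be recovered from the cell means and cell sizes in Table 3. The sketch below is an illustration only; the dictionary layout and function name are assumptions, and because the cell means are rounded to two decimals the result agrees with the reported values only to within rounding.

```python
# Cell means and cell sizes as printed in Table 3: (mean, n).
TABLE_3 = {
    "A":  {"High": (6.58, 12), "Low": (6.70, 10)},
    "AE": {"High": (14.10, 10), "Low": (7.07, 14)},
}

def group_summary(group):
    """Return (total n, n-weighted mean) for one instructional group."""
    cells = list(TABLE_3[group].values())
    n_total = sum(n for _, n in cells)
    weighted_mean = sum(m * n for m, n in cells) / n_total
    return n_total, weighted_mean

n_a, mean_a = group_summary("A")
n_ae, mean_ae = group_summary("AE")
df = n_a + n_ae - 2   # degrees of freedom for a two-group t test

print(n_a, round(mean_a, 2))
print(n_ae, round(mean_ae, 2))
print(df)
```

The degrees of freedom computed this way match the df = 44 reported for the comparison of the two instructional groups.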

When the subjects’ level of achievement
motivation is taken into account, the data
indicate that, as in the previous experiment,
the effects of the attributional instructions are
evident only for the high-achieving group (see
Table 3). Analysis of variance reveals that
high achievers on the whole chose more
intermediate-risk guesses than low achievers
(F = 4.13, df = 1/42, p < .05), and that
Group AE was superior in this respect to
Group A (F = 6.12, df = 1/42, p < .025).
Superimposed over both these main effects
was a significant interaction effect (F = 5.05,
df = 1/42, p < .05). The pattern of this
interaction completely parallels that found for
the data of Experiment II. Within Group AE,
high achievers chose more intermediate-risk
guesses than low achievers (t = 2.98, df = 22,
p < .01), whereas these two motive groups
did not differ within Group A (t < 1). Further,
high achievers chose more intermediate-risk
guesses in Group AE than in Group A
(t = 3.34, df = 20, p < .01), while low
achievers showed no such differential
performance between the two instructional
conditions (t < 1).

A comparison of high achievers in Group A 
with high achievers in Group AE reveals that 
the latter’s greater preference for intermediate 
risks is due for the most part to their lesser 
preference for high-risk tasks. High achievers 
in Group A made an average of 12.14 guesses 
of 1, 2, or 3 digits, whereas high achievers 
in Group AE averaged only 5.80 such guesses 
(t = 2.23, df = 22, p < .05). Relatively few
low-risk guesses were made by any subjects 
in this experiment. However, among the 10 
high achievers who made at least one low-risk 
guess of 7, 8, or 9 digits, 9 were in Group A 
and only 1 was in Group AE (χ² = 5.81,
df = 1, p < .02). There is thus some indica- 
tion that the high achievers of Group A had 
a greater preference for extremes at either 
end of the risk-taking continuum. The distribution
of high- and low-risk guesses is
essentially identical for the high achievers in 
Group A, the low achievers in Group A, and 
the low achievers in Group AE. 


Discussion 


In both Experiment II and Experiment III,
the behavior of high-achieving subjects conformed
to the prediction of the cognitive
hypothesis, while low-achieving subjects failed
to be differentially affected by the attributional
instructions. Some possible explanations
for this pattern of results are discussed
below.

It may of course simply not be the case 
that the motivationally significant differences 
between achievement groups are solely at- 
tributional. Contrary to the cognitive hy- 
pothesis, high and low achievers may have 
divergent affective reactions to the task even 
when their causal attributions are identical. 
The data of Experiments II and III could then
be explained by postulating that instructions 
which encourage effort attributions (as in 
Group AE) have the effect of “arousing” 
achievement needs in each subject according 
to his level of achievement motivation, 
whereas instructions which discourage effort 
attributions (as in Group A) eliminate the 
achievement concerns of all subjects. With
this assumption, one would indeed expect the 
behavioral differences between high and low 
achievers to be accentuated in Group AE and 
mitigated in Group A. There is, however, an 
empirical difficulty with this explanation. It 
is known that both high and low achievers 
perform differently when their achievement 
concerns are aroused from when they are de- 
emphasized. In particular, subjects low in 
anxiety (and, therefore, high in resultant 
achievement motivation) typically exhibit a 
better performance in aroused than in neutral 
conditions, whereas highly anxious subjects 
(who are low in resultant achievement motivation)
show a diminished performance in
aroused as compared to neutral conditions
(Mandler & Sarason, 1952; Sarason, Mandler,
& Craighill, 1952). Thus, although the affec- 
tive account of the effects of attributional 
instructions correctly predicts that high 
achievers will perform better in Group AE 
than in Group A of Experiment II, it also 
seems to entail the disconfirmed prediction 
that low achievers will do worse in Group AE 
than in Group A. 

An affective explanation of the present re- 
sults must therefore postulate a relatively com- 
plex relationship between attribution and affec- 
tive reaction. It is necessary to assume that al- 
though the instructions which encourage effort 
attribution enhance the appeal of success
and thus increase the performance of high 
achievers, they leave unchanged the negative 
affect associated with failure and thereby have 
little effect on the behavior of low achievers. 

It is also possible to devise an explanation 
of the data which is consistent with the cogni- 
tive hypothesis and which is at least not more 
arbitrary than the previous account. It may 
be that contrary to the experimenter’s inten- 
tions, the necessary condition that high and 
low achievers construed the task in the same 
manner was not obtained. Instead, the sub- 
jects’ own attributional dispositions may have 
interacted with the instructions to create dis- 
crepant interpretations of these instructions. 
The information that both ability and effort 
determine outcome is after all necessarily 
vague: it is up to the subject himself to 
decide what degree of relative importance to 
assign to each of these two attributional variables.
A high achiever, having already the
tendency to ascribe outcome to effort, will 
presumably lay emphasis on effort here too, 
and so can be expected under these conditions 
to view the task very much as if he had been 
uninstructed. Similarly, a low achiever, carry- 
ing with him the disposition to underempha- 
size the effort component of a task, may 
construe an ability-effort orientation in much
the same way that he interprets a pure ability
orientation. Thus, ability-effort instructions
allow the high and the low achiever to at- 
tribute causality each in his typical fashion. 
On the other hand, if any subject is told 
that the task is solely ability determined, 
then (if he believes the instructions) attribu- 
tion to effort is explicitly precluded, and the 
attributional scheme of the low achiever is 
forced on that subject. If this is the case, 
the cognitive hypothesis leads to the correct 
prediction that high and low achievers will 
diverge behaviorally in their customary way 
under an ability-effort orientation, whereas
both high and low achievers will behave rela- 
tively like low achievers under an ability 
orientation. 

A direct test of this explanation would 
entail a replication of the present experiments 
with the addition of a third instructional 
Group E, which is told that its outcome is 
due solely to effort. For the same reason
that a pure ability orientation is assumed to
force all subjects to disregard the effort variable
in accounting for outcome, a pure effort
orientation should force all subjects, including
low achievers, to utilize the effort variable. It
would then be predicted that both high and 
low achievers in Group E will act like 
uninstructed high achievers. 

DIMENSION OF TIME IN RELATION TO CHOICE
BEHAVIOR AND ACHIEVEMENT ORIENTATION ¹

Two groups of college students decided upon level of difficulty of items in
tests to be taken on the same day and after 4 weeks, presented in reverse
temporal order in the two groups. They also indicated their expectancy of
success and the valence of success and of failure in each of the two tests they
were to take. Expected temporal distance until the taking of the test resulted
in higher risk taking and higher expectancy of success in success-oriented
subjects (high need Achievement, low anxiety) but not in failure-oriented
subjects. Temporal distance did not affect significantly verbalized valence of
success or failure. Results are interpreted as indicating the operation of time
as a meaningful stimulus in the psychological field, and the perception of
time by achievement-oriented subjects as instrumental in helping to achieve
a better control of the environment, even when objective conditions do not
justify such expectancy.


1 This article is based on a dissertation submitted
to the [illegible] Development, under Grant
§T1-HD 114-02. The author would like to thank
Wilbur Haas for his help and guidance. Appreciation
is also due to Marion Blanc and Daniel Kahneman
for their helpful comments regarding this paper.

2 Requests for reprints should be sent to Mordecai
Nisan, School of Education, Hebrew University of
Jerusalem, Jerusalem, Israel.


Decisions involving risk-taking behavior
may be affected by temporal distance between
the making of the decision and its
execution. For example, a student’s choice
of a college may be different if he makes
his selection a short or a long time before
entrance. The present study was designed to
investigate the effects of length of expected
temporal distance (short or long) upon level
of risk taken by the individual in skill-dependent
tasks. In accord with valence-expectancy
models of choice behavior (e.g.,
Rotter, 1954), risk taking is seen as determined
by expectancy, that is, the subjective
probability that a given outcome will follow
the individual’s choice, and valence, that is,
the subjective value of that outcome for the
individual. In investigating the effects of
temporal distance upon risk taking, it is
necessary to consider what its effects are
on these two mediating factors, namely,
expectancy and valence.

Although some research implies that temporal
distance brings about a decrease in
expectancy (e.g., Mischel, 1966), this research
was concerned with the expectancy of occurrence
of events that are dependent upon an
external agent. The present research deals
with the effects of temporal distance upon
expectancy of success, where the outcome is
dependent on the individual (skill-dependent
tasks). Temporal distance should exert a
different effect on skill-dependent tasks,
as opposed to externally controlled behavior,
since often in skill-dependent tasks,
temporal distance allows the individual to prepare
himself better and acquire a better control
of the environment; temporal distance
thus increases his chances of success. It is
suggested that this effect of temporal distance
will be generalized to other skill-dependent
tasks, even where temporal distance does not
allow the individual to prepare better for the
task. It was thus hypothesized that expected
temporal distance will bring about a rise in
expectancy for success in tasks which are skill
dependent. In addition, this increased expectancy
of success should result in greater risk
taking, the latter being defined as an individual’s
choice of more difficult tasks (i.e.,
tasks with a low probability of success).

We turn now to the effects of temporal
distance upon valence. Using measures of
ratings and preferences, Mischel, Grusec, and
Masters (1969) found that rewards expected
after a lapse of time have lower subjective
value than immediately expected rewards. For
punishments, a reversed effect of temporal 
distance was found in adults; that is, delayed 
punishments were found to have higher nega- 
tive valence than immediately expected ones. 
These effects were presumably due to the 
negative valence of the delay period for either 
positive or negative outcomes. These findings
and interpretations are consistent with the 
results and theoretical considerations con- 
tained in related studies (e.g., Renner, 1967). 
Another effect of temporal distance upon 
valence, and a different hypothesis concerning 
valence of success, is suggested by the present 
reasoning which postulates that the individual 
perceives temporal distance as allowing him 
a better control over the environment. Longer 
temporal distance should then result in a 
higher valence of success and failure, since 
it will cause the individual to feel more 
responsible for the outcomes of his behavior 
(Feather, 1967). 
The preceding predictions are derived from 
an approach in which temporal distance is 
viewed as a meaningful stimulus within the 
“psychological field” of the individual. It is 
difficult to assess the meaning of temporal 
distance; however, the construct of success 
versus failure orientation (Atkinson & Feather, 
1966) supplies us with a useful classification 
of the psychological meaning of temporal 
distance. Need Achievement and time orienta- 
tion have been found to be related in that 
high need Achievement subjects have longer 
future time perspectives than low need 
Achievement subjects, who view it as “aim- 
less” (Heckhausen, 1967). Anxiety is also 
related to the perception of time; for an 
anxious person the future represents the fear- 
ful unknown (May, 1950), rather than an 
instrument which may bring success. There- 
fore, the contrast of success versus failure 
orientation provides a sharp distinction be- 
tween different perceptions of expected tempo- 
ral distance. Success-oriented subjects would 
perceive time as a useful instrument for the 
achievement of particular goals. Failure- 
oriented subjects, by contrast, would tend to 
avoid thinking about expected temporal dis- 
tance due to its anxiety-provoking nature. 
These differences in time perception suggest 
that the hypothesized effects of temporal distance
upon expectancy, risk taking, and
valence would be especially strong in success- 
oriented, as compared with failure-oriented, 
subjects. 

Temporal distance, as a relevant variable 
in choice behavior encountered in life, may 
be replicated in the laboratory in two dif- 
ferent paradigms. One is a between-subjects 
paradigm, which compares two groups, one 
of which must decide upon a task to be 
accomplished immediately, the other, upon a 
task to be performed at some later time. The 
second paradigm is a within-subject one, 
which refers to comparison of decisions of 
the same person, concerning similar tasks 
envisaged at different points in time. To some 
extent, this is what happens when a person 
reviews his decision as the time of its execu- 
tion approaches. Both paradigms are used in 
testing the hypotheses of the present study. 


METHOD


Subjects


The subjects were 100 male college students, en- 
rolled in classes of introduction to economics at a 
mid-western state university. Subjects were randomly
divided into two “treatment” groups, the “now” 
group and the “future” group. They were run in 
four classes of about 25 each.


Procedure and Measurement of Variables 


Subjects were tested in two sessions. In the first
session, measures of risk taking, valence, and expec- 
tancies were obtained. In the second session, measures 
of need Achievement, test anxiety, and future time 
perspective (not reported here) were obtained. 

In the first session, subjects were instructed 
(through written instruction on the first page of a 
booklet) about a test which they were to take either 
immediately (the now group) or after 4 weeks (the 
future group). The test entitled “the Stanford 
Inductive Reasoning Test” was described as a new 
test, intended to “examine the adequacy and predict 
the success of students in several academic fields”; 
it was to be “not a test of knowledge or informa- 
tion, but a test which requires quick orientation and 
inference.” They were instructed that the test in- 
cluded 12 items, each of which had two levels of 
difficulty, medium difficulty and high difficulty, and 
that the items were pretested so it was known that 
students in schools such as theirs answer high-diffi- 
culty questions correctly about 25% of the time and 
medium-difficulty questions correctly about 50% of 
the time. A correct answer to a high-difficulty ques- 
tion was to be awarded 2 points; to a medium- 
difficulty question, only 1 point. 

Measure of risk taking. The subjects were asked 
to choose one out of seven combinations of items of 
high and medium difficulty, including between 12 and
0 items of high difficulty out of a total of 12. Risk- 
taking score was calculated according to the number 
of high-difficulty items in the combination chosen. 
The score range was between 1 and 7. 
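
One consequence of the stated success rates and point values is worth making explicit: a high-difficulty item (25% chance, 2 points) and a medium-difficulty item (50% chance, 1 point) have the same expected value, so the combinations differ in the spread of outcomes and not in expected score. The sketch below illustrates this. It rests on assumptions not stated in the text: that the seven combinations step from 0 to 12 high-difficulty items in increments of two, that the risk-taking score of 1 to 7 rises with the number of high-difficulty items, and that items are answered independently.

```python
from fractions import Fraction

P_HIGH, P_MEDIUM = Fraction(1, 4), Fraction(1, 2)   # stated success rates
PTS_HIGH, PTS_MEDIUM = 2, 1                          # points per correct answer
N_ITEMS = 12

def expected_points(n_high):
    """Expected total score when n_high of the 12 items are high difficulty."""
    n_medium = N_ITEMS - n_high
    return n_high * P_HIGH * PTS_HIGH + n_medium * P_MEDIUM * PTS_MEDIUM

def variance_points(n_high):
    """Variance of the total score, ASSUMING independent items."""
    var_high = PTS_HIGH ** 2 * P_HIGH * (1 - P_HIGH)
    var_medium = PTS_MEDIUM ** 2 * P_MEDIUM * (1 - P_MEDIUM)
    return n_high * var_high + (N_ITEMS - n_high) * var_medium

# ASSUMPTION: combinations contain 0, 2, 4, ..., 12 high-difficulty items,
# scored 1 through 7 in that order.
combinations = list(enumerate(range(0, N_ITEMS + 1, 2), start=1))
for risk_score, n_high in combinations:
    print(risk_score, n_high,
          float(expected_points(n_high)), float(variance_points(n_high)))
```

Under these assumptions every combination has the same expected total, while the variance grows with each high-difficulty item added, which is one way to read "risk" in this design.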

Expectancy. The following question was asked in 
the now group (and a parallel one in the future 
group): 


What is your feeling about your personal chances 
of answering correctly any one type of question 
in the Stanford Inductive Reasoning Test you are 
going to take today? In the spaces below write 
down a number between 0 (no chance at all) and
100 (definitely sure) to indicate your evaluation
of your chances.

I think my chances (out of 100) of giving a
correct answer to any high-difficulty-level question
in the Stanford Inductive Reasoning Test I am
going to take today are ____ out of 100.


The same question was asked concerning questions 
of medium level of difficulty. 

These responses furnished two scores: Expectancy 
for correct answer to a high-difficulty question and 
to a medium-difficulty question. 

Valence. In order to get a direct measure of 
valence, a verbal index was used, following Feather 
(1967). It is assumed that the valence of an accom- 
plishment is reflected in the subject’s verbal response 
concerning his level of satisfaction (or dissatisfaction) 
following this accomplishment. The following ques- 
tion was asked in the now group (and a parallel 
one in the future group): 


The satisfaction a person derives from a success- 
ful accomplishment of a task, or from getting a 
high score in a test, depends very much on the 
nature of the task or what the test is measuring. 
A correct answer to one type of question may give 
you a great deal of satisfaction, while a correct 
answer to another type of question would give 
you only minimal satisfaction. 

On the scale below please indicate how much 
satisfaction you would feel if you attained a high 
score in the Stanford Inductive Reasoning Test you 
are going to take today. (You may use any refer- 
ence point you like to.) Please put a check mark 
(✓) within one of the nine spaces below.

[Nine-space rating scale, anchored “A great deal of satisfaction,”
“Moderate satisfaction,” and “Very little satisfaction.”]

The response to this question furnished us with a 
score for “valence of success.” The score range was 
between 1 (little satisfaction-low valence) and 9 
(high satisfaction-high valence). A similar question 
was asked concerning dissatisfaction, substituting 
“dissatisfaction” for “satisfaction” in the scale. This 
gave us a score for “valence of failure” with the 
same range, the higher score indicating the higher 
valence. 

Change scores. After making the previous responses,
the subjects were instructed that another
version of the test would be given to them after 
4 weeks (in the now group), or on the same day 
(in the future group). Then, measures of risk taking, 
valence, and expectancy were obtained for this test 
also. It should be noted that when responding on 
the first test (to be taken either “now” or “after 
4 weeks”), the subjects did not know that they were 
going to be asked to make similar responses for 
another test, to be taken at a different point in time. 

We now had two sets of similar measures for each 
subject concerning two tests, to be taken at two 
different points in time. By subtracting the first score 
of an individual from his second score, we derived a 
“change score” for each of the measures used. A
positive “change score” means that the individual
had a higher score (in either expectancy, valence, or
risk taking) in his second set of responses than in his
first one, while a negative change score means
the opposite. Since the now group first decided for
now and only afterwards for “after 4 weeks,” while
the future group decided in a reverse order, we
would expect, by the hypotheses, a difference between
the change scores of the two groups. More
specifically, we would expect a trend toward positive
change scores in the now group (meaning higher
scores in expectancy, valence, and risk taking
in their second set of responses, i.e., for “after
4 weeks”), and a trend toward negative change
scores in the future group.
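
The sign convention is easy to lose track of, so a small sketch may help. It follows the subtraction rule given above (second score minus first score); the function names and the example scores are hypothetical and are not data from the study.

```python
def change_score(first, second):
    """Second response minus first response, as defined in the text."""
    return second - first

def later_minus_sooner(group, first, second):
    """Re-express a change score as (test after 4 weeks) minus (test today).

    The now group answers for today first; the future group answers for
    "after 4 weeks" first, so its change score has to be sign-flipped.
    """
    d = change_score(first, second)
    if group == "now":
        return d
    if group == "future":
        return -d
    raise ValueError("group must be 'now' or 'future'")

# Hypothetical expectancy ratings (0-100) for two subjects who both
# expect to do better on the test that is 4 weeks away.
now_subject = change_score(first=40, second=50)      # today, then 4 weeks
future_subject = change_score(first=50, second=40)   # 4 weeks, then today

print(now_subject, future_subject)
print(later_minus_sooner("now", 40, 50), later_minus_sooner("future", 50, 40))
```

Both hypothetical subjects favor the later test by the same amount, yet their change scores have opposite signs, which is why the hypotheses predict positive change scores in the now group and negative ones in the future group.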

Introspective data. After the subjects had re- 
sponded to all the questions in the booklet, they 
were given a questionnaire containing various ques- 
tions concerning their responses in the booklet. The 
true aim of the experiment was then disclosed and 
explained. 

Personality measures. In a second session, measures 
of need Achievement and test anxiety were obtained. 
Need Achievement was assessed through a projective 
test, using the standard procedures. The following 
four pictures were presented (using the numbers 
assigned by Atkinson, 1958): 2, 8, 9, 7. Protocols 
were coded by an experienced coder whose reliability 
had been established. 

Test anxiety was assessed through the Achievement
Anxiety Test, designed by Alpert and Haber
(1960), to give separate measures of debilitating 
and facilitating anxiety. In order to arrive at a 
success or failure orientation score, the following 
procedure was employed: the need Achievement 
scores and the debilitating anxiety scores were con- 
verted into Z scores, and for each subject the dif- 
ference in Z scores was calculated, that is Z (need 
Achievement) — Z (debilitating anxiety). It was as- 
sumed that subjects with a positive Z-score difference 
would more likely be success oriented, and that 
subjects with a negative Z-score difference would 
more likely be failure oriented.
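
The orientation score described in this paragraph can be sketched as follows. This is a minimal illustration under stated assumptions: the text does not say whether the sample or the population standard deviation was used (the population form is used here), nor how a difference of exactly zero was handled (it is left unclassified here), and the raw scores are invented for the example.

```python
from statistics import mean, pstdev

def z_scores(values):
    """Standardize raw scores (population SD; an assumption of this sketch)."""
    m, s = mean(values), pstdev(values)
    if s == 0:
        raise ValueError("scores have zero variance")
    return [(v - m) / s for v in values]

def orientation(n_ach, debilitating_anxiety):
    """Return (Z difference, label) per subject: Z(n Ach) - Z(anxiety)."""
    results = []
    for z_a, z_x in zip(z_scores(n_ach), z_scores(debilitating_anxiety)):
        diff = z_a - z_x
        if diff > 0:
            label = "success oriented"
        elif diff < 0:
            label = "failure oriented"
        else:
            label = "unclassified"   # ASSUMPTION: ties are not covered in the text
        results.append((diff, label))
    return results

# Hypothetical raw scores for six subjects (illustrative only).
n_ach = [2, 8, 5, 11, 4, 6]
anxiety = [30, 18, 25, 15, 28, 22]
for diff, label in orientation(n_ach, anxiety):
    print(round(diff, 2), label)
```

Because both sets of Z scores have a mean of zero, the differences also average zero, so splitting at zero divides a sample roughly in half when the difference scores are symmetrically distributed.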


RESULTS 


The findings are presented in three sections,
corresponding to the three classes of dependent
variables under study: (a) temporal
distance effects upon risk taking; (b) temporal
distance effects upon valence; and (c)
temporal distance effects upon expectancy. In
each section, the results are presented for the
between-subjects paradigm, namely, comparison
between the scores of two treatment
groups in their first decision (i.e., for now
in one group and for after 4 weeks in the
future group), and for the within-subject
paradigm (namely, between the change scores
of the two groups).


TABLE 1

MEANS AND STANDARD DEVIATIONS FOR DEPENDENT MEASURES BY
ORIENTATION AND TREATMENT

                                               Success orientation           Failure orientation
                                             Now group    Future group     Now group    Future group
Measure                                       M      SD      M      SD      M      SD      M      SD

Risk taking                                  3.44   1.19    4.40   1.38    3.72   1.17    3.72   1.24
Valence of success                           6.04   1.83    6.96   1.30    6.24   1.50    6.24   2.02
Valence of failure                           5.24   2.02    4.96   1.79    5.12   2.16    5.48   2.06
Expectancy in high difficulty               35.80  11.60   49.20  18.57   42.20  20.56   45.00  22.68
Expectancy in medium difficulty             65.40  11.07   73.40  12.05   65.80  14.48   68.40  17.00
Change in risk taking                         A8    a1)    −.16    .80     Ald    .92     .08    .70
Change in valence of success                  .20    .70    −.24    .87    −.32   1.03    −.08    .86
Change in valence of failure                 −.16    .74     .12   1.71    −.28   1.74    −.24   1.09
Change in expectancy in high difficulty      5.80   7.02   −1.20   7.67    1.40   3.95   −1.60   3.45
Change in expectancy in medium difficulty    3.80   6.96   −4.20  11.33    2.00   5.59    −.80   4.00

Note. Scores refer to the means of the first indication in each treatment group, that is, for now in the now group and for
after 4 weeks in the future group. “Change scores” were obtained by subtracting each individual's score in his first indication
from his score in his second indication, for a different point in time. In the now group a positive change score means a higher
indication for the future; in the future group a positive change score means a higher indication for now.


Risk Taking 


Table 1 shows the mean risk scores by
treatment (now versus future) and orientation
(success versus failure orientation).
Analysis of variance indicated that the effect
of temporal distance was significant at the
.06 level (F = 3.69, df = 1/96),³ orientation
did not yield a significant effect (F < 1), and
the interaction between treatment and orientation
was significant at the .06 level
(F = 3.69). Comparison of the two treatment
conditions within each orientation group reveals
that success-oriented subjects take significantly
higher risk when deciding on a test
to be taken after 4 weeks, as compared with
a test to be taken on the same day (t = …,
p < .005, one-tailed test), while among the
failure-oriented group, exactly the same mean
in the “now” group and in the “future” group
was found.

3 Degrees of freedom in analyses of variance
throughout this paper are 1/96.

Similar results are obtained in the analy- 
sis of the change-in-risk-taking scores. The 
two treatment groups differ significantly in 
this score (F = 4.6, p < .05), orientation does 
not show a significant effect (F < 1), and the 
interaction between treatment and orientation 
is significant only at the .10 level (F = 3.5).
An analysis of the means shows that success- 
oriented subjects tend to take higher risks 
when deciding on a test to be taken after 
4 weeks, as compared with a test to be taken 
immediately, while among failure-oriented 
subjects, the respective differences are negligi- 
ble; that is, temporal distance does not affect 
their risk taking. 


Valence 


The mean valence scores by treatment and 
orientation are shown in Table 1. Analysis 
of variance indicates no significant effect of 
either temporal distance or orientation in 
both valence of success and valence of failure. 
However, it can be seen in Table 1 that 
success-oriented subjects tend to attribute 
higher valence to success in a test expected
in the future, as compared with a test to be 
taken on the same day. No such trend is 
displayed by failure-oriented subjects. 

A similar picture is found in change-in- 
valence scores. Neither temporal distance nor 
orientation had a significant effect upon change 
in valence of success; however, the interaction
term approaches significance (F = 3.88,
p < .06). Analysis of the relevant figures in
Table 1 reveals that while success-oriented 
subjects attribute higher valence to success in 
the test to be taken after 4 weeks (therefore, 
their change scores are positive in the now 
group and negative in the future group), 
failure-oriented subjects show a weak trend 
in the opposite direction. No difference is 
found in the scores of change in valence 
of failure. 

We may conclude that valence of success 
or failure in a future task is not considered 
smaller than that of success or failure in a 
task to be performed immediately. There are 
even indications that among success-oriented 
subjects the valence of success in a future 
task is higher than in an immediate task. 


Expectancies 


The mean scores of expectancy for correct 
answers to high-difficulty and to medium- 
difficulty items, by treatment and orientation, 
are shown in Table 1. The results show a 
consistent trend toward higher expectancy for 
the test to be taken after 4 weeks, as com- 
pared with the test to be taken on the same 
day. Analysis of variance indicated that this 
trend is significant at the .05 level in expec- 
tancy in high-difficulty items (F = 4.6), and 
at the .06 level in expectancy in medium- 
difficulty items (F = 3.6). Orientation had no 
significant effect upon expectancy (F < 1),
and the interaction between treatment and
orientation is short of significance (F = 1.9).
Testing the effect of temporal distance
within each orientation group shows a significant
effect in the success-oriented group (in
expectancy in high-difficulty items, t = 2.51,
p < .01; and in medium-difficulty items,
t = 1.74, p < .05, one-tailed test), but no significant
effect among failure-oriented subjects.

Analysis of change-in-expectancy scores revealed
a similar picture. Analysis of variance
indicated a significant effect of temporal distance
for both change in expectancy in high-difficulty
items (F = 18.3, p < .001), and in
medium-difficulty items (F = 13.0, p < .001).
Orientation has a significant effect on change
in expectancy in high-difficulty items (F = 4.23,
p < .05), so that success-oriented subjects
have higher change scores than failure-oriented
subjects. The interaction of treatment
and orientation falls short of significance
in both change in expectancy in high-difficulty
items (F = 2.8, p < .10) and in medium-difficulty
items (F = 3.01, p < .10). In both
cases, it was found that the differences between
treatment groups in change scores are
highly significant among the success-oriented
subjects (for expectancy in high-difficulty
items, t = 4.24, p < .001; and for medium-difficulty
items, t = 3.77, p < .001, one-tailed
test), and reach significance among failure-oriented
subjects only in expectancy for high-difficulty
items (t = 1.82, p < .05, one-tailed
test).

The following conclusion seems warranted: 
People tend to estimate their chances for 
success in a future task as higher than their 
chances in a task to be performed in the 
“present.” This is a strong tendency for 
success-oriented people, but a weak one for 
the failure oriented. 


DISCUSSION 


Risk Taking and Expectancy in Relation to 
Temporal Distance 


This study shows that in making decisions 
about a skill task to be undertaken after a 
lapse of time, people behave differently than
with a task to be undertaken immediately:
they take a greater risk, that is, they
choose more items with a stated low probability
of success, and they also have higher expectancies
for success. Such changes are found
primarily among success-oriented subjects. 
Viewing expectancy and valence as the factors 
which determine choice behavior, a plausible 
interpretation of these results would be that 
the rise in risk taking as a function of tempo- 
ral distance is mediated by an increase in 
expectancy. People view their chances in the 
future as higher than in the present, and, 
presumably, that is why they take a higher
risk when deciding on the future. 

However, an alternative interpretation of 
the relationship between risk taking and expectancy
cannot be excluded. It is possible
that the higher expectancy for success in the 
future group was not the cause, but rather 
an effect, of the higher level of risk taken in 
this group. According to this interpretation, 
subjects would tend to justify their riskier 
choices by holding higher expectancies of 
success. Such an interpretation would be 
weakened if we find that temporal distance 
brings about a rise in expectancy for success 
even in tasks where no choice is required. A 
study in progress indicates that indeed this 
is the case. 

Since the decisions in the present study 
were concerned with a task involving skill, one 
may argue that the rise in expectancy results 
from a deliberate intention of the subjects to 
use the intervening period to prepare for the 
task. In this case the explanation for the rise 
in expectancy can be reduced to the less inter- 
esting factor of “practice effects.” We regard 
such a simple explanation as unacceptable. 
Indeed, the study was planned to eliminate 
this explanation. It is true that the task on 
which the subjects decided in the present 
study was a skill task, but it was not a con- 
trollable one, nor was it perceived as such. 
This was indicated in a pilot study in which 
the same task was presented and the subjects 
were asked, before or after choice, whether 
they had thought of a way to practice for the 
test, and whether such thoughts had influ- 
enced their decision. It was clear that the sub- 
jects could not see any possible way to affect 
their chances for success, either through prac- 
tice or any other means. In fact, such a pos- 
sibility did not exist, since the task was 
defined in such general terms that the subject 
could not be aware of its exact nature. How- 
ever, even if we accept the possibility of 
deliberate intentions to practice, a basic ques- 
tion still must be raised: Why did the inten- 
tions exist under these conditions? This ques- 
tion brings us to the considerations which 
follow. 

A clue to the interpretation of the rise 
in risk taking and expectancy is that it is 
found almost exclusively among success-oriented
subjects. This finding is explained
by, and was predicted in, an approach which 
emphasizes the operation of temporal distance 
as a meaningful stimulus in the individual’s 
psychological field. According to this view, 
the effect of temporal distance depends upon 
the way it is perceived and construed by the 
individual. It was suggested that the higher 
expectancy for the future derives from a 
“generalized expectancy” (Rotter, 1966) that 
time improves one’s control over the environ- 
ment. By “generalized,” it was implied that 
it is operative in a wide range of behavior, 
even where it is not justified on objective 
grounds. The overgeneralization of the expectancy
must not divert attention from its more
regular and important application elsewhere, 
that is, in new and ambiguous situations 
where the individual does not have a sufficient 
basis for judging whether having more time to 
prepare will really increase his chance of success.
Whether he would actively try to influence
the course of events, or passively [...]
their coming, would depend on [...]
of his generalized expectancy conc[...]
The relationship between this concept and
that of “internal control” (Rotter, 1966) is 
now clear. The present study may be con- 
sidered as a specific application of internality 
to the dimension of time. However, it may 
also be seen as specifying a condition for 
“internality,” namely, the availability of time. 
One should define the limits and conditions 
for effectiveness of the discussed expectancy. 
For example, it may be asked how it would 
affect expectancy concerning a situation which 
involves competition between two people. 
Does the “generalized expectancy” generalize 
to the partner too, so that temporal distance 
is perceived as improving his chances also? 
Another question is whether this expectancy 
operates with regard to chance-dependent 
tasks. Somewhat surprising results obtained 
by Strickland, Lewicki, and Katz (1966)
suggest that a related expectancy may be 
effective even in chance-dependent situations. 
These authors found that subjects who made 
their bets after throwing dice, without know- 
ing the results, tended to take lower risk than 
subjects who made their bets before throwing 
the dice. The latter condition, the authors 
assert, is perceived as somewhat more controllable.
Thus, expectancies with regard to
controllability of the situation generalize 
to purely chance-dependent tasks, and this 
might well be the case in regard to the percep- 
tion of controllability produced by temporal 
distance. 


Temporal Distance and Valence 


The present results reveal a consistent but 
not significant trend in the direction of higher 
valence put on success in a future, rather 
than in an immediate, test. This finding is in 
the hypothesized direction, but is still quite 
puzzling. First, it seems to stand in contrast 
to the interpretations of the results of many 
studies (e.g., Mischel et al., 1969) which show 
that rewards expected in the future are less 
attractive, or have a lower valence, than those 
expected immediately. Second, it stands in 
contrast to the observational judgment of the 
experimenters and to the information that was 
gathered through the inquiries. These two 
sources of information indicated that the sub- 
jects were more aroused, or “excited,” in the 
now condition. This excitement might be
taken as an indication of the valence the 
subject attributes to a task. 

This difficulty suggests a distinction be- 
tween two aspects of what we have referred 
to as “valence”: one, “the arousal aspect,” 
refers to the extent a subject is “aroused” by 
a stimulus; and a second, “the judgmental 
aspect,” refers to the individual’s rational 
judgment of the “objective value” of a 
stimulus, independently of his own tendencies 
toward it. Granted that the two aspects are 
interdependent, the suggested distinction as- 
sumes conditions which affect the two aspects 
in a different way. Such a condition is the 
introduction of temporal distance. “Arousal” 
seems to decrease with increasing temporal 
or spatial (or any psychological) distance, 
while there is no reason why it should affect 
the judgment concerning the value of an ob- 
ject; the stimulus does not change as a result 
of temporal distance (and if it does, it may 
be in either direction). Thus, the two aspects 
of valence may act partly independent of one 
another. It is suggested that the verbal mea- 
sure used in the present study captures mainly 
the judgmental aspect of valence. Subjects
were asked concerning probable success or
failure in a forthcoming unknown test, and
their indications of valence might represent
an attempt to evaluate the importance and 
significance of the test, which is not identical 
with the extent they were aroused by it. In 
this case one may expect, as suggested above, 
a higher valence attached to a test expected 
after some time than to one expected immedi- 
ately. Indeed, such a trend was found in the 
present study among success-oriented subjects. 

As with expectancy, one may argue that the
revealed trend for higher evaluation of suc- 
cess in a test to be taken after some time 
results from the rise in risk taking, from the 
choice of more difficult items in this test. 
Results of a related study (Nisan, 1968) are 
not consistent with this interpretation. In this 
study a trend toward higher evaluation of 
future success was found in value rating of 
“success” in items of medium and of high 
level of difficulty, and not merely of success 
in the test as a whole. These results are in 
accord with the hypothesis that temporal dis- 
tance brings about a heightened sense of con- 
trol, which results in higher evaluation of 
success in all types of tasks. 


CONCLUSIONS


The concept of “expectancy” holds a cen- 
tral place in several theoretical formulations 
concerning choice behavior. Time is an essen- 
tial dimension of this concept. Outcomes may 
be expected to happen after a short, or after 
a long, time. The present study indicates the 
importance of the dimension of time, and 
suggests that in studying the temporal dimen- 
sion in regard to choice behavior, it is 
necessary to consider not only the formal di- 
mension of length of temporal distance, but 
also a content dimension of how temporal dis- 
tance affects the way a situation is perceived. 
This dimension necessitates a consideration of 
individual differences in interaction with different
types of tasks.

Forty male undergraduates participated in an experiment designed to investigate
the effects of ambient temperature and prior anger arousal on adult
aggressive behavior. It was hypothesized that following instigation to aggres- 
sion, subjects would direct stronger attacks against the anger instigator under 
uncomfortably hot than comfortably cool temperatures. However, it was 
expected that in the absence of prior anger arousal, temperature would fail 
to exert a significant effect on aggression. Results indicate that contrary to 
these predictions, aggression was actually inhibited by high ambient tempera- 
tures, and that this was the case regardless of the level of prior anger arousal 
to which aggressors had been exposed. Several factors which may have
contributed to this unexpected pattern of results are discussed.


Mass media reports of the tragic instances 
of collective violence which have occurred 
with increasing frequency in recent years have 
often suggested that such events are attrib- 
utable, at least in part, to what has been 
termed “the long hot summer.” More pre- 
cisely, such reports have frequently implied 
that high temperatures are somehow capable 
of facilitating the performance of overt ag- 
gressive acts and so often contribute to the 
occurrence of ghetto riots and campus dis- 
turbances. Anecdotal evidence in support of 
this view is provided by the common observa- 
tion that we are often more irritable under 
conditions where we are hot and uncomfort- 
able than under conditions where we are cool 
and comfortable. More substantial support 
for the hypothesized relationship between 
temperature and aggression is provided by 
the fact that a large proportion of recent civil 
disorders have actually occurred on days 
when “heat wave” or near “heat wave” condi- 
tions prevailed (United States Riot Commis- 
sion, 1968). However, despite the fact that 
psychologists have recently shown increased 
interest in the effects of various aspects of the 
physical environment on human behavior
(Proshansky, Ittelson, & Rivlin, 1970; Wohlwill,
1970), no direct experimental evidence
concerning this relationship appears to exist.

University of South Carolina. Sincere appreciation is
versity, Lafayette, Indiana 47907.

Indeed, even indirect evidence is provided 
by the results of only one recent experiment 
(Griffitt, 1970), an investigation concerned 
with the effects of ambient temperature and 
attitude similarity on interpersonal attraction. 
In this study, subjects were presented with 
information which suggested that a stranger 
was relatively similar or dissimilar to them- 
selves in terms of attitudes and then required 
to make a series of judgments about this 
individual. One-half performed this task 
under very hot and uncomfortable conditions, 
while the remaining half participated in the 
experiment under cool and comfortable condi- 
tions. Results indicated that reported attrac- 
tion toward the stranger was significantly 
lower under the hot than cool temperatures, 
and that this was the case regardless of his 
degree of apparent similarity to the subjects. 
In view of the fact that a low level of attrac- 
tion toward a potential victim would gen- 
erally be expected to facilitate aggression 
against this person (e.g., Baron & Kepner, 
1970), these findings are entirely consistent 
with the view that aggression may be aug- 
mented by high ambient temperatures. 

The present experiment was designed to 
attain more direct evidence concerning the 
influence of temperature on adult aggressive 
behavior. In order to examine this relationship,
male subjects were first angered or not
angered by a confederate of the experimenter,
and then provided with an opportunity to
aggress against this individual by means of
electric shock (Buss, 1961). For one-half of 
the subjects, the entire experiment was con- 
ducted under comfortably cool temperatures, 
while for the remaining half, it was conducted 
under uncomfortably hot temperatures. On 
the basis of the suggestions described above, 
it was hypothesized that following prior 
instigation to aggression, subjects would di- 
rect stronger attacks against the anger insti- 
gator under hot than cool temperatures. How- 
ever, it was also expected that in the absence 
of prior instigation, ambient temperature 
would fail to exert a significant influence upon 
overt aggressive behavior. 


METHOD 
Subjects 


Forty undergraduate males enrolled in summer 
session classes at the University of South Carolina 
participated in the experiment. Students took part 
in the study to attain extra points toward their 
course grades. 


Design 


A 2 × 2 factorial design based upon 2 levels of
temperature (cool, hot) and 2 levels of prior anger
arousal (nonangry, angry) was employed. Ten subjects
were randomly assigned to each cell of this
design.
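
The equal-cell random assignment described above can be sketched in a few lines of Python. This is only an illustrative sketch: the article does not say how the randomization was carried out, and because cool and hot sessions were run on alternate days, assignment to temperature may in practice have depended on the day a subject was scheduled. The function name `assign_to_cells`, the subject numbering, and the seed are assumptions introduced here.

```python
import random
from collections import Counter


def assign_to_cells(subject_ids, temperatures=("cool", "hot"),
                    anger_levels=("nonangry", "angry"), seed=None):
    """Randomly assign subjects to the cells of a factorial design, equal n per cell."""
    cells = [(t, a) for t in temperatures for a in anger_levels]
    if len(subject_ids) % len(cells) != 0:
        raise ValueError("number of subjects must be a multiple of the number of cells")
    per_cell = len(subject_ids) // len(cells)
    # Build one slot per subject: each cell label repeated per_cell times.
    slots = [cell for cell in cells for _ in range(per_cell)]
    # Shuffle the slots so that which subject lands in which cell is random.
    rng = random.Random(seed)
    rng.shuffle(slots)
    return dict(zip(subject_ids, slots))


# Hypothetical subject numbers 1..40; the seed only makes the sketch repeatable.
assignment = assign_to_cells(list(range(1, 41)), seed=0)
cell_counts = Counter(assignment.values())
for cell, n in sorted(cell_counts.items()):
    print(cell, n)
```

Shuffling a fixed list of cell labels, rather than drawing a cell independently for each subject, is what guarantees exactly ten subjects per cell.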


Apparatus 


The apparatus consisted of a modified Buss 
“aggression machine” (Buss, 1961); a Lafayette stop 
clock (Model 20225 A); a Hunter interval timer 
(Model 111-C); a device for the delivery of shock 
to the subjects; and an intercom system. 

The Buss aggression machine was identical to 
apparatus employed in previous experiments (e.g., 
Baron & Kepner, 1970). Thus, it contained 10 push- 
button switches which could be depressed by the sub- 
jects to deliver electric shocks of varying intensities 
to another individual on occasions when he made 
“errors” in a learning task. The Lafayette stop clock 
was employed to measure the duration of shocks 
administered by subjects to the learner. The materials
to be mastered by this individual were lists of
40%-60% Glaze association value nonsense syllables. 
Subjects presented these materials to the learner over 
the intercom system. The interval timer was em- 
ployed to control the duration of shocks delivered 
to the subjects during an initial (anger arousal) 
phase of the experiment.


Procedure


When the subjects arrived for the experiment, 
they found a confederate already seated in the 
waiting room. Shortly thereafter, the experimenter 
arrived and conducted both individuals to the first 
experimental room. Once there, she explained that 
the first part of the experiment was concerned with 
attitudes and opinions and requested that both indi- 
viduals fill out a 23-item attitude questionnaire 
dealing with a wide variety of issues (e.g., student 
protests, belief in God, the draft). This task was 
employed as a time filler to insure that the subjects 
spent sufficient time in the experimental rooms for 
any influence of ambient temperature on their behav- 
ior to be apparent. In order to spare the confederate 
repeated exposure to high ambient temperatures 
during this phase of the study, however, this indi- 
vidual was removed from the scene (ostensibly to 
permit both individuals to work on the items un- 
disturbed) and completed his questionnaire in 
an adjacent room removed from the subject’s sight. 

Anger arousal. After allowing both the subject and
confederate 15 minutes to complete the questionnaire 
(a more than ample period of time), the experi- 
menter conducted the confederate back into the 
first experimental room and announced that the 
next part of the experiment would be concerned with 
the effects of stress on problem-solving ability. She 
then explained that in order to study this problem, 
one of the two individuals would serve as a problem 
solver and the other as an evaluator. The subject 
was then arbitrarily designated to play the first role 
and the confederate to play the second. The experi- 
menter explained that the problem solver’s (i.e., the 
subject’s) task would consist of writing a solution 
to a complex social problem (how to control rising 
crime rates), knowing that his work would be 
evaluated in two different ways by the evaluator. 
This individual (the confederate) would write his 
overall reaction to the subject’s solution on a form 
which the subject would later see, and would also 
deliver from 0 to 10 shocks to this individual according
to the rule that the poorer his work, the
greater number of shocks he would receive. In 
reality, the evaluations presented to the subject were 
varied in a systematic manner and provided the 
major manipulation of anger arousal (see, e.g.,
Berkowitz & Geen, 1966). In the nonangry condition,
the confederate delivered only 1 shock to the
subject and wrote a very positive statement about 
his solution, while in the angry condition, he de- 
livered 9 shocks to this individual and wrote a very 
negative statement about his work.³ Again, in order
to spare the confederate repeated exposure to high 
ambient temperatures, this individual was removed 
from the scene and performed his task as evaluator 
in the adjacent room. 


3 As has been the case in previous experiments
conducted by the author (Baron, 1971), the shocks 
to subjects consisted of 67-volt, .5-second pulses 
delivered by means of stainless-steel finger electrodes.

Aggression opportunity. After the completion of
the above procedures, the experimenter conducted 
both the confederate and the naive subject to a 
second experimental room and explained that the 
final part of the study was concerned with the 
effects of punishment on learning. She then stated 
that in order to examine this problem, one of the 
individuals would act as a teacher and the other as 
a learner. The teacher would attempt to teach the 
learner a list of nonsense syllable pairs, rewarding 
him for correct responses and punishing him with 
electric shock for errors. The experimenter then
requested that since he had acted as the evaluator 
in the previous part of the study, the confederate 
should now agree to serve as learner. The confederate 
agreed to this request, and the experimenter then 
conducted him to a third room where she provided 
instructions for his task in the study and appeared 
to attach the shock electrodes to his arms. After 
completing these activities, the experimenter returned 
and read detailed instructions for his task as teacher 
to the naive subject. These instructions indicated 
that each time the learner made a correct response 
on the learning task, the teacher should reward him 
by pushing a “correct” button on the apparatus. 
However, whenever the learner made an error, the 
teacher should choose and depress one of the 10 
shock buttons in order to punish this individual. 
The experimenter carefully explained that the higher 
the number of the button chosen by the teacher,
the stronger the shocks to the learner, and that the 
longer any shock button was depressed, the longer 
these shocks would last. In order to convince the 
subject that the aggression machine was in fact 
operational, she then administered sample shocks 
from Buttons 4 and 5 to this individual. The 
shocks produced by these buttons were 36 and 50 
volts, respectively, and generally produced pain 
verbalizations on the part of subjects. 

Following these activities, the experimenter re- 
turned to the room where the learner was waiting 
and signaled the subject to begin teaching his list 
of nonsense syllables to this individual. The learner 
appeared to make a total of 20 errors during the 
experiment, but seemed to gradually master the list. 

Postexperimental questionnaire. After the last shock 
trial, the subjects completed a postexperimental ques- 
tionnaire on which they rated (on 7-point scales) 
their anger toward the victim and their current 
feelings along the dimensions of comfortable- 
uncomfortable and pleasant-unpleasant. 

Ambient temperature. The manipulation of ambient 
temperature was quite simple and straightforward. 
In the cool condition, all phases of the experiment 
were conducted under comfortably cool temperatures, 
while in the hot condition, all phases took place 
under uncomfortably hot temperatures. In order to 
implement this variation in the most effective man- 
ner, it was necessary to conduct cool and hot condi- 
tions on alternate days. Thus, in the cool condition, 
the air conditioning in the two experimental rooms 
occupied by the subjects during the study was 
turned on full and the thermostat set at a low level
on the previous night. In the hot condition, however,
the air conditioning was turned off, and two electric
heaters were used to raise the temperature in these
rooms to very uncomfortable levels. Readings of
ambient temperature were taken both before and
after each experimental session in both experimental
rooms. In the cool condition, the average before and
after readings were 74.2 degrees and 74.3 degrees
in the first room, and 74.6 degrees and 75.0 degrees
in the second room. In the hot condition, the corresponding
readings were 93.9 degrees and 91.1 degrees
in the first room, and 95.2 degrees and 91.8
degrees in the second. Subjectively, the experimental
rooms seemed to be pleasantly cool and comfortable
in the cool condition, but uncomfortably hot and
stuffy in the hot condition.⁴


RESULTS 
Shock Duration 


Figure 1 presents the mean duration of 
shocks administered to the learner by subjects 
in the angry and nonangry conditions as a 
function of ambient temperature. Inspection 
of this figure suggests that this measure was 
strongly influenced by both experimental 
factors. Thus, as expected, subjects in the 
angry condition delivered longer shocks to 
the victim than subjects in the nonangry 
condition. However, in direct opposition to 
experimental predictions, subjects in the 
cool condition delivered longer shocks to the 
victim than subjects in the hot condition. 

An analysis of variance in which tempera- 
ture and anger arousal were the between- 
subjects factors and trials the within-subjects 
factor was performed on these data. The re- 
sults of this analysis indicated that con- 
sistent with the pattern of results shown in 
Figure 1, the effects of anger arousal 
(F = 4.74, df = 1/36, p < .05) and temperature
(F = 4.54, df = 1/36, p < .05) were
both significant, while none of the other ef- 
fects in the analysis attained or approached 
significance. Thus, as suggested by this
figure, aggression was facilitated by prior
anger arousal but inhibited by high ambient
temperatures.


4 Unfortunately, it proved impossible to hold rela-
tive humidity completely constant as temperature
was varied. Thus, although two humidifier-vaporizers
were employed in an attempt to maintain uniform
conditions, average humidities were somewhat higher
in the cool condition (40%-50%) than in the hot
condition (20%-30%). It appears that more elabo-
rate equipment than that available for the present
study would be required to maintain precise control
over this factor.


Fig. 1. Mean shock duration as a function of prior anger arousal
and ambient temperature.


Shock Intensity 


Figure 2 presents the mean intensity of 
shocks administered to the learner by sub- 
jects in the angry and nonangry conditions 
as a function of ambient temperature. Inspec- 
tion of this figure reveals a pattern of results 
very similar to those obtained with the dura- 
tion measure. Thus, subjects in the angry 
condition delivered stronger shocks to the 
learner than subjects in the nonangry condi- 
tion, while subjects in the cool condition 
delivered stronger shocks to this individual 
than subjects in the hot condition. 

An analysis of variance in which tempera- 
ture and anger arousal were the between- 
subjects factors and trials the within-subjects 
factor was performed on these data. The results
of this analysis indicated that the effect
of anger arousal was significant (F = 7.31, 
df = 1/36, p < .025), while that of tempera- 
ture approached significance (F = 3.00, df = 
1/36, p < .10). Thus, as was the case with
shock duration, aggression was facilitated by
prior anger arousal but somewhat inhibited 
by high ambient temperatures. In addition, 
it was found that the effect of trials was also 
significant (F = 15.90, df = 3/108, p < .001). 
Thus, as has been reported in previous experiments
(e.g., Baron & Kepner, 1970; Buss,
1966; Geen, 1968), subjects increased the 
intensity of their shocks to the victim over 
the course of the experiment. 
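
The F ratios reported above for shock duration and shock intensity can be converted to an approximate effect size. The sketch below uses the standard identity partial eta squared = (F × df1) / (F × df1 + df2). The article itself reports no effect sizes, so the resulting values are an illustration rather than part of the original analysis, and the function name `partial_eta_squared` and the dictionary labels are assumptions introduced here.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared implied by an F ratio and its degrees of freedom."""
    if f_value < 0 or df_effect <= 0 or df_error <= 0:
        raise ValueError("F must be non-negative and both df must be positive")
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# F ratios and degrees of freedom exactly as given in the Results section.
reported_effects = {
    "duration, anger arousal": (4.74, 1, 36),
    "duration, temperature": (4.54, 1, 36),
    "intensity, anger arousal": (7.31, 1, 36),
    "intensity, temperature": (3.00, 1, 36),
    "intensity, trials": (15.90, 3, 108),
}

effect_sizes = {
    name: partial_eta_squared(f_value, df_effect, df_error)
    for name, (f_value, df_effect, df_error) in reported_effects.items()
}

for name, value in effect_sizes.items():
    print(f"{name}: partial eta squared = {value:.3f}")
```

On this conversion the temperature effect on shock duration is of roughly the same size as the anger arousal effect, which matches the similar F ratios reported for the two factors.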


Postexperimental Questionnaire 


The first item on the postexperimental 
questionnaire required the subjects to rate 
their degree of anger toward the victim after 
he had evaluated their solution and shocked 
them during the second phase of the study. 
The means for the nonangry and angry 
groups were 1.05 and 3.45, respectively. The 
appropriate Mann-Whitney U test indicated 
that this difference was highly significant 
(z = 7.36, p < .001). Thus, the manipulation 
of anger arousal appeared to be successful. 

Fig. 2. Mean shock intensity as a function of prior anger arousal
and ambient temperature.


Two additional items on the questionnaire
required the subjects to rate their present
feelings along the dimensions of pleasant-unpleasant
and comfortable-uncomfortable.
Separate analyses of variance performed on 
the data for these items indicated that sub- 
jects in the cool condition reported feeling 
significantly more pleasant and comfortable 
than did subjects in the hot condition 
(F = 11.28 and 13.64, df = 1/38, p < .005
and p < .001, respectively). Thus, the experi- 
mental variations in ambient temperature 
appeared to produce the expected effects on 
subjects. 


DISCUSSION 


The results of the present experiment failed 
to provide support for the hypothesis that 
following instigation to aggression, subjects 
would direct more intense attacks against an 
anger instigator under high than low ambient 
temperatures. Instead, results obtained with 
both dependent measures indicated that un- 
comfortably high temperatures tended to in- 
hibit such behavior, regardless of the level of 
anger arousal to which aggressors had been 
previously exposed. In view of the fact that
data from the postexperimental questionnaire
suggested that both anger arousal and temper- 
ature were manipulated in an effective manner, 
these findings cast serious doubt upon the 
view that high temperatures per se often play 
an important role in the elicitation of in- 
stances of collective violence. Instead, it ap- 
pears that very high temperatures may actu- 
ally tend to inhibit the performance of overt 
aggressive acts by adult human beings. Sev- 
eral possible explanations for this unexpected 
pattern of findings may be readily suggested. 

First, it is possible that the very high 
temperatures employed in the hot condition 
of the present study made the subjects so 
uncomfortable that they sought to complete 
the experimental task and escape from the 
stifling laboratory as quickly as possible. As 
a result, they may have reduced the duration 
of the shocks that they delivered to the 
victim and also lowered the intensity of these 
attacks in order to avoid any delays con- 
tingent upon their use of strong punishment 
(e.g., possible censure of their behavior by 
the experimenter). Evidence concerning the
validity of these suggestions may be obtained
in future research by the inclusion of items 
on the postexperimental questionnaire de- 
signed to assess the subjects’ interest in and 
concern with the experiment. 

Second, it may be the case that the sub- 
jects in the hot condition reduced the strength 
of their attacks against the victim because 
they perceived this individual as having 
shared with them the unpleasant experience 
of participating in the study under uncom- 
fortable environmental conditions. Previous 
evidence (Latané, Eckman, & Joy, 1966) 
suggests that the perception of shared suf- 
fering with another may lead to increased 
liking, and presumably decreased aggression, 
toward this individual. Although this interpre- 
tation is somewhat weakened by the fact that 
the subject could not actually see the confed- 
erate during most of the study, it is still ap- 
plicable to the extent that they assumed he 
was experiencing discomfort similar to that 
which they themselves endured. Future re- 
search may investigate this interesting possi- 
bility by varying the extent to which the vic- 
tim appears to share the unpleasant experi- 
ences of the subject in a systematic manner. 
Finally, it is also possible that the subjects
in the hot condition directed weaker attacks 
against the victim than subjects in the cool 
condition because they assumed that his 
physiological state would tend to make the 
shocks extremely painful (e.g., he was pre- 
sumably perspiring profusely in the stifling ex- 
perimental rooms). Information regarding this 
possibility may be obtained in further experi- 
ments by requiring the subjects to rate the 
painfulness of the shocks delivered by various 
buttons on the aggression machine. 

In view of the existence of the alternative 
explanations outlined above, it is clear that 
interpretation of the findings of the present 
study must remain somewhat ambiguous un- 
til the completion of additional research. 

The fact that the findings of the present 
research were obtained under highly re- 
stricted laboratory conditions which differ 
markedly from those of naturalistic social sit- 
uations in which high temperatures have been 
assumed to enhance aggression is also deserving
of some attention. In particular, it is quite
possible that the predicted “long hot summer”
effect may occur under conditions other than 
those employed in the present experiment. 
For example, it may be the case that high 
temperatures facilitate aggression, but only 
when individuals have spent relatively long 
periods of time in such environments. Simi- 
larly, it is possible that high temperatures fa- 
cilitate aggression, but only under conditions 
where aggressors have been subjected to 
stronger anger arousal than was produced in 
the present study. Third, as suggested by 
data collected by Goranson and King (1970), 
high temperatures may facilitate aggressive 
outbursts, but only when they follow a period 
of relatively cool and comfortable conditions. 
Finally, it may be the case that the relation- 
ship between temperature and aggression is 
curvilinear so that violence is inhibited by 
very high temperatures such as those in the 
hot condition of the present experiment, but 
actually facilitated by somewhat lower tem- 
peratures. The investigation of these sug- 
gestions in future research will provide more 
definitive evidence concerning the effects of 
ambient temperature on the performance of 
aggressive acts by adult human beings. 

Despite the limitations described above, the 
fact remains that in the present research, high 
temperatures tended to inhibit aggressive be- 
havior. Thus, it appears that the presence of 
uncomfortable levels of heat does not by 
itself account for the high incidence of civil 
disorders during the hot summer months. In- 
stead, it may be tentatively suggested that 
other conditions also present at this time of 
year (e.g., more people out on the streets,
longer daylight hours, the presence of many 
teenagers who are home from school) are 
more directly responsible for the tragic “long 
hot summers” of recent years. 
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An experimental study was made of a relationship in which each of many 
persons faces a choice between two actions, one of which is to his own short-run
advantage and the other of which is in the common interest. The more
persons who choose the latter action, the better off they all are. Two factors 
were varied: (a) degree of individual interest served by the first action and 
(b) the degree of common interest served by the second. Choices were made 
in public in groups of approximately 13 subjects. The frequency of the first 
(cooperative) choice was found to decrease with an increase in degree of 
individual interest and, to a marginal degree, to increase with an increase in 
degree of common interest. The correlates and implications of the participants’ 
understanding of this type of social interdependence are discussed. The experimental
task is proposed as a paradigm for a broad range of problems entailing
socially responsible behavior.


A prominent feature of modern social life 
is that every individual is interdependent with 
large numbers of other people. This is true 
in such diverse domains as automobile traffic, 
environmental pollution, and tax payments. 
In all of these various situations, the person 
takes actions that affect others as well as 
himself, and at the same time, his own satis- 
faction and well-being depends in large part 
on similar actions taken by large numbers of 
other persons. Such interdependence exists, 
for example, in regard to the individual’s 
maintenance of safety equipment on his auto- 
mobile, the carefulness of his driving, his 
thoughtfulness in regard to littering, and his 
honesty in reporting his taxable income. 

This type of social interdependence is characterized
by three properties. (a) There are
two actions or classes of actions, one of which
may be described as “responsible,” “altruistic,”
“cooperative,” etc., and the other as
“irresponsible,” “selfish,” “competitive,” etc.
(b) The latter type of behavior is less costly
and/or more rewarding for the individual, at
least in the short run. (c) The more people
who act in that manner, the less well off they 


1 The research reported here was conducted under 
National Science Foundation Grant GS-1121X. 

2 Requests for reprints should be sent to Harold 
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3 Fulbright-Hays Fellow at the University of Cali- 
fornia, Los Angeles, 1969-1970. 
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all are, and the more people who act in the 
cooperative or responsible manner, the better 
off they are. Thus, for example, in relation 
to air pollution, it is cheaper and more con- 
venient for each person to let the smog-control 
device on his automobile deteriorate and be- 
come inefficient. However, the more drivers 
taking this line of action, the more they all 
have to suffer the consequences of high 
concentrations of smog. 

To the degree that the individual is aware 
of these aspects of the situation, he faces a 
conflict—the conflict between his own immedi- 
ate individual interests and the interests of 
the total set of people (the common interest), 
himself included. This conflict is not likely 
to be of high salience or importance to him
inasmuch as the consequences involved in
the choice are usually not of great moment.
The advantage he gains from the competitive 
type of behavior is often rather small, and 
the negative impact of his single action on 
the common interest is also usually small. It 
is largely in their collective consequences that 
the competitive actions of large numbers of 
persons have marked detrimental implications 
for the general welfare. Thus, an important
aspect of the interdependence between the
“one and the many” is its asymmetry: collectively,
they have a strong effect on his well-being,
but individually, he has little effect
on theirs. 


CONFLICT IN AN N-PERSON RELATIONSHIP


This type of social relationship presents a 
problem of obvious importance for the social 
psychologist. For theoretical reasons he is in- 
terested in identifying the factors that deter- 
mine the individual’s line of action in this 
situation, and, out of practical concern for 
the quality of life in his society, the social 
psychologist will seek to determine methods 
for increasing the frequency with which indi- 
viduals act in a manner consistent with the 
common interest.

The situation described above can be char- 
acterized somewhat more formally as follows: 
consider N persons, each of whom, on repeated
occasions, has a choice of acting in
Manner C or Manner D, where C is the 
“cooperative” action and D, the “competitive” 
one. (Our choice of symbols for the two 
actions follows the conventions used in re- 
search on the Prisoner’s Dilemma (PD) game 
as described, e.g., by Rapoport & Chammah, 
1965.) On each occasion of choice, the pay- 
off each person receives depends on (a) his 
response, the payoff for D always being 
greater than that for C and (b) the number 
of the set of N persons making Response C,
the value of both responses increasing as some 
direct function of the proportion making C. 
The relation between the payoffs for Actions 
C and D and the number of people acting 
in Manner C is of the general form shown 
in Figure 1.⁴ The vertical distance between
the two curves corresponds approximately to 
the person’s individual interest, that is, to 
how much better off he is, regardless of what 
the other persons are doing, if he makes the 
D response. The upward slope of the two 
curves corresponds approximately to the 
common interest, that is, to how much better 
off everyone is if instead of all acting in 
Manner D, they all act in Manner C. (More 
specifically, this latter factor is indicated by 
the slope of a line drawn from Point P to 
Point R in Figure 1.) 

This situation has much in common, psy- 
chologically, with the PD game: the indi- 


4 The payoff curves need not have this specific
shape and relationship. They may be curved lines, 
and they may be more or less convergent and still 
preserve the psychological properties we have speci- 
fied here. It is merely necessary that the D line be 
above and not in intersection with the C line. 
Fig. 1. Payoff curves for N-person
non-constant-sum game. 


vidual’s essential choice is whether to act 
for the common good or to pursue his own 
interest at the expense of others, with the 
latter course involving the risk of common 
harm if others also choose it as their mode 
of behavior. For this reason, in our work 
with this game, we have informally referred 
to it as an N-person generalization of the 
PD game. Its formal similarity to the two- 
person PD game can be seen by reference to 
Figure 1. Let us assume that, rather than 
acting independently, the N persons act as 
two equal teams, the members of each team 
responding in concert on each trial. Then the 
possible outcomes for the members of the two 
teams are as indicated by the labeled points: 
R if both teams make Response C, S for one 
and T for the other if the teams make Re- 
sponses C and D, respectively, and P if both 
teams make Response D. The properties of 
the PD game are present if T > R > P > S
and 2R > T + S > 2P. These conditions are 
fulfilled by the curves in Figure 1. However, 
by drawing some curves of his own, the 
reader will quickly see that these are very 
special conditions which are met only for
certain Ns and when the slopes of the curves
and the gaps between them (reflecting, respectively,
the common interest and the individual
interest) stand in a special relationship
to each other. Thus, the PD game is a special 
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case of the more general nonconstant-sum 
game investigated here. 
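
The two-team reduction described above can be checked numerically. The following is a minimal sketch, assuming straight-line payoff curves in which the C line rises by a fixed `slope` per additional C responder and the D line lies a fixed `gap` above it. The function names, the `base` intercept, and the example values are hypothetical illustrations and are not taken from Figure 1.

```python
def team_outcomes(n, gap, slope, base=0):
    """Return (T, R, P, S) when n persons act as two equal teams.

    Assumption: linear payoff lines. A C responder earns base + slope * k
    when k persons choose C; a D responder earns gap points more.
    """
    if n % 2:
        raise ValueError("two equal teams require an even n")
    half = n // 2

    def c_pay(k):  # payoff to a C responder when k persons choose C
        return base + slope * k

    def d_pay(k):  # payoff to a D responder when k persons choose C
        return c_pay(k) + gap

    R = c_pay(n)     # both teams make Response C
    S = c_pay(half)  # own team makes C, the other team makes D
    T = d_pay(half)  # own team makes D, the other team makes C
    P = d_pay(0)     # both teams make Response D
    return T, R, P, S


def is_prisoners_dilemma(T, R, P, S):
    """Check the two conditions stated in the text."""
    return T > R > P > S and 2 * R > T + S > 2 * P


# Illustrative values only: the same gap and slope give a PD for n = 10
# but not for n = 14.
print(is_prisoners_dilemma(*team_outcomes(10, gap=6, slope=1)))
print(is_prisoners_dilemma(*team_outcomes(14, gap=6, slope=1)))
```

Under this linear assumption the conditions reduce to slope × n/2 < gap < slope × n, which is one way to see why only certain combinations of N, slope, and gap yield a PD game.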

The present paper reports the first experi- 
mental study of this social relationship. In 
a 2 × 2 design, we compared two values of
the individual interest factor and, indepen- 
dently, two values of the common interest 
factor. Our hypothesis was that the number 
of persons making Response C would decrease 
with the first factor and increase with the 
latter one. We were also interested in percep- 
tions and understandings of this type of rela- 
tionship and the relation between such subjec- 
tive factors and the behavior enacted within 
the situation.


METHOD 
Subjects 


One hundred and forty-seven undergraduate stu- 
dents (65 women and 82 men) were recruited from 
the subject pool at the University of California, Los 
Angeles, and served in the experiment as part of a 
requirement for their class in introductory psychol- 
ogy. They assembled in 11 groups, 3 each of sizes 
13, 14, and 15, and 1 each of sizes 10 and 11. All 
groups were assembled without regard for sex, and 
all were heterogeneous in this respect. 


Procedure 


The subjects were seated in a U-shaped pattern 
with the experimenter at the open end, so everyone 
was in plain view of one another and of him. Each 
of the successive games (see below) was defined by
a particular pair of colored cards, for instance, red 
and green. On each trial, at a verbal signal from 
the experimenter, each subject held up one or the 
other card. The experimenter quickly counted the 
number holding up a particular color and, referring 
to the payoff schedule for that game, announced 
aloud two scores, for example, “Red 6 and Green 4.” 
Each subject then recorded his color for that trial 
and the score for that color. 

After this general procedure had been explained to 
the subjects, they were told: 


You are each to try to get as many points for 
yourself as possible. You will learn how to do this 
as we go along, and each time it’s up to you to 
decide what card you wish to hold up. We'll add up 
the scores at the end of the games, and you each 
will be given 1¢ cash for every 100 points you have 
accumulated during the various games. The score
you get each time will depend in part on what
color you yourself hold up and in part on what
colors the others hold up. So you will want to 
look around each time and see what colors other 
people are holding up. 
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After this general statement had been repeated, the 
instructions continued as follows: 


To be more specific, in each game you will find 
that the two colors have different consequences. 
There will be one color that always gives you a 
higher score than the other one. However, the 
more of you who hold up that color, the lower 
both scores will be. [Repeat] So you each have 
the choice on each trial—whether to hold up the 
color that gives yourself the higher score, or to 
hold up the color that keeps everyone’s scores high 
(including your own). [Repeat] The several games 
we play will each be different. That is, when we 
change colors, it will be a new game with new
scores. And the scores for the new pair of colors 
will not be related in any way to the scores for 
previous colors or games. Questions? 


After their questions were answered, the experi- 
menter explained in more detail the steps of the 
procedure on each trial. Considerable discipline and 
uniformity of process were imposed at this point so 
that they would be able to complete a number of 
trials for each game during the experimental hour. 
On each trial, the experimenter said “ready” so they
could make their decisions, “up” as a signal to display
their chosen cards, “down” after he had counted
them, and he then announced the scores for the
two colors.

Finally, the subjects were reminded again that their
goal was to make as many points as possible and
that they would receive 1¢ for every 100 points they
accumulated over all of the games. They were asked 
to pay attention and not to talk or make comments 
during the games. 

Each game was planned for 50 trials plus a vari- 
able number of extra trials to reduce the subjects’ 
ability to anticipate the last trial. The game was 
terminated sooner if for 10 consecutive trials, no 
subject changed the color he held up. At the end 
of the last game, they answered a brief set of ques- 
tions with reference to that particular game. These 
questions were: (1) Did you feel you could control 
your own score in this game? (2) Did you feel you 
had any influence on what colors other people chose? 
(3) Do you feel the best way to get a high score 
was to (a) do as others were doing or (b) be 
different from others? (4) Which color was best in 
terms of your own score? (5) Which color was best 
for everyone? (6) Do you feel your score was controlled
by what others did? (7) Did this situation
seem to be a cooperative or a competitive one? and 
(8) Which color was the moral one to choose? Each 
question was answered on a 7-point scale. Then the
purposes of the experiment were explained, the subjects
were requested not to discuss it with other
psychology students until the experiment could be
finished, and they were paid (either a flat fee of 50¢
or the promised amount equal to 1/100 of their total
scores). 
TABLE 1
THE FOUR PAYOFF SCHEDULES USED FOR GROUPS OF 14 SUBJECTS

Schedule                                            Number responding C
Individual  Common    Response
interest    interest              0   1   2   3   4   5   6   7   8   9  10  11  12  13  14

High        Low       D          15  16  17  18  19  20  21  22  23  24  25  26  27  28  29
                      C           9  10  11  12  13  14  15  16  17  18  19  20  21  22  23

High        High      D           8  10  12  14  16  18  20  22  24  26  28  30  32  34  36
                      C           2   4   6   8  10  12  14  16  18  20  22  24  26  28  30

Low         Low       D          11  12  13  14  15  16  17  18  19  20  21  22  23  24  25
                      C           9  10  11  12  13  14  15  16  17  18  19  20  21  22  23

Low         High      D           4   6   8  10  12  14  16  18  20  22  24  26  28  30  32
                      C           2   4   6   8  10  12  14  16  18  20  22  24  26  28  30

Note.—Entries are the scores each subject receives depending on the schedule, his response, and the number responding C.
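
The entries of Table 1 follow from three rules given in the description of the four payoff schedules: the D score exceeds the C score by the individual interest (2 or 6 points), both scores rise by the common interest (1 or 2 points) for each additional person choosing C, and the C score equals 16 when half the group chooses C. The sketch below regenerates the rows from those rules. The function names are hypothetical, and the handling of odd-sized groups, (N + 1)/2, is an assumption based on a partly illegible passage in the source.

```python
def schedule(n, gap, slope, c_at_half=16):
    """Return (d_row, c_row), indexed by the number k responding C (0..n).

    gap   -- individual interest: D score minus C score at the same k
    slope -- common interest: rise in both scores per additional C responder
    """
    half = (n + 1) // 2  # N/2 for even n; (N + 1)/2 assumed for odd n
    c_row = [c_at_half + slope * (k - half) for k in range(n + 1)]
    d_row = [c + gap for c in c_row]
    return d_row, c_row


def gain_from_choosing_d(d_row, c_row, k):
    """Score a person keeps by choosing D rather than C when k others choose C.

    Choosing C would raise the count of C responders from k to k + 1.
    """
    return d_row[k] - c_row[k + 1]


high_low = schedule(14, gap=6, slope=1)
low_high = schedule(14, gap=2, slope=2)
print(high_low[0])  # D row of the high-low schedule
print(gain_from_choosing_d(*high_low, k=5))
print(gain_from_choosing_d(*low_high, k=5))
```

In the low-high schedule the gain is zero for every k, which matches the remark in the text that the conflict between individual and common interest is not formally present in that schedule.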
Design


Each group played five games, four being members
of a 2 × 2 design defined by two degrees of individual
interest and two degrees of common interest,
and the fifth being one of several “control” schedules.
The latter were included for exploratory purposes
so their results are not reported here. For the
first 10 groups, the order of the five games was
rotated according to Latin [squares so that each
schedule appeared] approximately equally often in each
position.


The Four Payoff Schedules


The individual interest, that is, the amount by
which the scores of the subjects choosing Response D
exceeded the scores of those choosing C, was low
(2 points) for two of the schedules and high (6
points) for the other two. For each of these pairs
of schedules, one involved low common interest
and the other, high common interest. Here, low meant
that the scores for both responses increased 1 point
for each additional person choosing C, and high, that
both scores increased 2 points for each such person.
For all schedules, the score for the C response was
set at 16 if N/2 of the subjects (or ½(N + 1) in the
case of odd-sized groups) made the C response. Thus,
the values for a 14-person group for the four schedules
were as presented in Table 1. By using the
rules and parameters suggested here, the reader can
easily determine what the schedules were for groups
of other sizes. It will be noted that in the low-high
schedule, the individual interest is so small relative
to the common interest that the conflict between
them is not formally present. The individual makes
just as high a score (in absolute terms) with the
C response as with D, regardless of what the others
do. This happens to be an N-person case of mutual
fate control as described by Thibaut and Kelley
(1959).


RESULTS


One major dependent variable is the number
of C responses made in each group for
[each of the four schedules. To]
evaluate these values, it was first necessary
to take account of a possible serial position
effect which inspection of the data suggested
to be present. The inclusion of the control
schedules and the eleventh group made it impossible
to evaluate the positions and schedules
simultaneously. Therefore, we first evaluated
the position effect by converting each of
the four values for each group (average
number of Cs per member) into a deviation
from the appropriate schedule average. Holding
the effect of schedules essentially constant
in this manner, we then performed an analysis
of variance on the deviation scores in a
Position × Group design. This revealed a significant
position effect (F = 5.57, p < .01,
using the Position × Group interaction term
as the estimate of error variance). There
appears to be a regular increase in rate of C
responses over the five positions from an
average of 7.2 for the first game to 15.1 for
the last.

To test the schedule effect, the process was
reversed. Each schedule value was corrected
for its serial position, thus essentially removing
the effects of serial position from the
between-schedule differences. The average
numbers of Cs per person for the four schedules
are summarized in Table 2. The p values


TABLE 2
AVERAGE NUMBER OF C RESPONSES FOR EACH SCHEDULE

                         Schedule                                  p
            High individual      Low individual
               interest             interest        Individual    Common
Trials      Low CI   High CI     Low CI   High CI    interest    interest

First 10     2.01     2.63        2.39     2.61         ns          ns
Last 10      1.43     2.04        2.58     3.18        .01         .20
Total        7.58    10.63       11.98    14.02        .01         .20

Note.—CI = common interest.


reported there are from an Individual Interest
× Common Interest × Groups design in
which the entries were the corrected group
means and the variance between levels of
individual interest (or common interest) was
evaluated using the Individual Interest ×
Groups (or Common Interest × Groups)
interaction as the error term.
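
The two-step correction described above (deviation scores to estimate the position effect, then removal of that effect from each schedule value) can be illustrated with a small sketch. The text does not give the exact arithmetic of the correction, so subtracting the mean deviation for each position is an assumption here. The function names and the toy data are hypothetical and are not the experiment's data; the sketch also omits the analysis of variance itself.

```python
from collections import defaultdict
from statistics import mean


def position_effects(records):
    """Mean deviation from the schedule average at each serial position.

    records: list of (group, position, schedule, cs_per_member) tuples.
    """
    by_schedule = defaultdict(list)
    for _, _, sched, value in records:
        by_schedule[sched].append(value)
    sched_mean = {s: mean(v) for s, v in by_schedule.items()}

    by_position = defaultdict(list)
    for _, pos, sched, value in records:
        by_position[pos].append(value - sched_mean[sched])
    return {p: mean(d) for p, d in by_position.items()}


def corrected_schedule_means(records):
    """Schedule averages after subtracting each value's position effect."""
    effect = position_effects(records)
    by_schedule = defaultdict(list)
    for _, pos, sched, value in records:
        by_schedule[sched].append(value - effect[pos])
    return {s: mean(v) for s, v in by_schedule.items()}


# Hypothetical toy data: two groups, two schedules, counterbalanced order.
toy = [
    ("g1", 1, "A", 8.0), ("g1", 2, "B", 14.0),
    ("g2", 1, "B", 10.0), ("g2", 2, "A", 12.0),
]
print(position_effects(toy))
print(corrected_schedule_means(toy))
```

In the toy data every second-position score is 4 points higher than it would be in first position, and the corrected schedule means differ by the 2 points that separate schedules A and B once position is held constant.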

Our hypothesis was that the C response 
would decrease with heightened individual 
interest and increase with heightened com- 
mon interest. Thus, we expected the lowest 
rate of Cs in the high-low condition, the 
highest rate in the low-high condition, and 
intermediate rates for the other schedules. 
The results in Table 2 confirm these expecta- 
tions. There is little difference among the 
conditions during the first 10 trials when we 
may assume that the subjects spend at least 
some trials learning which response has which 
consequences. However, on the last 10 trials, 
the high-low condition is clearly lowest and 
the low-high, highest in rate of Cs. The 
analysis of variance results summarized in 
Table 2 indicate that the individual interest 
factor had a clearly significant effect, but the 
common interest factor, only a very marginal 
one. It appears that for the particular values 
of the two factors represented in this experi- 
ment, the individual interest factor was the 
more potent determinant of cooperative (or 
competitive) behavior. 

A comparison of the results for the first
and last 10 trials indicates a further effect
of the individual interest factor. With high
individual interest, the number of Cs declined
from the first to the last trials, but for low
individual interest, it increased (difference
significant at p < .05 by analysis of variance
of change scores corrected for serial position).

In order to gain insight into the basis of
cooperative behavior in this experiment, we
compared the answers on the postquestionnaire
for subjects who had exhibited more
or less of this behavior. We divided the subjects
into those with low and high rates of C
responses (10 or fewer versus 11 or more over
the 50 trials), combined subjects over groups,
and performed a Subject Type × Schedule
analysis of variance of each question.

These analyses suggest first that the more
cooperative subjects have a keener sense of
the relation of mutual control existing between
themselves and the others in the experimental
situation. They answered more
strongly in the affirmative Questions 2 (Did
you feel you had any influence on what
colors other people chose?) and 6 (Do you
feel your score was controlled by what the
others did?) (p < .01, in each case).

The second result is that the two types of
subjects answered differently Questions 4
(Which color was best in terms of your own
score?), 5 (Which color was best for everyone?),
and 8 (Which color was the moral one
to choose?). In all cases, the difference was in
the direction of the behavioral difference.
That is, the subjects who had been responding
more frequently with C tended more
strongly to endorse C as the answer to all three
questions (p < .01, p < .10, and p < .01, respectively).
This general pattern is not too
surprising: each person reports the response
he has been using as the better one either for
the reason that he has been using it because
he thought it was better or for the reason
that he now thinks it is better because he
has been using it. However, the fact that
the more cooperative persons report C as best
in terms of their own scores may be an
important fact. In a sense, they may be said
to be reporting “incorrectly” if we assume
they should take “own score” to mean their
own score in the short run. From a long-run
point of view, however, they are reporting 
more correctly than their colleagues. The 
implication of this result may be that these 
cooperative persons tend more to view their 
own score from a long time perspective and 
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see the implication of the game, that in the 
long run their own scores are advanced by
the more frequent choice of the C action.


DISCUSSION 


It must be noted that the general level of 
cooperative behavior in this experiment was 
quite low. As the averages in Table 2 indi- 
cate, even on the last 10 trials with the most 
favorable schedule (low individual interest 
and high common interest), the typical sub- 
ject made the C response only 32% of the 
time. One factor that may account for this 
low rate is inadequate understanding of the 
situation. We were struck by the lack of 
comprehension shown by many subjects in 
their informal postexperimental comments. 
Although the situation does require the sub- 
jects to learn which response has what conse- 
quences, this would not seem particularly 
difficult to accomplish in view of the fact that 
(a) all responses were made publicly and in 
plain view so the distribution of choices be- 
tween the two colors was easy for the subject 
to observe, and (b) the scores for both re- 
sponses were made known to all subjects on 
each trial. On the other hand, the general 
description given by the experimenter was 
quite abstract, and, inasmuch as the number 
and/or pattern of the two responses varied on 
almost every trial, the information the sub- 
ject had from which to infer exactly what 
was happening was rather complex. 

If the generally low level of cooperative 
behavior reflects poor understanding (what- 
ever the reason), then we should be able to 
find an association over the subjects between 
their rate of responding with C and their 
amount of understanding. Three questions on 
the postexperimental questionnaire clearly 
measure the person’s understanding of the 
experiment. The correct answer to each of 
these questions (1. Did you feel you could 
control your own score in this game? 6. Did 
you feel your score was controlled by what 
others did? 5. Which color was best for every- 
one?) was given more often by subjects with 
high rates of C responses (p < .01 and p <
.10, ns, respectively). As noted in the Results
section, they also answered less “correctly” a 
fourth question concerning which color was 
best in terms of their own scores. However, 
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the “correct” response to this question de- 
pends on short versus long time perspective, 
so the result is probably more properly inter- 
preted as reflecting the longer time perspective 
of the more cooperative subjects. Setting 
aside, then, the results for this last question, 
it appears that the more cooperative subjects 
had a somewhat better understanding of the 
situation. 

Of many possible explanations for this 
association between understanding and co- 
operative action, we will consider only two. 
First, the understanding may be responsible
for the higher rate of C responses. This is 
consistent with the serial position effect noted 
in the Results. If understanding provided the 
basis for cooperation, one would think co- 
operation would increase over successive 
games for each group. This is based on the 
reasonable assumption that the several sched- 
ules are similar enough to permit considerable 
transfer of training from one to the next. 
That understanding contributes to coopera- 
tion is also consistent with another covari- 
ation in the data, that between rate of C and 
frequency of shifts in response. The subjects 
with higher rates more often shifted colors 
from one trial to the next (p < .01, for each 
schedule). This shifting may have enabled 
the users of C to learn better the contingencies 
between responses and rewards.⁵ And, of
course, some personality factor such as inter- 
nal locus of control may contribute to inter- 
est in learning how to control one’s outcomes, 
shift of responses to learn the extent of one’s 
control, attention to others’ responses and 
outcomes, and consequent clear understanding 
of the problem. Once understood, the situa- 
tion may elicit cooperative behavior from 
almost all subjects. 

A second, different line of interpretation 
assumes that a cooperative orientation to so- 
cial relationships is at the basis of better 
understanding rather than vice versa. This 
view is that the differences in understanding 
do not reflect differences in learning (the 


5 Although, surprisingly, there was no relation be- 
tween frequency of shifting and answers to Ques- 
tion 1 concerning own control over own score. If 
shifting permits superior learning of anything, it 
should certainly be that a person can control his 
own score.


learning problem may not, after all, be that 
difficult), but rather differences in the pre- 
conceptions or stereotypes about social rela- 
tionships that various subjects bring to the 
situation. Thus, more cooperative persons may 
have more complex preconceptions which also 
tend to be appropriate to the experimental 
situation, whereas less responsible persons 
may have simple and inappropriate precon- 
ceptions. This difference can be derived, per- 
haps, from the differences between cooperative 
and competitive persons identified by Kelley 
and Stahelski (1970), if we assume our co- 
operative persons to be very similar to theirs. 
Their results are that competitively oriented 
persons learn little about individual differ- 
ences in social orientations, believe that most 
others share their own competitive orienta- 
tion, and are not aware of the manner in 
which they themselves influence certain other 
(cooperative) people to behave in a competi- 
tive manner. From this, we might deduce that 
competitively inclined persons will feel little 
conflict about which response to make in the 
present situation. Being confident that others 
will also act competitively, they will be certain 
that the D response is best for themselves 
(consistent with results from Question 4 re- 
garding which color is best for one’s own 
score) and will make that response quite con- 
sistently (as indicated by the lower rate of 
switching colors for our less cooperative sub- 
jects). Furthermore, being unaware of dis- 
crepancies between orientation and behavior 
and of his own influences in his interactions 
with cooperators, the competitive person will 
be less sensitive to possibilities of mutual 
influence in social relationships of this type 
(consistent with results from Questions 2 and 
6 concerning own influence on others’ choices 
and others’ control over own score). 

Our results do not permit us to choose be- 
tween these two interpretations of the relation 
between cooperation and understanding of the 
game. And there are, of course, other plausible 
explanations that might be considered, involv- 
ing such processes as self-justifying reports 
and defensively selective attention. We be- 
lieve a fruitful problem for future research on 
this type of social relationship will concern 
the relation between orientations to social 
relationships, on the one hand, and expecta- 
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tions about others’ behavior and influencibil- 
ity, and learning and understanding of inter- 
dependence patterns, on the other hand. 
More generally, we believe the payoff 

model and experimental paradigm described 
here may be useful tools for the analysis of 
responsible behavior in social relationships 
involving diffuse interdependence between the 
one person and the many. It is hoped that our
model and paradigm will not be used merely 
to generate experiments in which the obvious 
parameters of payoffs, information, communi- 
cation, etc., are studied in all their variations 
and combinations. Rather, the model and 
paradigm should serve, in our view, as aids 
in thinking of various approaches to the situ- 
ation and to planning their related investiga- 
tions. An economic approach immediately 
comes to mind: cooperative behavior can be 
increased by taking the individual profit out 
of the competitive response, as through cre- 
ating taxes for Behavior D or bonuses for Be- 
havior C. However, this involves extra costs, 
such as those of the administrative apparatus 
necessary to insure the necessary payments, 
so the full collective benefits of widespread 
responsible behavior are not realized. In con- 
trast, there are various approaches of a more 
psychological nature, one of which we have 
indicated in the preceding paragraphs. This 
approach has to do with the perception and 
understanding of the type of diffuse inter- 
dependence illustrated by the paradigm. An 
implication of our thinking, admittedly rather 
farfetched, is that better insight into this 
relationship and its possible consequences will 
serve to increase the rate of responsible be- 
havior. A rather different psychological analy- 
sis, one less sanguine about inducing coopera- 
tion through insight, would begin with a close 
scrutiny of individual decisions in this situa- 
tion and of the relevant motives. This analysis 
would take cognizance of the special nature
of the conflict the (fully informed) individual 
typically faces here—the conflict between an 
immediate, specific cost to himself and the 
very small gains for the many anonymous 
others. A still different approach, more social-psychological
in nature, would examine the
development of an ethic for behavior in the 
situation of diffuse interdependence as a gen- 
eralization from similar interdependence in 


small face-to-face groups. Experience with the 
consequences of wide variations in rate of 
cooperation and with the benefits of effective 
reciprocity norms in small groups may afford 
both the cognitive and motivational founda- 
tions necessary for responsible action in 
relation to larger collectivities. 

The foregoing comments are not intended 
to outline all possible approaches to this situ- 
ation but only to point to some salient pos- 
sibilities. The experimental paradigm deals 
with an important class of little-studied social 
relationships. As has been noted (Crowe, 
1969; Hardin, 1968), these constitute the 
modern equivalents of the ancient problem 
of the “commons,” that is, the problem of 
responsible use of common and limited re- 
sources, be they pasture, space, air, water, 
scenery, or whatever. Our purpose here has
been to direct psychologists’ attention to this 
problem and to propose one way of thinking 
about it. 
INCREASING PAIN TOLERANCE BY INFORMATION
ABOUT AVERSIVE STIMULI


ERVIN STAUB AND DEBORAH S. KELLETT


Harvard University 


Subjects received a series of aversive stimuli (shocks) gradually increasing in 
intensity. The influence of two types of information on the perceived intensity 
and the limit of endurance for the shocks was investigated using a 2 × 2 design:
information about sensations produced by the shocks and information about 
the apparatus and the manner of transfer of electricity (the latter emphasizing 
safety features). Significant interactions were found; subjects who received 
both kinds of information received more intensive shocks before evaluating 
the shocks as painful and endured more intense shocks than subjects in the 
other three groups. Responses to questionnaire items indicated that in combina-
tion, the two kinds of information enhanced each other’s perceived trustworthi-
ness, or usefulness, or both. While the apparatus information minimized the
degree of threat or danger, only the sensation information could be validated 
by the subjects’ own experience; both are probably essential for information
to be effective.


Can information about noxious or feared 
stimuli influence reactions to them? A variety 
of experiments have shown that information 
about the time of onset of aversive experi- 
ences is preferred to no information by both 
animals (Knapp, Krause, & Perkins, 1959; 
Lockard, 1963) and man (Badia, McBane, 
Suter, & Lewis, 1966; Jones, Bentler, & Petry, 
1966; Lanzetta & Driscoll, 1966), even when 
it has no instrumental value. Furthermore, 
predictability was found to decrease the dis- 
rupting effect of loud noise (Glass, Singer, & 
Friedman, 1969) and the perceived aversive- 
ness of electric shocks (Staub, Tursky, & 
Schwartz, 1971). The ability to predict on- 
coming events, particularly aversive ones, may 
come to be reinforcing or “valued” because 
of past association between predictability and 
the ability to control events. 

The influence of information other than 
time of onset on reactions to fear-provoking 
or aversive stimuli—information about the 
nature or characteristics of the stimuli (e.g., 
their intensity and physical properties), the
degree to which they may have physically 
damaging effects, or the sensations and feel- 
ings they are likely to induce—has not been 
extensively explored. A number of writers 
have suggested that the kind of emotional 
reaction produced by a stimulus depends on 
how it is interpreted (Schachter & Singer, 
1962) or assessed (Arnold, 1960; Lazarus, 
1966). Under certain conditions, information 
about the nature of the stimulus may affect 
its assessment and, in turn, its impact on 
people. Lazarus and his associates (Lazarus 
& Alfert, 1964; Lazarus, Opton, Nomikos, & 
Rankin, 1965; Speisman, Lazarus, Mordkoff, 
& Davison, 1964) found that vicarious stress 
reactions produced by observation of films 
that depicted other persons experiencing harm 
or suffering were affected by verbal communi- 
cations. These stress reactions, as measured 
by physiological responses and verbal reports, 
were reduced by information which suggested 
that an operation that was part of a primitive 
initiation ceremony which subjects watched 
on film did not produce harm (Lazarus & 
Alfert, 1964; Speisman et al., 1964) or that 
injury suffered by someone on the film was 
enacted rather than real (Lazarus et al., 
1965), and they were enhanced by informa- 
tion that emphasized the harm inflicted by 
the observed operation (Speisman et al., 
1964). 

Information is generally not expected, cer-
tainly not by itself, to affect reactions to
persistent fears. Blanchard (1970) examined 
the relative contribution of information to 
the effectiveness of “participant modeling” 
(Bandura, Blanchard, & Ritter, 1969) or 
“contact desensitization” (Ritter, 1968) in 
reducing fear or avoidance. In one experi- 
mental group, subjects observed a model; in 
another one, they observed a model and also 
listened to information about the character- 
istics of the feared stimulus; and in a third 
group, they did both and were also guided 
in their approach to the feared stimulus, a 
snake. Blanchard reported that information 
did not decrease fear or avoidance. In another 
experiment (Staub, 1968), subjects afraid of 
snakes were asked to approach a snake gradu- 
ally, controlling their own speed of approach. 
Information about the feared stimulus and 
about the characteristics of the fear itself 
enhanced the speed of approach to the snake 
and decreased subsequent avoidance behav- 
ior. The information that subjects received 
was more extensive in this experiment than 
in the Blanchard study. Moreover, as part 
of the experimental treatment, over a period 
of time, subjects approached the snakes them- 
selves, while in the Blanchard study the 
crucial condition for the evaluation of the 
effects of information was one in which the 
observation of the model was combined with 
information; that is, subjects observed an- 
other person approach the snake, but they 
themselves did not do so. Information may 
facilitate the reduction of fear if it is com- 
bined with direct exposure to the feared 
stimulus, in the course of which a person can 
validate the information (Staub, 1968) and 
can integrate it with his own personal experi- 
ence with the feared stimulus, thus leading 
to reappraisal. 

Little is known of the effect of informa-
tion about the characteristics of noxious or
painful stimuli on reactions to such stimuli. 
In one experiment where such information had 
been provided, subjects in all experimental 
conditions received the information (Bowers, 
1968). In another experiment, variation in 
the expectations that were created about the 
intensity of an aversive stimulus, a noise, 
affected subsequent physiological reactions to 
both a neutral stimulus that served as a signal
for the noise and to the noise itself (Epstein 
& Clarke, 1970). In the present experiment, 
two kinds of information were provided to 
subjects, and their effects on subsequent 
evaluation of the aversiveness of electric 
shocks and on endurance of shocks were in- 
vestigated. One kind of information stressed 
the factual or objective aspects of receiving 
shocks. It described the shock apparatus, em- 
phasizing its safety features, described the 
manner in which electricity gets conducted, 
and generally indicated the limited impact 
of the electricity on the subjects. By mini- 
mizing danger and emphasizing the relatively 
small impact of shocks, this information was 
expected to decrease the perceived aversive- 
ness of the shocks. Another kind of informa- 
tion described the sensations usually experi- 
enced due to electric shocks, their subjective 
consequences. Such information may enhance 
the predictability of the experiences resulting 
from the shocks and thereby decrease their 
perceived aversiveness. On the other hand, by 
focusing attention on their experiences, it 
may enhance subjects’ sensitivity to pain 
(Kanfer & Goldfoot, 1966; Nisbett & Schach- 
ter, 1966). Therefore, no hypothesis was made 
about the effect of this information. The 
combined effects of the two kinds of informa- 
tion on reactions to the shocks, whether they 
would interact in some manner or have addi-
tive effects, were left as an empirical question.


METHOD 
Overview 


The experiment had a 2 × 2 design. Subjects either
received information about the apparatus or they
did not; they either received information about 
sensations or they did not. A series of electric 
shocks of gradually increasing intensity was admin- 
istered to all subjects. Subjects were asked to indi- 
cate when they first felt the shock, when the shock 
was uncomfortable, when it was painful, and when 
they were unwilling to endure any more. After- 
wards, subjects responded to questions about their 
reactions to the shocks and to the information they 
received. 


Subjects 


Forty-three graduate and undergraduate males at 
Harvard University participated in the study. They 
were recruited through advertisement and were paid 
$2 for participation. None of the subjects had
participated before in an experiment using electric 
shocks. 


Apparatus 


The shock delivery apparatus and shock electrode 
were essentially identical to those described by 
Tursky and Watson (1964), and Tursky, Watson, 
and O’Connell (1965). The apparatus delivered a 
low amperage alternating current (60 cycles per 
second sinusoidal current) of up to 20 milliamperes
(ma.), 5,000 ohms impedance, in .5-ma. steps through 
a concentric electrode attached to the dorsal surface 
of the left forearm. A ground was placed approxi-
mately 4-6 inches above the shock electrode.


Procedure 


At the time they signed up for the experiment, 
subjects rated their pain tolerance; they were as- 
signed to the experimental groups matched on this 
basis. Subjects participated individually and were 
met and instructed by a male experimenter. After 
entering the experimental room, subjects were seated 
and told that the study dealt with obtaining data 
on how people experience electrical stimulation. They 
were shown the electrodes, and the purpose of each 
was briefly explained as they were attached. Subjects 
were told that the experimenter would deliver 
shocks from the adjoining room, faintly visible 
through a one-way mirror. Speakers connected the 
two rooms. Experimental treatments were introduced 
after the electrodes were attached by giving subjects 
mimeographed sheets containing appropriate infor- 
mation. They were told: 


Since we’d like people in this study to know
pretty much what is going on, would you please 
read this (these) carefully, while I’m setting some 
things up in the other room? Let me know when 
you are through reading it (them). 


Subjects in the sensation information group received 
a page of information described therein as a general 
summary of “commonly reported” sensations arising 
from exposure to electrical stimulation. The progres- 
sion of sensations directly related to the increase in 
electrical intensity of the shocks was described 
(“slight tingle,” etc.). General arousal reactions, such 
as mild tremor of the stimulated arm, accelerated 
heart rate, and respiration were also described 
(Nisbett & Schachter, 1966). The apparatus informa- 
tion group received a mimeographed sheet titled 
“Safety Features of Shock Apparatus, Laboratory, 
14th Floor,” which described the safety features of 
the shock and ground electrodes and the function 
of a transformer in the experimental room. Emphasis 
was placed on the fact that current did not enter 


3 Three subjects were eliminated; two subjects mis- 
understood the instructions and failed to make 
accurate self-reports; in one case, there were experi- 
menter errors in delivering the shocks. 
the body but traveled across the skin between the
electrode poles, that even if current did escape, it 
would have to exit via the ground electrode, and 
that the transformer eliminated all possible stray 
harmful currents. Subjects were also informed of 
the presence of a second transformer, so that insula- 
tion was virtually complete. Subjects in the sensation 
and apparatus information condition received both 
sheets, and control subjects got neither sheet. 
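
The assignment of information sheets to the four groups can be sketched as a small lookup. This is only an illustration of the 2 × 2 layout described above: the variable and function names are hypothetical, the group abbreviations are borrowed from the note to Table 2, and the subject counts are taken from the Subjects section and Footnote 3. It also checks that 43 recruited subjects minus 3 eliminated leaves 10 per group, and that the error degrees of freedom come to the 36 reported in the Results.

```python
from itertools import product

# The two binary factors of the 2 x 2 design.
FACTORS = ("apparatus", "sensation")

# Group abbreviations as used in the note to Table 2.
LABELS = {
    (True, True): "AISI",
    (False, True): "SI",
    (True, False): "AI",
    (False, False): "NI",
}


def build_design():
    """Map each group label to the list of information sheets it received."""
    design = {}
    for apparatus, sensation in product((True, False), repeat=2):
        sheets = [
            name
            for name, given in zip(FACTORS, (apparatus, sensation))
            if given
        ]
        design[LABELS[(apparatus, sensation)]] = sheets
    return design


RECRUITED = 43    # subjects who participated
ELIMINATED = 3    # subjects dropped (Footnote 3)
N_PER_GROUP = 10  # reported cell size

design = build_design()
n_total = RECRUITED - ELIMINATED
error_df = n_total - len(design)  # within-cell degrees of freedom

for label, sheets in design.items():
    print(label, sheets if sheets else ["(no sheet)"])
print("subjects analyzed:", n_total, "| error df:", error_df)
```

The control group maps to an empty list because those subjects received neither sheet.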

After they read the information, the administra- 
tion of shocks began. Subjects were asked to report 
when they first felt something. When they did so, 
they were asked to report when the shock “first 
becomes uncomfortable.” After that, they were asked 
to report when they perceived the shocks as painful. 
The final instructions were: 


Okay, now let’s see how much shock you can
stand. I don’t want you to say anything more
until we reach a shock level that is your absolute 
limit, the most shock you can tolerate, then say, 
“I prefer to stop.” 


Levels were coded F (Feel), U (Uncomfortable),
P (Painful), and E (Endurance).

Shocks were delivered at 20-second intervals, and
each shock lasted 1.0 second. All subjects were begun
at .5 ma., and the shock level was raised by .5 ma.
every 20 seconds until endurance level was reached.

After endurance was reached, each subject was
asked to complete a questionnaire on which they
rated their reactions to the study and the informa-
tion and marked a checklist of sensations experienced
during the shock series. Ratings were made on a
4-point scale ranging from “not at all” to “very
much” (scored 1-4). The purpose of the experiment
was then explained to the subjects.
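
The stepping rule in the procedure (start at .5 ma., raise by .5 ma. every 20 seconds, apparatus limit of 20 ma.) can be laid out as a simple schedule. The sketch below is illustrative only: the function names are hypothetical, and it assumes the first shock is delivered at time zero, which the procedure does not state. In the experiment the series stopped at each subject's endurance level, so the full schedule is an upper bound.

```python
def shock_schedule(start_ma=0.5, step_ma=0.5, interval_s=20, max_ma=20.0):
    """Return (onset in seconds, intensity in ma.) for every possible shock."""
    n_shocks = int(round((max_ma - start_ma) / step_ma)) + 1
    return [(i * interval_s, start_ma + i * step_ma) for i in range(n_shocks)]


def onset_of_level(level_ma, schedule):
    """Onset time of the first shock at or above level_ma, or None."""
    for onset_s, intensity_ma in schedule:
        if intensity_ma >= level_ma:
            return onset_s
    return None


schedule = shock_schedule()
print("possible shocks:", len(schedule))
print("first and last:", schedule[0], schedule[-1])
# A subject whose endurance level was 11.5 ma. would have stopped here:
print("onset of 11.5 ma. shock (s):", onset_of_level(11.5, schedule))
```

Because every intensity is a multiple of .5 ma., a group mean such as 11.30 ma. falls between two delivered levels; `onset_of_level` returns the first delivered shock at or above the requested value.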


RESULTS 


Mean tolerance levels (feel, uncomfortable,
painful, and endurance) are shown in Table 1,
with n = 10 in each group. Two-way analyses
of variance were performed to evaluate the 
effects of the two kinds of information. None 
of the effects reached or approached signifi- 
cance for the sensation (feel) and uncom- 
fortable levels. The main effects for the two 
kinds of information were also negligible at
the shock level that subjects evaluated as
painful and at the level of endurance. How- 
ever, at both of these levels the two types of 
information significantly interacted in deter- 
mining the average shock intensities (F = 
4.48, df = 1/36, p < .05, painful; F = 5.17, 
df = 1/36, p < .05, endurance). t tests evalu- 
ating the differences in the average shock level 
of the four groups showed that subjects who 
received both kinds of information accepted 
significantly more shocks than subjects in any
of the other three groups before they reported
that the shocks were painful (p < .05 for all
three comparisons),4 while the other three
groups did not differ from each other. t tests
also showed a similar difference at the en-
durance level; subjects who received both
kinds of information endured more intense
shock than subjects in any of the three other
groups (t = 1.97, df = 36, p < .07, for the
comparison with the no-information group;
t = 2.65, df = 36, p < .02, for the compari-
son with the sensation information group;
and t = 2.53, df = 36, p < .02, in comparison
with the apparatus information group). Again,
differences among the remaining three groups 
were negligible. 
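
The interaction reported above can be illustrated from the cell means in Table 1. The sketch below is a rough consistency check, not the authors' analysis: it assumes the t tests used the pooled error term (suggested by df = 36), backs out the standard error of a two-group difference from the three reported endurance t values, and uses it to approximate the interaction F. All names are hypothetical, and the means are entered as printed.

```python
import math

# Cell means from Table 1 (milliamperes), n = 10 per group.
ENDURANCE = {"AISI": 11.30, "SI": 7.85, "AI": 8.00, "NI": 8.75}
PAINFUL = {"AISI": 8.70, "SI": 6.20, "AI": 6.39, "NI": 6.10}

# Reported t values (df = 36) for the AISI group against each other
# group at the endurance level.
REPORTED_T = {"NI": 1.97, "SI": 2.65, "AI": 2.53}


def interaction_contrast(cell):
    """Apparatus effect with sensation information minus the effect without."""
    return (cell["AISI"] - cell["SI"]) - (cell["AI"] - cell["NI"])


def implied_se_of_difference(cell, reported_t):
    """Mean standard error of a two-group difference implied by the t values."""
    ses = [(cell["AISI"] - cell[g]) / t for g, t in reported_t.items()]
    return sum(ses) / len(ses)


def approximate_interaction_f(cell, reported_t):
    """Approximate 1-df interaction F from the implied error term."""
    se_diff = implied_se_of_difference(cell, reported_t)
    # With equal n, a contrast over four means has twice the variance of a
    # difference between two means, so its standard error is sqrt(2) larger.
    se_contrast = se_diff * math.sqrt(2)
    return (interaction_contrast(cell) / se_contrast) ** 2


print("endurance contrast (ma.):", round(interaction_contrast(ENDURANCE), 2))
print("painful contrast (ma.):", round(interaction_contrast(PAINFUL), 2))
print("approximate endurance F:",
      round(approximate_interaction_f(ENDURANCE, REPORTED_T), 2))
```

Under these assumptions the approximate endurance F comes out close to the reported F = 5.17; the small gap is what one would expect from rounding in the published t values. No t values are reported for the painful level, so only the contrast is computed there.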

Differences between groups in responding 
to the questionnaire were also examined by 
analyses of variance or t tests, whichever was
appropriate. No differences were found in sub- 
jects’ evaluations of the sensations they ex- 
perienced. The other questions, and average 
ratings by subjects in the four experimental 
groups are shown in Table 2. A two-way 
analysis of variance showed that both ap- 
paratus information (F = 10.3, df = 1/36, 
p < .01) and sensation information (F =
5.5, df = 1/36, p < .05) reduced worry about 
the effects of shocks. Subjects who received 
no information were significantly more wor- 
ried about the effects of shock than subjects 
in any of the other groups (p < .02 in all
three comparisons, with no differences among
the other three groups). Subjects who received 


4 All reported p values of t tests are based on two- 
tailed tests. 


TABLE 1
MEAN TOLERANCE LEVELS IN MILLIAMPERES

                                      Group
                Apparatus and
                sensation      Sensation      Apparatus      No
Level           information    information    information    information
Feel             1.25            .80           1.05           1.05
Uncomfortable    6.95           5.10           5.2            4.55
Painful          8.70           6.20           6.39           6.10
Endurance       11.30           7.85           8.00           8.75


Note.—n = 10 in each group.
TABLE 2
POSTEXPERIMENTAL QUESTIONNAIRE ITEMS AND MEAN RATINGS

                                                  Group
Question                                  AISI    SI      AI      NI
1. How frightened are you about
   receiving electric shock in an
   experiment?                            1.7     1.8     1.7     2.1
2. How worried are you about the
   effects of electric shock?             1:23) 1:53) 135) 23
3. How painful did you find the
   last shock you had?                    3.2     3.3     3.0     2.9
4. How useful did you find the
   information about the sensations
   you might expect to experience?        3.3     2.4
5. How trustworthy did you feel
   this information was?                  Spite hae!
6. Would you recommend we continue
   to provide such information?           Viera wh)
7. How useful did you find the
   information about the apparatus?       3               3.2
8. How trustworthy did you feel
   this apparatus information was?        3.8             3.3
9. Would you recommend we continue
   to provide such information?           3.7             3.4


Note.—AISI = apparatus information and sensation infor-
mation; SI = sensation information; AI = apparatus infor-
mation; NI = no information.


both sensation and apparatus information 
rated the information about sensations as 
both significantly more useful (t = 6.62, df =
18, p < .01) and more trustworthy (t = 2.5,
df = 18, p < .05) than did the sensation in- 
formation group and rated the information 
about the apparatus as significantly more 
trustworthy (t = 2.67, df = 18, p < .02) than 
did the apparatus information group. 


DISCUSSION


The two kinds of information together 
resulted in subjects evaluating more intense 
shocks as painful and enduring more shocks 
than those in the other three groups. This 
did not seem to be due to the greater quantity 
of information; the effects of the two kinds 
of information were interactive, rather than 
additive. 

In order for a verbal or written communica- 
tion to influence the subjects’ reactions to 
aversive stimuli, it may have to contain
information that both minimizes the degree 
of threat, pain, or danger that is to be 
expected and that can be validated by the 
subjects’ experience so that it will be believed. 
The sensation information contained relatively 
little information that would reduce perceived 
threat or danger, but it was possible to 
verify its correctness by subjects comparing 
the sensations they experienced with those 
they were told they would experience. On the 
other hand, the apparatus information at- 
tempted to minimize the degree of threat, 
danger, and discomfort, but the subjects 
could not confirm any of the information by 
their own experience. When the two types of 
information were combined, the apparent va- 
lidity of the sensation information may have 
increased the credibility of the apparatus in-
formation. The responses to the questionnaire
items provide support for this interpretation. 
The sensation information was perceived as 
more useful in combination with the appa- 
ratus information than alone. Moreover, when 
both kinds of information were received, each 
was perceived as more trustworthy than the 
same information by itself. The greater per- 
ceived trustworthiness of the sensation infor- 
mation in combination with the apparatus 
information may be due to a halo effect, to 
the generally greater perceived value of the 
information. 

Previous research findings have shown that 
cognitive factors may influence reactions to 
aversive stimuli. For example, arousing dis- 
sonance about accepting shocks (Brock & 
Buss, 1962; Zimbardo, Cohen, Wisenberg, 
Dworkin, & Firestone, 1966) by giving sub- 
jects a choice of continuing participation or 
not (Zimbardo et al., 1966), and creating 
conditions which resulted in the misattribu- 
tion of the sensations due to the shocks to 
origins that would not produce pain (Nisbett 
& Schachter, 1966), both enhanced tolerance 
for shocks. Information that presumably af- 
fected expectations about the intensity of a 
noxious stimulus, a noise, affected subse- 
quent physiological reactions to it (Epstein & 
Clarke, 1970). The present findings further 
suggest the influence of cognitive factors on 
reactions to aversive stimuli by showing that 
information about the characteristics of aver-
sive stimuli and their effects may enhance
tolerance for them. The information that sub-
jects received may have affected their expecta- 
tions of how aversive shocks will be, or may 
have affected their assessment of the shocks 
directly, or both. Further research is needed 
to explore the conditions under which infor- 
mation has such effects. Direct experience 
with the aversive stimuli to which infor- 
mation refers and the ability to validate 
at least part of the information may be neces- 
sary for, or at least facilitative of, the effec- 
tiveness of information. Knowledge about 
the usefulness of information in enhancing 
tolerance for aversive stimulation may be of 
practical importance, for example, in psycho- 
therapy. 
COGNITIVE AND ATTENTIONAL MECHANISMS
IN DELAY OF GRATIFICATION 1


WALTER MISCHEL,2 EBBE B. EBBESEN, AND ANTONETTE RASKOFF ZEISS
Stanford University 


Three experiments investigated attentional and cognitive mechanisms in delay 
of gratification. In each study preschool children could obtain a less preferred
reward immediately or continue waiting indefinitely for a more preferred but 
delayed reward. Experiment I compared the effects of external and cognitive 
distraction from the reward objects on the length of time which preschool 
children waited for the preferred delayed reward before forfeiting it for the 
sake of the less preferred immediate one. In accord with predictions from an 
extension of frustrative nonreward theory, children waited much longer for a 
preferred reward when they were distracted from the rewards than when they 
attended to them directly. Experiment II demonstrated that only certain 
cognitive events (thinking “fun things”) served as effective ideational dis-
tractors. Thinking “sad thoughts” produced short delay times, as did thinking
about the rewards themselves. In Experiment III the delayed rewards were 
not physically available for direct attention during the delay period, and the 
children’s attention to them cognitively was manipulated by prior instructions. 
While the children waited, cognitions about the rewards significantly reduced, 
rather than enhanced, the length of their delay of gratification. Overall, atten- 
tional and cognitive mechanisms which enhanced the salience of the rewards 
shortened the length of voluntary delay, while distractions from the rewards, 
overtly or cognitively, facilitated delay. The results permit a reinterpretation of 
basic mechanisms in voluntary delay of gratification and self-control. 


As early as 1890, William James contended 
that attentional processes are at the very core 
of the self-control phenomena usually sub- 
sumed under the term “will” or, since James’s 
time, under the concept “ego strength.” 
According to James (1890): “Attention with 
effort is all that any case of volition implies. 
The essential achievement of will is to attend 
to a difficult object . . . [p. 549].” In con- 
trast, psychoanalytic theories of self-control 
emphasize unconscious processes and motiva- 
tional dynamics, as well as internalization of 
values and intrapsychic conflicts to explain 
self-control phenomena. In spite of this 
shift in emphasis away from attentional to 
psychodynamic interpretations of self-control, 
some strands of evidence suggest possible 


1 This research was supported by Research Grant 
MH-6830 from the National Institutes of Health, 
United States Public Health Service. Portions of this 
paper were presented by the first author in an
invited address, “Personality and Cognition,” at the 
meeting of the Western Psychological Association, 
Los Angeles, April 1970. 

2 Requests for reprints should be sent to Walter 
Mischel, Department of Psychology, Stanford Uni- 
versity, Stanford, California 94305. 


links between attentional processes and self- 
regulatory mechanisms. 

In particular, beginning with Hartshorne 
and May (1928), a few correlations have 
been found between indexes of moral behavior 
and measures of attention or resistance to 
distraction on mental tests (e.g., Grim, Kohl- 
berg, & White, 1968). On the basis of such 
correlations, it has been suggested that the 
individual’s ability to resist temptation may 
be facilitated by how well he attends to a 
task. In most experimental “resistance to
temptation” paradigms, yielding to tempta- 
tions, such as cheating, depends on the sub- 
ject’s being distracted from the main task to 
which he is supposed to be attending. In those 
situations a subject’s ability to resist distrac-
tion may automatically make it easier for
him to refrain from temptations such as 
cheating (Grim et al., 1968). 

Using experimental rather than correla- 
tional methods, Mischel and Ebbesen (1970) 
have explored a different link between atten- 
tion and self-control in the context of the 
delay-of-gratification paradigm. In that study, 
preschool children sat waiting for a preferred 
reward which was available at a later time.
At any moment the children could signal to 
terminate the waiting, thereby obtaining a 
less preferred reward, while forfeiting the 
more desired one. The experiment investigated 
the effects of attention to the goal objects on 
the length of time that the children actually 
waited. Specifically, while the children waited, 
they were given the opportunity to attend to 
the delayed and/or to the immediately avail-
able reward or to neither reward.

In accord with several previous theories 
discussed by Mischel and Ebbesen (1970), 
it had been expected that attention to the 
delayed reward would facilitate delay of grati- 
fication. In part it was expected that making 
the reward objects salient might facilitate 
“time binding” by permitting the subject to 
engage in self-persuasion and anticipatory 
gratification. For example, he might sustain 
his delay by imagining how satisfying the 
preferred outcome would be (e.g., how good 
it would taste) when it became available. In
fact, the findings obtained by Mischel and
Ebbesen were exactly opposite to the pre- 
dictions. It was found that if the child could 
attend to either or both of the rewards, he 
waited much less than if he could attend to 
neither reward during the delay period. In 
addition, the length of time which the chil- 
dren waited with only the immediate reward 
available for attention was similar to the 
time they waited with only the delayed reward 
available. Finally, when both rewards were 
available, children waited a slightly shorter 
time. Thus, children waited most readily
when neither the delayed nor the immediate 
reward was available for attention during the 
delay period, and they waited a relatively 
short time when any reward was available. 
Observation of the children’s spontaneous 
behavior during delay of gratification sug- 
gested that the mechanism used by many 
youngsters to sustain their voluntary delay 
involved suppressing rather than enhancing 
attention to the rewards. The children seemed 
to reduce the subjective aversiveness of delay 
of reward by engaging in covert and overt 
distracting responses such as staring at the 
mirror, covering their eyes with their hands, 
and talking to themselves. These responses 
seemed to divert their attention away from 
the frustration-inducing rewards. Further-
more, self-induced distractions seemed to be
fairly easy to maintain in the condition in 
which no rewards were facing the children, 
but seemed very difficult to maintain when 
any of the rewards were facing the children. 
Apparently, the children were able to delay 
longer when neither reward was available for 
attention because in this condition it was 
easier for them to avoid or suppress cognitions 
about the rewards. 

Post hoc, the obtained results become most 
understandable if delay of gratification is seen 
as a frustration situation. Indeed, the essence 
of frustration is a delay or interruption in 
an expected and desired outcome (Mandler, 
1964). The necessity to delay in order to 
obtain a more gratifying reward may be deter- 
mined externally by physical barriers or other 
people. In self-control and voluntary delay of 
reward, this delay is self-imposed. If a person 
desires the delayed outcome, he must impose 
the frustrative waiting situation upon himself, 
foregoing the immediately available outcome 
for the sake of the more desirable but delayed 
alternative. 

As Amsel’s (1958, 1962) “frustrative non- 
reward theory” has suggested, frustration in- 
volves an actively aversive effect. It follows 
that any conditions that enhance the aversive- 
ness of frustration should make it harder to 
wait. Although Amsel’s work with frustration 
effects in animals has concentrated on the 
scheduling of prior reinforcement, it seems 
reasonable that any cues that enhance atten- 
tion to what one wants but cannot have 
should increase the aversiveness of frustra- 
tion. This interpretation of the frustration 
effect suggests that attending to the rewards 
cognitively, rather than helping to bridge the 
delay period, may make it more aversive 
to delay gratification, and therefore lead to 
shorter waiting time. The present three 
experiments were designed to test this 
proposition. 

If our speculations regarding the frustrative 
effects of delay are correct, then delay of 
gratification should be enhanced when the 
subject can readily transform the aversive 
waiting period into a more pleasant non- 
waiting situation. This line of reasoning sug- 
gests that voluntary delay of reward should 
be enhanced by any overt or covert activities
that serve as distractors from the rewards. 
Through self-distraction, the subject should 
be able to suppress or avoid the aversiveness 
of wanting gratification but not having it, 
and thus convert the frustrative delay-of- 
reward situation into a psychologically less 
aversive condition. Therefore, it was expected 
that overt activities and internal cognitions 
and fantasy which could help the subject to 
distract himself from the rewards would in- 
crease the length of time which he would 
delay gratification. 

To investigate the role of attention and 
cognition in delay of gratification, we gave 
each subject either an overt activity, a cogni- 
tive activity, or no activity to engage in 
during the delay period. The overt and cogni- 
tive activities were designed to reduce the 
probability that the subjects would be at- 
tending to the reward during the delay period. 
We then assessed how these alternatives af- 
fected the subjects’ voluntary delay time, in 
comparison to the group of subjects which was 
not supplied with a distractor. Consistent 
with our extension of frustrative nonreward 
theory, we predicted that voluntary delay of 
gratification should be increased by any 
covert or overt activities that distract the 
subject from the anticipated outcomes and, 
conversely, should be diminished by attention 
to the rewards during the delay period. 


EXPERIMENT I 


To test our expectations, a study was de- 
signed with a delay-of-gratification paradigm 
in which aversive frustration was deliberately 
made high. For this purpose, preschool sub- 
jects were faced with both the immediately 
available and the more preferred but delayed 
reward during the delay period.3 All children


3 This attentional condition was used because it 
presumably would provide maximum frustration 
cues. When the subject attends to the immediate 
reward and is tempted to take it, he is frustrated 
by remembering the contingency that attainment of 
the reward now prevents his getting the preferred 
reward later. When the subject attends to the delayed 
reward, he is frustrated by the fact that he wants 
it now but cannot have it yet. When he attends to 
both objects, both of the above aversive frustrations 
occur, and, hence, delay should be most difficult 
for him. Indeed, this attentional condition, in which 

could signal at any time to terminate the
waiting period, thereby forfeiting the more 
preferred gratification but attaining the less 
desirable one immediately. 


Method 
Design 


The independent variable in this study was a 
manipulation designed to permit the subject to dis- 
tract himself from the reward objects for which he 
was waiting. In all conditions, while the subjects 
waited, the immediate and delayed rewards were 
both physically available for direct attention. The
dependent variable was the length of time the chil- 
dren remained alone in the room before they rang a 
bell and thereby ended the delay period. 

Two methods of self-distraction were used. One 
technique involved an external activity; in the other 
method, instructions were given to generate internal 
cognitive activity. In the external activity, the child 
was given the opportunity to play with an attractive 
toy while he waited. In the cognitive activity group, 
instructions were given to increase the probability 
that the child would think pleasant and distracting 
thoughts while he was waiting. A control group was 
designed to determine how long the children would 
wait for the delayed reward without either the
external or the internal distractors provided by the 
experimenters. To control for the effects of playing
with an attractive toy or thinking pleasant thoughts 
independent of waiting for the delayed reward, two 
additional control groups of children were given 
either the toy or the cognitive sets but no delayed 
reward contingency. 

Thus, a total of five groups were employed: Group 
1: waiting for delayed reward with external distrac- 
tor (toy); Group 2: waiting for delayed reward with 
internal distractor (ideation); Group 3: waiting for 
delayed reward (no distractor); Group 4: external
distractor (toy) without delay-of-reward waiting 
contingency; Group 5: internal distractor (ideation) 
without delay-of-reward waiting contingency. 
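
The five-group layout described above can be restated compactly as data. The following is only an illustrative sketch in Python; the field names (`reward_contingent`, `distractor`) and the helper `groups_where` are labels chosen here for convenience and do not come from the study itself.

```python
from typing import NamedTuple, Optional


class Group(NamedTuple):
    """One cell of the Experiment I design (field names are hypothetical)."""
    number: int
    reward_contingent: bool      # was a delayed reward contingent on waiting?
    distractor: Optional[str]    # "toy", "ideation", or None


# The five groups as listed in the Design section.
GROUPS = [
    Group(1, True, "toy"),
    Group(2, True, "ideation"),
    Group(3, True, None),
    Group(4, False, "toy"),
    Group(5, False, "ideation"),
]


def groups_where(reward_contingent: bool) -> list:
    """Return the group numbers with or without the delay contingency."""
    return [g.number for g in GROUPS if g.reward_contingent == reward_contingent]


print(groups_where(True))    # delay-of-gratification conditions
print(groups_where(False))   # control conditions without the contingency
```

Laid out this way, Groups 4 and 5 mirror Groups 1 and 2 with the contingency removed, while Group 3 has no counterpart without a contingency.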


Subjects 


The subjects were 50 children (25 boys and 25 
girls) from the Bing Nursery School of Stanford 
University. They ranged in age from 3 years 6 
months to 5 years 6 months, with a mean age of 
4 years 6 months. Six additional children began but 
did not complete the experimental procedures be-
cause they did not comprehend the instructions. Five
males and 5 females, equated for mean age, were 
randomly assigned to each of the five conditions. 
One male and one female served as experimenters. 
Specifically, within each condition, the male experi- 
menter ran 3 male and 2 female subjects, and the 
female experimenter ran 3 female and 2 male subjects. 
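
The allocation arithmetic in this paragraph can be checked directly. The short Python sketch below simply tallies the numbers stated above (five conditions; each experimenter running 3 same-sex and 2 opposite-sex subjects per condition); the variable and function names are illustrative assumptions.

```python
CONDITIONS = 5

# (experimenter sex, subject sex) -> children run per condition, as stated in the text.
RUNS_PER_CONDITION = {
    ("male", "male"): 3,
    ("male", "female"): 2,
    ("female", "female"): 3,
    ("female", "male"): 2,
}


def per_condition_by_subject_sex(runs):
    """Sum the children of each sex run in a single condition."""
    totals = {}
    for (_experimenter, subject), n in runs.items():
        totals[subject] = totals.get(subject, 0) + n
    return totals


per_condition = per_condition_by_subject_sex(RUNS_PER_CONDITION)
total_subjects = CONDITIONS * sum(RUNS_PER_CONDITION.values())

print(per_condition)     # children of each sex in every condition
print(total_subjects)    # overall sample size implied by the allocation
```

The tally gives 5 boys and 5 girls per condition and 50 children overall, which matches the sample described at the start of the paragraph.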


both rewards are present, was the one that has 
produced the shortest waiting time in a previous 
study (Mischel & Ebbesen, 1970). 


Procedure


The experimental room and setting were similar to
that previously described (Mischel & Ebbesen, 1970). 
An addition was a barrier, behind which the experi- 
menter, but not the children, could see and reach. 
Behind this barrier was placed a “Slinky” (a toy 
spring) and an opaque cake tin. Under the cake tin 
was a small marshmallow and a stick pretzel. A box 
of attractive battery- and hand-operated toys was 
on a second table next to the barrier. A table and 
chair were against one wall, and on the table was 
a desk bell. 

When the experimenter escorted a child to the 
experimental room, he first showed the child the 
box of toys and explained that they would play 
with the toys later on in the session. After finishing 
a brief demonstration of one or two of the toys, the 
experimenter escorted the subject to the table with 
the desk bell and asked the child to sit in the chair 
in front of the table. The experimenter then intro- 
duced and practiced the child’s use of the bell signal. 
They played a “game” in which the experimenter 
repeatedly stepped out of the room, closed the door, 
but returned as soon as the child signaled. Every 
time the bell was rung, the experimenter thus im- 
mediately returned to the room. The procedure was 
the same as the one previously described (Mischel & 
Ebbesen, 1970), with the exception that in the 
present study the signal was a bell. Thereafter, the 
experimenter consulted a concealed slip of paper 
which informed him by a prearranged random 
schedule of the condition to which the child was 
to be assigned.


Delay-of-Gratification Contingency 
Instructions 


For subjects assigned to the three delay-of- 
gratification conditions (Groups 1, 2, and 3), the 
experimenter next removed the cake tin from behind 
the barrier and placed it on the table in front of the 
child, giving these instructions: 


Let’s see what’s under here. I’ll bet it’s a surprise.
Oh boy, look at that. A marshmallow and a pretzel. 
Which would you like to eat? You can eat either 
the marshmallow or the pretzel. [At this point the 
child chose which one he wanted to eat.] Oh, you 
know what? I have to go out of the room now, 
and if you wait until I come back by myself then 
you can eat this one [pointing to chosen object] 
right up. But, you know, if you don’t want to 
wait you can ring the bell and bring me back 
anytime you want to. But if you ring the bell then
you can’t have this one, but you can have that 
one (pointing to the unchosen object). So, if you 
ring the bell and bring me back then you can’t 
have the ______, but you can have
the ______.


The experimenter then assessed the child’s compre- 
hension by asking three questions: “Can you tell me, 
which do you get to eat if you wait for me to come 
back by myself?” “But if you want to, how can you
make me come back?” “If you ring the bell and 
bring me back, then which do you get?” As pre- 
viously mentioned, six children were eliminated from 
the study. Four failed to pass these questions, and 
the other two subjects were lost because one ate the 
food objects while the experimenter was out of the 
room, and the other one refused to ring the bell 
during training. 

In one of the three delay-of-gratification condi- 
tions (Group 3), subjects did not receive either the 
thinking or the toy activity distraction instructions, 
but they did receive the foregoing waiting contin- 
gency instructions. In this condition, after the sub- 
jects answered the three comprehension questions cor- 
rectly, they were simply told: “I have to leave the 
room now. And if you want to you can ring the bell 
whenever you want to and bring me back. When I 
come back, whether you ring the bell or wait for me 
to come back by myself, we’ll play with all my toys.” 

In Groups 4 and 5—the two conditions with no 
delay-of-reward contingency—the foregoing instruc- 
tions of course were not given. Instead, the experi- 
menter, after stating that he would have to leave, 
merely looked at some papers, while telling the child 
that he had to “check something” before he left and 
then shuffled through the papers for approximately 
as long as it would take to give the waiting 
contingency instructions. 


Distraction through Overt Activity 
Instructions 


In the two overt distraction conditions (Groups 
1 and 4), the child was left alone in the room with 
a potential distracting activity that involved playing 
with a toy (the Slinky). In Group 1 each subject 
was also given the delay-of-gratification contingency 
instructions and thus was waiting with the possibility 
of getting the preferred food object if he waited long 
enough, and the less preferred object if he did not 
delay long enough. In Group 4 the child was left 
alone merely to play as long as he wished. In both 
of these groups, prior to leaving the room, the experi- 
menter placed the Slinky on the floor and informed 
the child that he could play with the Slinky on the 
floor for as long as he wanted, that he could ring 
the bell whenever he wanted to bring back the 
experimenter, and that, regardless of whether the 
child rang the bell or waited, he could play with 
the toys when the experimenter came back. 


Distraction through Cognition-Inducing 
Instructions 


In Groups 2 and 5, before leaving the room, the 
experimenter gave the subject instructions designed 
to encourage the child to generate his own thoughts 
and covert cognitive activities while waiting. He 
said: “Oh, while I’m gone you can think of anything
that’s fun to think of, for as long as you want to, 
if you want to. Can you tell me something to think 
about that’s fun?” (The experimenter paused for the 

Fig. 1. Mean number of minutes of voluntary waiting time for each condition
in Experiment I.


child’s examples and said “Yes” no matter what the 
subject said.) The experimenter then added other 
examples: “You can also think about singing songs, 
or think of playing with toys or anything that is fun 
to think of.” His final departing instructions were 
identical to those given in the other groups. 

Recall that Groups 2 and 5 differed in that only 
the children in Group 2 expected to get a reward 
if they waited long enough, although both groups 
could remain alone and “think about fun things” for 
as long as they wished before terminating the delay 
by signaling. 

In all groups, when the experimenter returned he 
asked the child “What happens now?” No child 
failed to respond correctly (either verbally or by 
eating the proper food reward spontaneously). In all 
conditions the session ended with the experimenter 
and child playing with the toys as had been 
promised. 


Sequence


To review, the order of events was as follows:
(a) The experimenter demonstrated some toys to the 
child. (b) The child was taught how to bring the 
experimenter back into the room (by ringing the 
bell). (c) In Groups 1, 2, and 3, the children were 
presented with the delay contingency, and compre- 
hension questions, while in Groups 4 and 5 the 
experimenter merely explained that he had to leave 
the room and that the children could bring him 
back whenever they wished to. (d) The overt dis- 
tractors were presented in Groups 1 and 4, and the 
covert, in Groups 2 and 5. (e) Finally, all children
were reminded that no matter what they did (ring
the bell or not) when the experimenter returned, they 
would play with the toys. 


Results 
Delay of Gratification 


The mean length of waiting time was com- 
puted for each condition (Figure 1). The first 
result to note is the extremely low mean delay 
time found in the delay-of-reward condition 
in which no distractor was available. In this 
condition, since both the chosen and unchosen 
rewards were present, and no distractor was 
available, the attention paid to the rewards 
should have been fairly high. The low mean 
delay time of less than ½ a minute found
for this group replicates the low mean waiting 
time found in the previous comparable study
when children also waited with both rewards 
present, and no distractor was available 
(Mischel & Ebbesen, 1970). 

As can be seen in Figure 1, the mean length 
of time which the children waited was much 
greater when they had available either an 
external or a cognitive distractor during the 
delay period, the mean delay times in the 
latter two conditions being 8.59 and 12.12 
minutes, respectively. 

Table 1 presents a one-way analysis of
variance for the mean waiting times and four 
orthogonal contrasts (Winer, 1962). The 
overall effect of conditions was highly signifi- 
cant (p < .001). Orthogonal contrasts were 
computed to determine the exact sources of 
this effect and to test specific hypotheses 
about the effect of distractors on the length 
of voluntary delay of reward. 
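
Each F ratio in such an analysis is the mean square for the effect divided by the error mean square. As a minimal sketch (Python, using only the mean squares printed in Table 1; the names `MEAN_SQUARES`, `REPORTED_F`, and `f_ratio` are illustrative), the ratios can be recomputed as follows.

```python
MS_ERROR = 15.1   # error mean square as printed in Table 1 (df = 45)

# Mean squares and F ratios as printed in Table 1.
MEAN_SQUARES = {"Between": 289.1, "C1": 428.2, "C2": 655.1, "C3": 62.7, "C4": 3.6}
REPORTED_F = {"Between": 19.21, "C1": 28.46, "C2": 43.54, "C3": 4.18}


def f_ratio(ms_effect: float, ms_error: float = MS_ERROR) -> float:
    """F is the effect mean square divided by the error mean square."""
    return ms_effect / ms_error


for source, ms in MEAN_SQUARES.items():
    print(f"{source}: F = {f_ratio(ms):.2f}")
```

The recomputed values fall within about 1 percent of the printed ones, a gap that would be expected if the printed error mean square had been rounded to one decimal place.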

The first orthogonal contrast (C1 in Table
1) compared the three delay-of-gratification 
contingency conditions (Groups 1, 2, and 3) 
with the two conditions in which this con-
tingency was absent (Groups 4 and 5). This
contrast yielded a highly significant effect
(p < .001), indicating that giving the children
a reward for which to wait greatly increased 
the length of time which they spent alone in 
the experimental room. 

It was also predicted that waiting would 
be long only in those conditions in which a 
chosen reward was contingent on delay and a 
distractor was available. As Figure 1 reveals, 
the availability of the desired but delayed 
gratification yielded a mean delay time of 
less than ½ a minute when the subject had
no overt or cognitive distractions available to 
reduce or avoid frustration while he was at- 
tending to the rewards. The second contrast 
(C2 in Table 1) was computed with only the
three contingent delay-of-gratification condi-
tions. This contrast compared the two dis-
traction conditions (Groups 1 and 2) with the
one no-distraction condition (Group 3). The
F for this contrast was highly significant
(p < .001) and strongly confirmed the pre- 
diction that children would wait longer for 


TABLE 1


ANALYSIS OF VARIANCE AND ORTHOGONAL CONTRASTS
FOR MEAN WAITING TIMES AS A
FUNCTION OF DELAY CONTINGENCY
AND DISTRACTION CONDITIONS


Source      df     MS       F
Between      4    289.1    19.21**
C1           1    428.2    28.46**
C2           1    655.1    43.54**
C3           1     62.7     4.18*
C4           1      3.6     <1
Error       45     15.1


* p < .05.
** p < .001.




TABLE 2


NUMBER OF CHILDREN WAITING OR NOT WAITING
TO THE CRITERION (15 MINUTES) AS A
FUNCTION OF DELAY CONTINGENCY
AND DISTRACTION CONDITIONS
(EXPERIMENT I)


             Waiting for contingent reward (a)           Noncontingent waiting (b)
Waiting to   No            Toy as        Thinking fun    Toy as        Thinking fun
criterion    distraction   distraction   things as       distraction   things as
                                         distraction                   distraction
No           10            6             4               10            10
Yes           0            4             6                0             0


(a) χ² = 8.18, df = 2, p < .025.
(b) χ² = ns.


rewards if a distractor from the rewards were 
available during the delay period. 

The third contrast (C3 in Table 1) com- 
pared the two distraction conditions in which 
children were waiting for a reward. The dif- 
ference between the mean length of times 
waited in these two conditions reached signifi- 
cance at the .05 level: a longer mean delay 
time was found in the cognitive distraction 
condition than in the overt distraction condi- 
tion. Because this difference was not very 
great, a more stringent statistical test was also 
computed, using the studentized range statis- 
tic and the Tukey (a) procedure for a pos- 
teriori ¢ tests (Winer, 1962). With this more 
stringent statistic, the difference between the 
toy distraction and the cognitive distraction 
when children were waiting for a reward did 
not approach significance (t = 2.87).
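
For readers who wish to trace this statistic, the sketch below (Python) computes the studentized range statistic for two group means from quantities reported in this article: the two mean delay times (12.12 and 8.59 minutes), the error mean square in Table 1 (15.1), and the 10 subjects per group. The function name and the choice of formula are assumptions made here; the article does not spell out its computation.

```python
import math


def studentized_range_q(mean_a: float, mean_b: float,
                        ms_error: float, n_per_group: int) -> float:
    """Studentized range statistic for two means with equal group sizes:
    the absolute mean difference divided by sqrt(MS_error / n)."""
    return abs(mean_a - mean_b) / math.sqrt(ms_error / n_per_group)


# Cognitive distraction (12.12 min) versus toy distraction (8.59 min).
q = studentized_range_q(12.12, 8.59, ms_error=15.1, n_per_group=10)
print(round(q, 2))
```

Under these assumptions the result rounds to 2.87, the value reported above, which suggests that the reported figure is expressed on the studentized range scale.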

A fourth contrast (C4 in Table 1) com-
pared the two conditions in which reward was
not contingent on delay (Groups 4 and 5). 
It was not significant (F < 1). 
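
The four comparisons described above form an orthogonal set. The article does not print the contrast weights, so the coefficients in the following Python sketch are an assumption: they are the conventional integer weights for these comparisons with equal group sizes, listed in the order Group 1 through Group 5.

```python
from itertools import combinations

# Assumed weights (not given in the article), ordered Group 1..Group 5.
CONTRASTS = {
    "C1": (2, 2, 2, -3, -3),   # contingent groups vs. noncontingent groups
    "C2": (1, 1, -2, 0, 0),    # two distraction groups vs. no distraction
    "C3": (1, -1, 0, 0, 0),    # toy vs. ideation, reward contingent
    "C4": (0, 0, 0, 1, -1),    # toy vs. ideation, no contingency
}


def is_contrast(weights) -> bool:
    """A contrast's weights must sum to zero."""
    return sum(weights) == 0


def dot(a, b) -> int:
    """With equal group sizes, two contrasts are orthogonal when this is zero."""
    return sum(x * y for x, y in zip(a, b))


all_sum_to_zero = all(is_contrast(w) for w in CONTRASTS.values())
all_orthogonal = all(
    dot(CONTRASTS[a], CONTRASTS[b]) == 0 for a, b in combinations(CONTRASTS, 2)
)
print(all_sum_to_zero, all_orthogonal)
```

Because five groups provide four degrees of freedom between conditions, a set of four mutually orthogonal contrasts partitions the between-conditions variation without overlap.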

Another index of waiting behavior is the 
number of children who waited the full 15 
minutes (i.e., until the experimenter returned 
by himself). These results (Table 2) were in 
the same direction as those reported for the 
mean waiting times. A chi-square compared 
the number of children waiting to criterion 
with the number who did not in the three 
contingent delay-of-gratification conditions, 
and yielded a significant effect (χ² = 8.18,
df = 2, p < .025). When delay of gratifica-
tion was attempted in the presence of rewards
and no distractors were available to
suppress attention from them, not a single 
child waited to criterion. When distractors 
were available cognitively or overtly, half of 
the subjects waited to criterion. Moreover, 
when the distractors were available, but re- 
wards were not contingent on waiting, not a 
single subject waited to criterion. 
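
The chi-square comparison for the three contingent conditions can be sketched from the counts in Table 2. The Python function below computes the ordinary (uncorrected) Pearson statistic; that choice is an assumption, since the article does not state which computational variant was used.

```python
def pearson_chi_square(table):
    """Uncorrected Pearson chi-square for a contingency table (list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# Rows: did not wait / waited to criterion.
# Columns: no distraction, toy distraction, "think fun" distraction (Table 2).
counts = [[10, 6, 4],
          [0, 4, 6]]
chi2, df = pearson_chi_square(counts)
print(round(chi2, 2), df)
```

On these counts the uncorrected statistic is about 8.4 with 2 degrees of freedom, close to but not identical with the reported 8.18; the small difference may reflect a different computational variant and does not alter the conclusion.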

Before discussing the present results, two 
additional experiments will be described. 
These studies are intended to clarify further 
the cognitive and attentional mechanisms that 
seem most crucial for effective voluntary 
delay of gratification.


EXPERIMENT II


The effects found in the first study were 
strong and in accord with theoretical expecta- 
tion. The results clearly supported the 
hypothesis that effective delay behavior is 
greatly enhanced by the avoidance or reduc- 
tion of the frustrative aspects of delay of 
gratification. Such reduction presumably was 
achieved when the subjects shifted attention 
away from the potential gratification and 
instead distracted themselves with competing 
cognitions or with overt activity. 

The findings concerning the potency of 
the instruction-induced cognitive distractions 
seemed especially provocative. It also would 
be important to determine how the substan- 
tive content of cognitions (as generated by 
various types of instructions) affects subse-
quent delay behavior. A second experiment
was designed to explore this topic. 

It was assumed that the effects of instruc- 
tion-produced cognitive content on voluntary 
delay would parallel those found by manipula- 
tion of external stimulus objects. Therefore, 
it was predicted that delay of gratification 
would be short when the frustration was made 
high by directing the children to think about 
the rewards. We also expected that the 
content of thoughts could influence their 
effectiveness in bridging the delay-of-reward 
period. Thus, we anticipated that aversive 
cognitions, as in “thinking sad thoughts,” 
should be relatively ineffective compared to 
such positive cognitions as ideating about 
“fun things.” Such a result could be expected 
for several reasons. First, aversive thoughts
might be avoided and not employed effec-
tively as distractors. Alternatively, if sad
thoughts were generated by the children, the 
additional aversiveness might lead them to 
terminate the already aversive waiting situa- 
tion. Consequently, it was predicted that delay 
of gratification would be longer when the 
cognitions were affectively positive distrac- 
tors and shorter when the cognitions were 
affectively negative. 


Method 
Design 


A three-condition study was designed which varied 
the types of instructions given to the subjects just 
before they began to wait for rewards. The instruc- 
tions were intended to induce in the subject various 
types of ideation during the delay-of-gratification 
period. One condition was a replication of a pre-
viously run cell: Subjects here were instructed that
they could think about fun or happy thoughts 
while waiting. In a second condition the subjects 
were told that they could think unhappy or sad 
thoughts while they waited. In the final condition 
the children were instructed that they could think 
about the reward objects. In all conditions both 
rewards were again present during the waiting period. 
The dependent variable was the length of time that 
the children waited for their more preferred reward 
before terminating and settling for the less preferred 
outcome. 


Subjects 


Thirty-two children from the Bing Nursery School 
of Stanford University were the subjects in this 
study. Six of them were lost because of incomplete 
understanding of the instructions or because they 
ate one of the reward objects while waiting for the 
experimenter. The subjects in the final analysis ranged 
in age from 3 years 9 months to 5 years 3 months, 
with a mean age of 4 years 9 months. The final data 
were based on 10 subjects in each of the two new 
conditions and 6 subjects in the replicated condition. 
Sex ratios and ages were equated across groups. 


Procedure 


The initial procedure in the three conditions of this 
study was identical to that in Groups 1, 2, and 3 
of Experiment I, in which the children were waiting 
for reward objects. The only differences came after 
the child had answered the usual three comprehen-
sion questions correctly. As in the earlier studies,
after the child answered these three questions, the 
experimenter was informed as to which one of the 
following three sets of instructions he was to give; 
the decision had been determined randomly. The 
“think fun” distraction instructions were identical 
to those described for the equivalent group in 
Experiment I. 

In the “think sad” distraction instructions, the
key phrases as the experimenter started to depart
were “Oh, while I'm gone you can think of anything
that is sad to think of, for as long as you want to,
if you want to. Can you tell me something that is
sad to think of?” (The experimenter paused for


the child’s examples and said “Yes” regardless of 
the answers.) The experimenter then added other 
examples: “You can also think of falling down and 
getting a bloody knee which hurts a lot, or you can 
think of crying with no one to help. You can think 
of anything that makes you unhappy.” 

The “think food reward” instructions directed at- 
tention to the reward objects. Therefore, the experi- 
menter simply mentioned both reward objects as 
things the child could think about while the experi- 
menter was gone: “Oh, while I’m gone you can 
think of the marshmallow and the pretzel for as 
long as you want to, if you want to.” This thought 
was repeated several times in reversed order and 
rephrased form, with the experimenter noting that 
the child could think anything he wanted to about 
the pretzel and the marshmallow for as long as he 
wanted to. 

In all groups the final departing instructions were 
identical and the same as described in Experiment I; 
thus, they emphasized that the child could ring the 
bell or wait, and in either event the experimenter 
and child would play with all the toys at the end. 


Results 


The mean number of minutes waited in the 
three conditions (“think fun,” “think sad,” 
and “think food”) is summarized in Figure 2. 
A one-way analysis of variance of these data 
is summarized in Table 3. It can be seen that 
most of the between variance is accounted 
for by the comparison between the think fun 
condition and the other two conditions, think 
sad and think food. This finding is exactly 
in accord with the prediction. The direction 
of the difference was as expected: namely, 
children waited longer for a reward when they 


TABLE 3


ANALYSIS OF VARIANCE AND ORTHOGONAL CONTRASTS
FOR MEAN LENGTH OF DELAY OF
GRATIFICATION AS A FUNCTION OF
VARIOUS THOUGHT-INDUCING INSTRUCTIONS
(EXPERIMENT II)


Source      df     MS       F
Between      2    176.1     5.47*
C1           1    348.9    10.84**
C2           1      3.      <1
Error       23


* p < .05.
** p < .01.

Fig. 2. Mean number of minutes of voluntary wait-
ing time for each condition in Experiment II.


presumably were distracted by thinking “fun 
things” than when they were thinking about 
the food rewards or than when they were 
thinking “sad things.” 
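
Because each F ratio in Table 3 is a mean square divided by the same error mean square, the printed mean squares and F ratios should imply one common error term. The Python sketch below checks that consistency using only the legible entries of Table 3; the function name is illustrative, and the resulting figure is an inference from the printed ratios, not a value taken from the table.

```python
def implied_error_ms(ms_effect: float, f_ratio: float) -> float:
    """Error mean square implied by an effect mean square and its F ratio."""
    return ms_effect / f_ratio


from_between = implied_error_ms(176.1, 5.47)   # overall between-conditions effect
from_c1 = implied_error_ms(348.9, 10.84)       # "think fun" vs. the other two conditions

print(round(from_between, 2), round(from_c1, 2))
```

Both rows imply an error mean square of roughly 32.2, so the two legible F ratios are consistent with a single error term.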

As Figure 2 indicates, instructions to think 
about the food rewards and instructions to 
think about sad things produced similar delay 
times. It is also interesting to compare the 
delay times in these two conditions with the 
delay time in a similar condition but without 
any ideation instructions—namely, Group 3 
from Experiment I. 

To compare these three conditions, the 
distribution of waiting times for subjects was 
dichotomized at the median. A chi-square 
comparison among conditions on this dichot-
omy did not approach significance (χ² = 3.2,
df = 2), suggesting that delay times in the
three conditions were essentially similar. 
Thus, instructions to think about the rewards, 
or to think sad, did not significantly facilitate 
delay of gratification when compared to a 
no-ideation condition. Instructions to think 
about fun things, however, produced waiting 
times well above either of the two other 
think groups and also greater than the compa- 
rable but uninstructed delay-of-gratification 
condition (Group 3 from Experiment I). In 
this regard, note that the “think fun” group 
of Experiment II closely replicated the long 
mean waiting time found in the comparable 
condition of Experiment I. 

Experiments I and II involved distraction
manipulations when the subjects were waiting
for delayed rewards that were always visually 
present and directly available to attention. It 
is conceivable that under conditions in which 
delayed gratifications are not physically avail- 
able for direct visual attention, their mental 
representation in the form of images or ideas 
does have a “time-binding” function and 
would facilitate voluntary delay. Indeed, that 
expectation is one that might follow from 
Freud’s (orig. publ. 1911) formulation of pri- 
mary process and the development of delaying 
ability. Freud implied that delay capacity 
begins to develop when the child provides 
himself with mental representations or images 
of the delayed object, but only when he can- 
not see it externally. That is, some discharge 
of tension may be achieved by cathecting an
image of the reward objects, but only when 
they are physically unavailable for direct at- 
tention. For example, the hungry child may 
achieve some gratification by hallucinating the 
mother’s breast when it is absent. If one 
tested that hypothesis in the present para- 
digm, one would have to predict long delay 
times when subjects were instructed to ideate 
about the rewards but the rewards were 
physically obscured from view. On the other 
hand, a focus on the frustrativeness of non- 
reward predicts that such instructions would 
produce short delay times. Experiment III 
was designed to test which expectation was 
correct.


EXPERIMENT III


METHOD 
Design 


The previous experiments were undertaken to 
determine the effects of distraction from the reward 
objects on delay of gratification. This experiment 
was designed to direct the subjects’ attention to the 
reward objects cognitively when the objects them- 
selves were physically obscured during the waiting 
period. One experimental condition and two control 
groups were run. In all three groups both the im-
mediate and the delayed rewards were obscured from 
the sight of the children during the waiting period. 
The groups differed, however, in the instructions 
given to the subjects. In the experimental condition, 
the subjects were instructed that they could think 
about the reward objects while waiting. In the two
control conditions, the subjects were either told 
nothing or they were instructed that they could 
think about “fun things” while waiting. The de-
pendent variable was the length of time that the 
children waited for the delayed reward before 
terminating by settling for the less preferred, but 
immediately available, outcome. 


Subjects 


Sixteen subjects were run in this study. The sub- 
jects ranged in age from 3 years 5 months to 5 years 
6 months, and their mean age was 4 years 6 months. 
There were 11 males and 5 females. Half of the
subjects were assigned to the experimental condition 
(which directed the children’s attention to the reward 
objects). The remaining 8 subjects were equally 
divided between the two control conditions (“think 
fun” and no ideation). The sex ratios and the mean 
ages of the subjects were similar across conditions.
One male experimenter tested all of the subjects. 


Procedure 


All procedural aspects of this experiment were 
identical to those in the previous two experiments 
except for the following modifications. In the pre- 
vious experiments both of the reward objects were 
directly available for attention during the delay 
period. In this study it was necessary to remove 
the reward objects from the child’s visual field. If
the experimenter took the rewards with him when
he left the child waiting in the room, he might affect
the child's trust that the promised rewards would
ultimately be returned. Therefore, the reward objects
were placed under an opaque cake tin and put under
the table at which the child sat so that they could 
not be seen by him during the waiting period. 
The children were told that this operation would 
keep the food objects fresh while the experimenter 
was out of the room. The rewards were obscured 
in this fashion after the child had passed the compre-
hension questions. Thereafter, the experimenter was
informed of the condition in which the subject was 
to be run. 

In the experimental condition the experimenter 
gave the subjects the “think food rewards” instruc- 
tions used in Experiment II. These instructions were 
designed to direct the children’s attention to the 
reward objects during the delay interval. In one 
control condition, the experimenter merely left the 
room; in the other control condition, the experi- 
menter gave the identical “think fun” instructions 
that were used in Experiments I and II. After
completing these instructions, the experimenter left 
the room. 


RESULTS 


All of the means are depicted in Figure 3. 
The “no-ideation” group mean was 12.86 
minutes, and the “think fun” condition mean 
was 14.48 minutes. It is interesting to note 
that these means are very close to the “think 
fun” means found in Experiments I and II, 
which were 12.12 and 13.33 minutes, respec-
tively. In the experimental condition (“think 
rewards”) in which the children were directed 
to think about the physically obscured reward 
objects, the mean waiting time was only .78 
minute. These results utterly contradict the 
belief that ideating about the reward objects 
in their absence enhances voluntary delay of 
gratification for them. 

Mean delay times in the two control groups 
(“no ideation” and “think fun”) obviously
did not differ from each other, and, conse- 
quently, they were combined and compared 
with mean waiting time in the experimental 
condition. The resulting t test was highly
significant (t = 11.93, df = 14, p < .001).
Moreover, of the eight children in the experi- 
mental group, not a single one waited more 
than 2 minutes; in contrast, five of the eight 
control children waited the full 15 minutes 
(p = .05, two-tailed by Fisher’s exact test). 
Thus, even when the rewards are not visually 
present, ideating about them and attending to 
them cognitively serves to substantially de- 
crease, rather than to enhance, the duration 
of delay of gratification for the sake of 
attaining the preferred reward. 


Discussion: EXPERIMENTS I, II, AND III


Considering all three experiments together, 
it seems remarkable how well our brief 
cognition-inducing instructions seemed to 
work with our young subjects. The present 
seemingly simple techniques may provide a 
fruitful methodology for studying cognition 
and attention experimentally in young chil- 
dren. By manipulating cognition-inducing in- 
structions and visual presence of rewards, 
extremely powerful effects on delay time were 
obtained. 

To provide an overview of all the results, 
the main findings from the present three 
experiments are summarized in Figure 4. As 
Figure 4 shows clearly, effective delay of 
gratification depended on cognitive avoidance 
or suppression of the reward objects during the 
waiting period. This conclusion is based on 
several sets of data. First, when the subjects 
were waiting for the preferred but delayed 
reward with the reward objects in their attentional
field, delay of gratification was minimal.

Fig. 3. Mean number of minutes of voluntary waiting
time for each condition in Experiment III.


In contrast, delay was dramatically facili- 
tated when the subjects engaged in affectively 
pleasant cognitive distractions (“think fun” 
conditions) during the delay period. That 
result was replicated in the second experiment. 
Second, and completely consistent with 
these findings, are the results from Experi- 
ment III, in which the rewards were not 
externally available for attention during the 
delay period. As Figure 4 shows, under these 
conditions the “think fun” group again 
yielded extremely long periods of delay of 
gratification. In contrast, cognitions about the
rewards (induced by the “think rewards” 
instructions) resulted in an average delay 
time of less than 1 minute. Thus, when the 
children thought about the absent rewards, 
it was as difficult for them to delay gratifica- 
tion as when the rewards were directly in 
their attentional field (the no-ideation condi- 
tion of Experiment I). Note also that when 
the rewards were not available for direct 
attention, uninstructed subjects (no ideation) 
found it relatively easy to delay gratification, 
waiting no less than the “think fun” children. 
These findings in the two no-ideation condi- 
tions essentially replicate those from compa- 
rable conditions in a previous study (Mischel 
& Ebbesen, 1970). In that study it was found
that delay of gratification was exceedingly
difficult when the youngsters faced the reward
objects (either the delayed one, the immediate
one, or both, with no differences between
these conditions). However, just as in the
present Experiment III, when the children
waited with no reward objects in their attentional
field, they were able to substantially
delay gratification in order to attain the
preferred but delayed reward.

Fig. 4. Mean number of minutes of voluntary waiting time for treatment
conditions in Experiments I, II, and III, comparing different ideation
instructions with controls.

The present data may be relevant to 
psychodynamic theorizing regarding delaying 
capacity. Predictions from psychodynamic 
theory concerning the development of delay 
capacity are complex since the process is seen 
as one that involves transitions from primary 
process to secondary process thinking. Ac- 
cording to Rapaport’s (1967) elaboration and 
clarification of Freudian theory, delay ca- 
pacity begins with “the emergence of a hal- 
lucinatory image of the need-satisfying object 
when tension rises to the point where dis- 
charge should take place but the need- 
satisfying object is not present [pp. 315- 
316].” This part of the theory seems to 
suggest that instructions designed to help 
subjects imagine or ideate about absent but 
desired delayed gratifications should facilitate
voluntary delay time. Clearly, the present
data do not support such a view and therefore
might be interpreted as undermining the
Freudian position. 

But Rapaport (1967) goes on to state that 
this image of the need-satisfying object does 
not provide “more than a minute opportunity 
for discharge [p. 316].” He therefore argues 
that delay is further developed by transition 
from primary process hallucinatory images 
to secondary process reality testing. In this 
phase of the development of delay capacity, 
discharge is postponed “until external reality 
conditions have been found suitable [p. 
318].” Psychodynamic theory (as interpreted 
by Rapaport) thus suggests that effective 
delay begins to occur when the ego can divert 
energy away from images of delayed rewards 
and toward reality consideration and instru- 
mental activity. Rapaport noted that internal 
impulse control requires “countercathexes” as 
in repression. This part of the psychodynamic 
formulation of delay seems to imply that 
removing attention from the delayed rewards 
might enhance effective impulse control. 
Rapaport correctly comments that “Little is 
known about the nature of the process by
which these countercathexes arise [p. 219].”
Future research needs to explore more closely 
the exact conditions that moderate the rela- 
tions between specific cognitive activity and 
delay behavior. 

The research thus far demonstrates that 
effective delay, rather than being mediated 
by consummatory fantasies, probably depends 
on suppressive and avoidance mechanisms 
that reduce frustration. This interpretation is 
congruent with the “Satan get thee behind 
me” approach to self-control recognized by 
Skinner (1948) in Walden Two. In his novel, 
Skinner described children learning to control 
themselves by learning to physically and 
mentally remove temptations. This approach 
emphasizes that one may effectively resist 
temptations by engaging in activities (overtly 
or covertly) that prevent one from attending 
to them. Obviously, not just any cognition 
serves as an effective distractor from aversive 
frustration or temptation. As the second ex- 
periment indicated, “think fun” is better 
than “think sad,” whereas “think rewards” 
increases frustration most, hence making 
continued delay most difficult. 

Observations of the children, in the 
reward-absent delay conditions, lend further 
credence to these interpretations. When the 
distress of waiting seemed to become espe- 
cially acute, children tended to reach for the 
termination signal, but in many cases seemed 
to stop themselves from signaling by abruptly 
creating external and internal distractions 
for themselves. They made up quiet songs 
(“Oh this is your land in Redwood 
City”), hid their heads in their arms, 
pounded the floor with their feet, fiddled 
playfully and teasingly with the signal bell, 
verbalized the contingency (“If I stop now
I get ___, but if I wait I get
___”), prayed to the ceiling,
and so on. In one dramatically effective self- 
distraction technique, after obviously experi- 
encing much agitation, a little girl rested 
her head, sat limply, relaxed herself, and 
proceeded to fall sound asleep. 

These observations and the results from 
three experiments suggested that the ease with 
which subjects can cope with the frustration 
depends on the overt and covert response 
alternatives available to them during the imposed
delay. The manipulations of the present
experiments may be construed as having provided
subjects with various types of planned
alternative responses for coping with that 
frustration. The more the available response 
directed the child’s attention away from the 
frustration, the better he was able to con- 
tinue the delay and substitute a new adaptive 
activity during the frustration; in the present 
paradigm that activity could have been any- 
thing that kept him waiting without ideating 
about the goal objects. 

It would be interesting to know if pre- 
school subjects such as those in the present 
experiments were aware of the principle of 
“Satan, get thee behind me” before they 
actually began waiting. That is, given a choice 
of waiting for the preferred reward, with the 
rewards obscured or available for attention, 
would subjects make the right choice? To 
answer this question, 29 preschool children 
were administered the previously described 
instructions for the delay-of-gratification 
paradigm. Just before the point at which the 
experimenter would usually leave the child 
waiting alone in the room, she picked up an 
opaque cake cover and gave the subject the 
choice of covering with it either the rewards, 
or another set of comparable but irrelevant 
objects also lying on the table, or simply 
placing the cover over another fixed spot on 
the table. The results showed that the chil- 
dren chose the place to cover quite randomly 
before commencing their voluntary delay. 
That is, they did not seem aware that ob- 
scuring the relevant rewards would facilitate 
their ability to wait for them. Thus, young 
children do not seem to have insight into the 
role of cognition and attention in self-control, 
at least prior to actually waiting. 

The data from the main experiments seem 
to contradict James’s (1890) belief that “the 
essential achievement of the will” requires 
one to bear up and force oneself to maintain 
directed attention to the difficult or boring. 
Rather than trying to maintain aversive 
activities such as delay of reward through 
“acts of will” and focused attention, effective 
self-control may hinge on transforming the 
difficult into the easy, the aversive into the 
pleasant, the boring into the interesting, while 
still maintaining the task-required (reward-contingent)
activity. Such transformations
may occur either by engaging in the appropri- 
ate overt distracting activity (e.g., the “toy” 
condition) or changing one’s own mental con- 
tent and ideation so that it functions as a 
covert distractor. 
It is important to recognize, however, that 
the mental transformations and distractions 
which occur during delay do not erase or 
undo the role of the reward contingencies in 
the waiting situation. This is evident in the 
data from Experiment I, which show how 
little persistence there was in “thinking fun” 
or playing with a toy when there was no 
waiting contingency. The distracting activity 
itself, while pleasant and distracting enough 
to maintain waiting for a contingent reward, 
did not in itself keep the children in the room 
for more than a minute. Additional evidence 
that the contingency was available mentally 
throughout the waiting period is that the 
children easily reproduced, verbally or by 
appropriate action, the contingency at the end 
of the waiting period. Children who had been 
busily distracting themselves for the full 15 
minutes, playing with a toy or singing songs, 
immediately and spontaneously ate the appro- 
priate food reward when the experimenter 
returned. Obviously then, the transformation 
of the aversive waiting into a pleasant play 
period does not efface the task-oriented pur- 
pose of the behavior, and presumably the 
two processes somehow coexist. Subjects 
were guided by their goals, even when seem- 
ingly absorbed in distractions designed to 
obscure them. Just how the contingency was 
operating is an interesting point for specula- 
tion. The contingency may have been avail- 
able but never reproduced mentally until the 
end of waiting; even more likely, subjects 
may have reminded themselves of the con- 
tingency episodically throughout the waiting 
period. As mentioned previously, verbaliza- 
tions of the contingency often occurred when 
the subjects momentarily left their distracting 
play and seemed about to terminate the wait- 
ing period. It is as if the subject periodically 
reminds himself of the goal for which he is 
waiting, distracts himself from it to make 
delay less frustrative, and then repeats the 
process. 
Extending these speculations further, a
good way to master the difficult or aversive
may be to think or do something that is 
pleasant, while still performing the necessary 
task-relevant response (e.g., waiting, working).
Rather than “willing” oneself to heroic
bravery, one needs to perform the necessary 
“difficult” response while engaging in another 
one cognitively. The principle involved here 
seems similar to the one underlying “counter- 
conditioning” of aversive emotional reactions 
in behavior therapy. To master a snake 
phobia, for example, the subject needs to 
deal with the problematic stimulus while 
engaged in a fear-incompatible positive inter- 
nal response (e.g., relaxation); whereas in 
delay of gratification one must perform a 
difficult or problematic response while en- 
gaged in ideation of stimuli that are positive 
or distracting. In either case, it is easier to
do something difficult if one also does some- 
thing easy or pleasant at the same time. 
The findings from the present studies seem 
extremely reliable, being based on several 
replications and diverse convergent data. 
However, one obviously cannot generalize 
from them to the role of cognition in forms 
of self-control other than the delay-of- 
gratification paradigm. For example, it might 
be adaptive to ideate about desired or needed 
but currently unavailable goal objects, but 
only in situations in which the subject’s 
actions can be potentially instrumental in 
producing the desired outcome. Thus, when 
attainment of a positive outcome is contin- 
gent on the subject’s own problem-solving 
behavior, it might help him to think about 
the goal object while seeking means for 
achieving or reaching it in reality. In contrast, 
in the present delay-of-gratification paradigm, 
attainment of the preferred goal required only 
passive waiting: beyond delaying there was
absolutely nothing the subject could do to
influence the occurrence of the desired outcome.
Moreover, even his delay behavior
(while a necessary condition for attainment
of the preferred outcome) could in no way
influence the time at which gratification would
ultimately occur.

Data relevant to the dilemma, in which 
subjects cannot do anything to attain a de- 
sired but unavailable outcome, may be found 
in studies of “defensive perception.” Our
studies on attention in delay employ a design,
which, in a way, seems the reverse of these 
classic “new look” studies. In the latter the 
subject was, first, frustrated or pained and 
was then asked what he thought or perceived 
most readily (on projective measures). In
contrast, in the present methodology, the sub- 
ject is given the “thoughts,” and then their 
effect on what he does is noted; specifically, 
his ability to sustain goal-directed delay 
behavior and to cope with frustration are 
measured. Consistent with our data, findings 
from perceptual defense studies indicate a 
tendency to avoid painful stimuli cognitively 
and perceptually when nothing can be done 
by the subject to cope with them instrumentally
(e.g., Reece, 1954). It also has been
reported that in response to projective ma- 
terial, sleep-deprived subjects showed fewer 
sleep-related ideas and themes than did con- 
trols (Murray, 1959). Likewise, food ideation 
is less when subjects are severely food de- 
prived than when they are not hungry 
(Lazarus, Yousem, & Arenberg, 1953). Simi- 
larly, Clark’s (1952) pioneering experiments 
on Thematic Apperception Test (TAT) 
sexual imagery compared the amount of 
sexual responses in TAT stories written by 
sexually aroused and nonaroused males. He 
found less sexual imagery and also less “sexual 
guilt” in the stories of sexually aroused sub- 
jects. Clark attributed his results to a simul- 
taneous, predominating increase in sexual 
guilt which he suggests was evoked by the 
sexual arousal and is “sufficient to inhibit the 
expression of sex with a consequent lowering 
of guilt [p. 398].” But a more parsimonious 
interpretation in terms of the present experi- 
ments is that the Clark results could reflect 
cognitive avoidance of all sexual thoughts 
(including those scored as “guilt”) under 
conditions that made sexual thoughts highly 
frustrative, that is, sexual arousal with no 
opportunity for satisfaction. 

Thus, when subjects cannot cope with aversive
stimuli instrumentally (e.g., by going
to sleep when sleepy, by finding food when
hungry, by avoiding painful shock, and by
obtaining sexual release when aroused), they
may engage in cognitive avoidance of those 
stimuli. To decrease frustration, subjects may 
generate their own distractors and avoid the
aversive stimuli cognitively, if possible, as we
observed repeatedly in our studies. This conclusion,
that aversive stimuli are avoided
cognitively, may be restricted, however, to
paradigms in which the subject believes that 
thinking about the aversive stimulus cannot 
change the contingencies in the situation. 

The overall results of the present 
experiments help to clarify some widespread 
basic theoretical misconceptions regarding 
self-control. In particular, following dynamic 
formulations, it has been customary to con- 
strue voluntary delay of reward as involving 
the ability to defer immediate gratification. 
This ability has been viewed as an enduring 
trait of “ego strength” on which individuals 
differed stably and consistently in many 
situations. In fact, as the present data indi- 
cate, under appropriate motivational and 
attentional-cognitive conditions, virtually all 
subjects, even young children, could manage 
to delay for lengthy time periods. 

Taken collectively, research on delay of 
gratification permits us now to speculate 
about a two-part process in delay of gratifica- 
tion. First, one must consider the determi- 
nants of the choice to delay for the sake of 
more preferred delayed outcomes. This choice 
is influenced mainly by the subject’s expecta- 
tions concerning the probable consequences 
of his choice. These consequences include the 
relative subjective values of the immediate 
and delayed outcomes themselves as well as 
other probable reinforcing outcomes associ- 
ated with each alternative. As previous re- 
search has shown, expectancies relevant to 
these outcomes depend on the subject’s direct 
and vicarious past experiences and trust rela- 
tionships, modeling cues, the specific contin- 
gencies in the choice, and so on (e.g., Mischel, 
1966). 

Second, once the choice to delay gratifica- 
tion has been made, effective delay depends 
on cognitive and overt self-distractions to 
reduce the aversiveness of the self-imposed 
frustration. For this purpose, the subject 
needs to “tune out” on the goal objects 
and generate his own distractions while 
maintaining the contingent behavior for goal 
attainment. 

Thus, the subject can wait most stoically 
if he expects that he really will get the
deferred larger outcome in the waiting paradigm,
and wants it very much, but shifts his
attention elsewhere and occupies himself internally
with cognitive distractions. Any conditions
that shift attention from the delayed
objects appear to facilitate voluntary waiting 
times appreciably. In order to bridge the 
delay effectively, it is as if the subject must 
make an internal notation of what he is waiting
for, perhaps remind himself of it periodically,
but he must spend the remaining time
attending to other less frustrative internal
and external stimuli.


A concept was presented to explain the conditions under which “identification
modeling” (operationally defined as identity between a subject’s self-ratings
on a personality inventory and the personality traits of his experimenter) and
“reference modeling” (defined as an opposite relationship between experimenter
traits and subject self-ratings) will occur. In a study based on this concept,
modeling of either type was observed only with same-sexed experimenter-subject
dyads, and identification modeling occurred for male pairs and reference
modeling for females. Several findings are reported dealing with variations in
experimenter behavior as a function of experimenter and subject sex.


The term “modeling” generally refers to 
imitation of the behavior of one person on 
the part of another (Bandura, 1962). Re- 
cently, however, the term has been used to 
describe a potential source of artifact in psy- 
chological research in which “there is a sig- 
nificant relationship between the experi- 
menter’s own performance of the particular 
task he requires from his subjects and the 
performance he obtains from his subjects 
[Rosenthal, 1966, p. 112].”

In most reported cases this is a positive 
relationship. The evidence, mainly from inter- 
view and clinical testing studies, has been 
reviewed by Masling (1966) and Rosenthal 
(1966). In addition, Rosenthal (1966) re- 
ported 10 further investigations using his 
photo-rating projective measure of success 
and failure feelings, in which 6 showed positive
and 4 showed negative relationships between
experimenters’ and subjects’ scores, with
the difference among correlations significant
at .006.³ As Silver (1968) has pointed out,
“examination of the [Rosenthal] studies 
generally reveals no specification of the condi- 
tions which produce positive as opposed to 
negative correlations [p. 175].” 

One possible explanation is that there are 
two modeling processes, which may be called 
identification modeling and reference model- 
ing. Identification modeling occurs if aspects 
of the experimental situation (such as the 
status or power relationship between the 
experimenter and the subject or the likability 
of the experimenter) encourage identification 
with the experimenter by the subject, and the 
subject incorporates something of the experi- 
menter’s behavior and personality, which pro- 
duces a positive relationship between their test 
protocols. In the absence of conditions for 
identification modeling, however, reference 
modeling may occur if the subject uses the 
experimenter as a point of reference or comparison
for his own self-evaluation or behavior,
yielding an inverse relationship. For
example, on Rosenthal’s (1966) task, a 
success-oriented experimenter would give 


³ Silver (1968) has questioned the validity of
including one study, the source of which was a
personal communication to Rosenthal, on the grounds
that the data were not available. He regards the
proper estimate, without this study, as p < .11.

higher success ratings and may also provide
a “reference model” for his subjects, whereby 
they feel relatively less success oriented and 
give higher failure ratings. 
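
The distinction drawn above can be made concrete with a small sketch: identification modeling corresponds to a positive correlation between experimenter and subject scores, and reference modeling to a negative one. The scores below are invented purely for illustration, and the helper names `pearson_r` and `modeling_type` are hypothetical; a real analysis would also test the correlation for significance.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def modeling_type(r):
    """Label the sign of an experimenter-subject correlation."""
    if r > 0:
        return "identification"  # subjects resemble their experimenter
    if r < 0:
        return "reference"       # subjects contrast with their experimenter
    return "none"

# Hypothetical scores: one experimenter score per dyad, paired with the
# mean score of that experimenter's subjects.
experimenter = [2, 3, 4, 5, 6]
subjects_similar = [3, 3, 4, 6, 6]
subjects_opposite = [6, 5, 4, 4, 2]

label_similar = modeling_type(pearson_r(experimenter, subjects_similar))
label_opposite = modeling_type(pearson_r(experimenter, subjects_opposite))
print(label_similar, label_opposite)
```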

This explanation assumes that the relation- 
ships called “modeling” between experimenter 
and subject scores are based on the effects of 
the experimenter’s personality on the subject, 
which can be only inferred from the studies 
described above. There are three sources of 
data that bear at all on this contention.

Klinger (1967) showed that exposure of 
subjects to actor experimenters playing either 
achievement- or affiliation-oriented roles in- 
creased the expression of these motives, cor- 
respondingly, on both the Thematic Appercep- 
tion Test and a self-descriptive adjective 
checklist. Marwit (1969) found a positive 
relationship between judges’ ratings of the 
voices of experimenters and their subjects on 
a number of personality dimensions. Finally, 
Rosenthal (1966, pp. 62-86) found that subjects
of experimenters rated as “self-assured”
and “dominant” scored the photos for their 
person-perception task higher in the failure 
direction, which may suggest the operation 
of “reference modeling.” 

It appears that what is needed at this stage 
of inquiry is a systematic exploration of the 
basic premises of subject-experimenter model- 
ing. The present investigation was such an 
attempt, directed to the following questions: 
(a) What personality traits of experimenters 
are discriminable by subjects in a brief experimental
interaction? (b) Which of these traits
in experimenters bear a direct and which bear 
an inverse relationship to traits shown by 
subjects in projective and self-report tests; 
that is, for which traits do identification 
modeling and reference modeling occur? (c) 
In what ways do identification modeling and 
reference modeling occur differentially as a 
function of the sex-pairing of experimenters 
and subjects? 

In addition, the data collected for the above 
purposes enabled us to examine differences in 
behaviors of experimenters in the experimental 
situation as these relate to their own and their 
subjects’ sex. 

Rosenthal (1967) has reported some evi- 
dence on this question on the dimension of 
“friendliness,” using judgments of experimenters
by both the experimental subjects
and independent observers of films of the ex- 
periment. He found with both criteria that 
male experimenters were friendlier than fe- 
male experimenters to both male and female 
subjects, and that both male and female 
experimenters were friendlier with female 
subjects. Regarding the interactions, female 
experimenters were friendlier to male than 
female subjects in the auditory mode of com- 
munication (e.g., tone of voice), but less 
friendly to male than female subjects in 
the visual mode (e.g., direction of body in 
relation to the subject). 

Sarason and Winkel (1966) have some 
data which contradict one of Rosenthal’s 
findings. Using judgments of experimental 
subjects of both sexes, they found that female 
experimenters were rated friendlier than males. 

There are possible artifacts in all of these
studies, however. Reports of [...]ating male and female experimenters. Further,
the film observers in Rosenthal’s studies
watched both experimenters and subjects, and
their judgments may have been affected by
their concepts of how experimenters should
behave toward male and female subjects. In
addition, the films were of experiments where
experimenters were deliberately given expectancies
for subjects’ responses, and the sex
differences observed may be restricted to the
type of situation in which experimenters are
trying to covertly influence subjects.

For the present study, experimenters were 
not given expectancies about subjects’ re- 
sponses, and film observers saw and heard 
only the experimenter. Further, each experi- 
menter interacted with the same subjects, 
who were trained confederates of the investi- 
gators, thus minimizing variability in experi- 
menters’ behavior due to subject differences. 
For these reasons, the present data may yield 
a more valid assessment of sex differences
in experimenters’ behaviors than has been 
previously obtained. 

The report of the present investigation is 
divided into two phases: Study I concerns the 
discriminability of experimenter traits by subjects
and variations in the experimenter’s
behavior as a function of experimenter and
subject sex. Study II deals with modeling. 


STUDY I
Method 


Experimenters, confederate subjects, and film ob- 
servers. This phase of the investigation used six 
experimenters, six confederate subjects, and 257 film 
observers. Experimenters were three male and three 
female senior honors students who were in the in- 
vestigator’s employ as part-time research assistants 
throughout the academic year. Confederate subjects 
were three male and three female introductory psy- 
chology students who were paid for their period of 
participation. Film observers, 129 males and 148 
females, were introductory psychology students who 
were meeting part of the course requirement for 
research participation. 

Filmed samples of experimenter behavior. Experi- 
menters administered a brief person-perception task 
to confederate subjects. The experimenter delivered 
the instructions verbally and then showed the subject 
10 photographs, 1 at a time. After each presenta- 
tion, the subjects made written ratings of the person 
in the photo on three bipolar trait scales. 

The experimenter and the subject sat across a 
table from each other during this interaction with 
the experimenter facing a one-way-vision mirror, at 
a slight angle, from which 16-millimeter sound films 
were taken. The camera was directed at the 
experimenter only. 

Confederate subjects had been administered the 
person-perception test by one of the investigators 
previously, as part of their briefing. Their instruc- 
tions were to take the test the first time as naturally 
as they could; then to perform as similarly as possible
on subsequent administrations. They were given
pseudonyms to use for each experimenter. Subjects
were told that experimenters would be filmed and 
that the purpose of their role playing was to enable 
the investigators to study behavioral differences be- 
tween experimenters with subject factors held as 
constant as possible. 

Each experimenter saw the six subjects consecu- 
tively, in the same sequence arranged in ABBA order 
by subject sex, with sessions scheduled 10 minutes 
apart. Sessions for all experimenters were filmed in 
the same place on the same day. The sequence in 
which experimenters conducted their sessions was 
also arranged in ABBA order by sex of the experi- 
menter. Experimenters had been given experience 
administering the task with two pilot subjects each, 
on a previous day.
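
The ABBA counterbalancing described above can be illustrated with a short sketch. It assumes, since the text does not say, that the four-step pattern was simply continued to cover six sessions and that the first session was with a male subject; `abba_sequence` and the labels "M" and "F" are illustrative names only.

```python
from itertools import cycle, islice

def abba_sequence(n_sessions, first="M", second="F"):
    """Return n_sessions labels following a repeating A-B-B-A pattern."""
    pattern = [first, second, second, first]
    return list(islice(cycle(pattern), n_sessions))

# Six confederate subjects seen consecutively by each experimenter
order = abba_sequence(6)
print(order)
```

With six sessions this continuation gives three sessions of each sex, matching the three male and three female confederates, although other ways of extending the pattern are equally consistent with the description.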

After the sessions, experimenters were informed 
that their subjects were confederates and that they 
had been filmed. All stated that they were not sus- 
picious at all about either of these procedures. They 
were told solely about the aspects of the in- 
vestigation dealing with differential behavior as a 
function of experimenter and subject sex, so they 
would not be aware of the purpose of the tasks they 
were to administer later to subjects in the modeling 
study. Their permissions were obtained to show the 
films to judges. 

Experimenter trait scales. An attempt was made 
to construct a comprehensive set of scales appropriate 
for judging experimenter personality traits from the 
films. Development of the instrument was based in 
large part on the trait factor-analytic studies of 
Krieger (1964), Nowlis and Nowlis (1956), and 
Wishner (1960). It comprised twenty-two 7-point, 
bipolar scales, each containing three synonymous trait 
descriptive adjectives at each pole (see below). 

Judges were informed by written instructions that
“each scale has a description of a personality type
on one side and a description of an opposite type
on the other side,” and were asked to circle the
number between scales that they believed “best
represents the personality of the experimenter with
regard to the two descriptions.” They were instructed
further to use the numbers closest to the adjectives
as judgments of very much, moderately, and slightly,
the type described at the corresponding pole, and to
circle the middle number (four) if “the best description
of the experimenter is that he falls between the
two types.”

 1. Outgoing, expansive, loud versus reserved, shy, quiet
 2. Irritable, defiant, rebellious versus pleasant, agreeable, cooperative
 3. Lively, energetic, enthusiastic versus sluggish, lazy, bored
 4. Serious, contemplative, intent versus light-hearted, carefree, playful
 5. Emotional, excitable, passionate versus placid, unemotional, dispassionate
 6. Egotistic, boastful, pretentious versus modest, humble, unpretentious
 7. Dogmatic, unbending, opinionated versus adaptable, flexible, open-minded
 8. Intelligent, wise, deep thinking versus shallow, foolish, unintelligent
 9. Efficient, capable, competent versus inept, incompetent, ineffectual
10. Independent, self-reliant, decisive versus dependent, indecisive, conforming
11. Sociable, friendly, communicative versus unsociable, isolated, unfriendly
12. Stern, strict, hard versus lenient, permissive, tender
13. Creative, imaginative, inspired versus practical, realistic, straight thinking
14. Cynical, skeptical, suspicious versus idealistic, trusting, unsuspicious
15. Open, blunt, candid versus secretive, tactful, discreet
16. Poised, calm, self-assured versus nervous, insecure, self-doubting
17. Conscientious, dependable, greedy versus disorderly, hasty, undependable
18. Spontaneous, impulsive, unconventional versus formal, conservative, cautious
19. Moody, depressed, worrisome versus optimistic, cheerful, satisfied
20. Ambitious, competitive, industrious versus casual, nonchalant, leisurely
21. Passive, submissive, obliging versus forceful, dominant, aggressive
22. Aloof, cold, distant versus warm-hearted, affectionate, sympathetic

TABLE 1
FACTOR STRUCTURE OF EXPERIMENTER RATINGS

Factor                           Scale                        Loading
I: Cold-Warm                     Irritable-pleasant           .774
                                 Stern-lenient                .743
                                 Aloof-warm-hearted           .697
                                 Dogmatic-adaptable           .689
                                 Cynical-idealistic           .629
                                 Unsociable-sociable          .618
                                 Egotistic-modest             .609
                                 Forceful-passive             .576
II: Competence-Incompetence      Efficient-inept
                                 Conscientious-disorderly
                                 Poised-nervous
                                 Independent-dependent
                                 Intelligent-shallow
III: Extraversion-Introversion   Spontaneous-formal           .718
                                 Outgoing-reserved            .705
                                 Light-hearted-serious        .638
                                 Creative-practical           .624
                                 Emotional-placid             .539
IV: Vigorous-Apathetic           Ambitious-casual             .702
                                 Emotional-placid             .503

Note. The eigenvalues for Factors I-IV are 10.87, 6.19,
5.42, and 1.72, respectively.

Film judgments. Film-judging sessions were con- 
ducted with groups ranging from 3 to 11 observers. 
(Discrepancies in size were due to variable numbers 
signing for a given session.) Each judge saw all six 
experimenters conducting sessions with either the 
male or the female confederate subjects. Judges 
watched a given experimenter interact with the 
three confederate subjects and then rated the experi- 
menter. To maintain constancy of observing time 
between experimenters, the first 2-minute segment of 
each session was used. The voice of the confederate 
subject, whenever present, was deleted from the film. 

There were 36 separate observer groups, 18 watch- 
ing experimenters with male and 18 with female 
subjects. All judging sessions were administered by 
the same investigator. Within each set of 18 sessions, 
order of presenting the experimenters was counter- 
balanced so that each experimenter was observed 
in each position and was observed preceding 
and following each of the other experimenters 
an equal number of times. Male and female experi- 
menters were always presented in alternating order. 


4 This was done to obtain separate judgments of
experimenters interacting with male and female
subjects without requiring observers to make more
than one judgment of a given experimenter. It was
considered that the number of observers was
sufficient to assume equivalence of the two groups.

Assignments of observer groups to films of
experimenters interacting with either male or female
subjects were also counterbalanced in ABBA order. In
addition, six orders of scales were constructed, three 
in rotating order beginning with Scales 1, 8, and 16, 
and three in opposite rotating order, beginning with 
Scales 22, 15, and 7, with the six orders counter- 
balanced within each set of 18 observer groups. 
Observers were anonymous; they recorded only
their sex on the forms. They were informed by the
investigator that the experimenters were not aware
that they were being filmed and that the purpose
of their judgments was to assess “the extent to which
an experimenter’s personality may come through to
his subjects.”
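
The six scale orders described above can be generated mechanically. The following is a minimal sketch, assuming that "rotating order" means a cyclic sequence that wraps around from Scale 22 to Scale 1 (and from Scale 1 to Scale 22 in the opposite direction). The function name `rotated_order` is hypothetical and is not taken from the original procedure.

```python
def rotated_order(start, n_scales=22, reverse=False):
    """Return a cyclic ordering of scale numbers beginning at `start`.

    Assumption: "rotating order" is read as a wrap-around sequence,
    ascending by default and descending when reverse=True.
    """
    step = -1 if reverse else 1
    return [(start - 1 + step * i) % n_scales + 1 for i in range(n_scales)]


# Three forward orders and three opposite orders, using the starting
# scales named in the text.
forward_orders = [rotated_order(s) for s in (1, 8, 16)]
reverse_orders = [rotated_order(s, reverse=True) for s in (22, 15, 7)]
all_orders = forward_orders + reverse_orders
```

Under this reading, every order presents each of the 22 scales exactly once, and only the starting point and direction change.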

The protocols for six judges were excluded, either 
because they were incomplete or they indicated, in 
response to the investigator’s question, that they 
were acquainted with one of the experimenters. Of 
the remaining 251, 27 were randomly excluded to 
obtain equal Ns of 56 in the four subcells, that is, 
male and female judges watching experimenters 
interacting with male and female subjects. 

Data analysis. First a factor analysis was
performed across the 22 experimenter rating scales for
all of the 1,344 protocols (224 observers, each rating
six experimenters). Then an attempt was made to
locate the personality dimensions on which
experimenters were discriminable by judges. This was
done separately with male and female experimenters
by 2 X 2 X 3 analyses of variance for sex of judge,
sex of subject, and experimenters, for both individual
traits and factors. The focus of these analyses was
on experimenter effects and Experimenter X Sex of
Subject interactions.

In a third analysis, male and female experimenters 
were compared in a series of 2 X 2 X 2 analyses of 
variance, across traits and factors, for sex of experi- 
menter, sex of subject, and sex of judge. The pur- 
poses of these analyses were to assess differential 
behavior of experimenters as a function of their own 
and their subjects’ sex and, incidentally, to observe 
sex differences in judging behavior. The two criterion 
scores for each judge were the summed ratings across 
the three subjects for male and female experimenters. 
As noted earlier, different judge groups observed 
experimenters interacting with male and female sub- 
jects; thus, sex of experimenter was treated as a 
within factor and sex of subject and sex of judge as 
between factors. 


Results 


Factor structure of experimenter ratings. A
principal components analysis of the 22 trait
variables yielded four terms with eigenvalues
greater than one. These four factors were
placed in a varimax rotation and, from this
analysis, scales with loadings of greater than
.5 were used to define the factor. Eigenvalues
for the four factors and the scales constituting
each, with their loadings, are shown in
Table 1. (Scales are listed by the first pair
of adjectives.) The first column of Table 1
contains the descriptive terms given the
factors.
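
The extraction steps named above (principal components, the eigenvalue-greater-than-one rule, varimax rotation, and a loading cutoff of .5) can be sketched as follows. This is an illustration on synthetic data, not the authors' computation: the data, the function names `retained_loadings` and `varimax`, and the iteration settings are all assumptions, and the rotation uses the common SVD-based varimax iteration.

```python
import numpy as np


def retained_loadings(ratings):
    """Principal components of the correlation matrix, keeping only
    components whose eigenvalue exceeds one."""
    corr = np.corrcoef(ratings, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)      # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]            # re-sort to descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    keep = eigvals > 1.0
    return eigvals, eigvecs[:, keep] * np.sqrt(eigvals[keep])


def varimax(loadings, max_iter=200, tol=1e-10):
    """Orthogonal varimax rotation via the usual SVD-based iteration."""
    n_vars, n_factors = loadings.shape
    rotation = np.eye(n_factors)
    criterion = 0.0
    for _ in range(max_iter):
        rotated = loadings @ rotation
        column_ss = np.diag((rotated ** 2).sum(axis=0))
        target = rotated ** 3 - rotated @ column_ss / n_vars
        u, s, vt = np.linalg.svd(loadings.T @ target)
        rotation = u @ vt
        if s.sum() - criterion < tol:            # no further improvement
            break
        criterion = s.sum()
    return loadings @ rotation


# Synthetic stand-in for the ratings (hypothetical): 500 protocols on
# 7 scales driven by two latent dimensions of unequal size.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 2))
pattern = np.array([[1, 0]] * 4 + [[0, 1]] * 3, dtype=float)
ratings = latent @ pattern.T + 0.3 * rng.normal(size=(500, 7))

eigvals, loadings = retained_loadings(ratings)
rotated = varimax(loadings)
defining = np.abs(rotated) > 0.5   # scales used to define each factor
```

Because the rotation is orthogonal, it redistributes variance among the retained factors without changing each scale's communality.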

Factors I and III seemed the most readily
describable, corresponding, respectively, to
dimensions of “cold-warm” and “extraversion-introversion.”
Factor II was most heavily
loaded on the traits of efficiency, conscientiousness,
and poise, and appeared best defined
as “competence-incompetence.” Factor
IV comprised just two scales, referring to
ambition and drive on the one hand and
emotionality on the other, and “vigorous-apathetic”
seemed the most suitable label.

Differences among experimenters. Table 2 
presents the analyses of experimenter differ- 
ences and Experimenter X Sex of Subject 
interactions, separately for male and female 
experimenters. In the interests of conciseness, 
the data are presented for the factors only. 
(Across the 22 traits, there were 21 experi- 
menter effects significant at the .05 level for 
males and 17 for females; there were 7 sig- 
nificant interactions for males and 13 for 
females.) 

Scores in this table and in Table 3 are arranged
from low to high in the direction of the first-
to the second-named adjective. Table 2 shows
significant experimenter effects on all four
factors for both sexes.

For male experimenters, there was one significant
interaction: Experimenter 2 was rated
more competent with female than with male
subjects, while Experimenter 3 showed the
opposite relationship.

For female experimenters, two interactions 
were significant: both involved Experimenter 
2, who, alone of the three experimenters, was 
less warm and more introverted with male 
than with female subjects; Experimenter 3 
showed opposite effects on the cold-warm 
dimension. 

Experimenter-, subject-, and judge-sex ef- 
fects. This section presents the results for the 
analyses of variance comparing male and fe- 
male experimenters, subjects, and judges. 
Judge-sex effects, while they may be of some 
interest, were incidental to the purposes of 
the study. These data are described in the 
following paragraph but are not included in 
the tabular presentation. 

There were significant (p < .05) main ef- 
fects on Factors I, II, and IV, with female 
judges making ratings more in the warm, 
competent, and vigorous directions than their 
male counterparts. There was one significant 
Sex of Judge X Sex of Experimenter inter- 
action; on Factor III, female judges showed 
more of a difference than males in their 
ratings of female experimenters as more extra- 
verted. This difference for male judges did, 
however, reach significance as well. 


TABLE 2
DIFFERENCES IN MEAN RATINGS OF INDIVIDUAL EXPERIMENTERS ON FOUR FACTORS,
BY SEX OF SUBJECT

                                 Male experimenter                               Female experimenter
Factor                  Subject |   1   |   2   |   3   | F: Exp. | F: Exp. X Subj. |   1   |   2   |   3   | F: Exp. | F: Exp. X Subj.
Cold-Warm               Male    | 20.37 | 23.70 | 19.41 | 27.78** | ns              | 23.81 | 15.81 | 16.43 | 76.44** | 19.69**
                        Female  | 21.05 | 23.65 | 19.52 |         |                 | 22.59 | 20.96 | 13.94 |         |
Competent-Incompetent   Male    | 13.44 | 16.04 | 15.74 | 23.13** | 11.53**         | 12.84 | 14.99 | 15.87 | 24.40** | ns
                        Female  | 14.05 | 13.81 | 17.38 |         |                 | 12.44 | 13.15 | 15.28 |         |
Extraverted-Introverted Male    | 15.66 | 12.75 | 16.30 | 65.47** | ns              | 12.01 | 14.13 | 13.82 | 13.51** | 6.92**
                        Female  | 15.22 | 11.58 | 16.69 |         |                 | 11.88 | 11.77 | 14.76 |         |
Vigorous-Apathetic      Male    |  7.93 |  8.32 |  8.33 |  3.75*  | ns              |  7.24 |  8.28 |  8.24 | 13.61** | ns
                        Female  |  7.63 |  7.63 |  8.20 |         |                 |  6.98 |  7.33 |  8.04 |         |

* p < .025.
** p < .001.

TABLE 3
DIFFERENCES IN MEAN RATINGS OF EXPERIMENTERS ON FOUR FACTORS,
BY SEX OF EXPERIMENTER AND SUBJECT

                          Male experimenter     Female experimenter                    F
Factor                    Male      Female      Male      Female      Sex of         Sex of    Experimenter
                          subjects  subjects    subjects  subjects    experimenter   subject   X Subject
Cold-Warm                 63.47     64.21       56.05     57.49       41.05***       ns        ns
Competent-Incompetent     45.21     45.23       43.70     40.87       12.99***       ns        3.03*
Extraverted-Introverted   44.71     43.76       39.96     38.40       44.08***       ns        ns
Vigorous-Apathetic        24.58     23.45       23.76     22.3[?]     [illegible]    6.16**    ns

* p < .10.
** p < .01.
*** p < .001.


Table 3 presents the analyses of experimenter
and subject sex differences. Again, in
order to be concise, these data are presented
for factors only. (Across traits, there were 14
main effects of sex of experimenter, 4 main
effects of sex of subject, and two interactions
significant at the .05 level.)

Significant effects of experimenter sex were
found for the four factors. Female experimenters
were judged as less warm and more
competent, extraverted, and vigorous than
males. There was one significant subject-sex
effect; experimenters were judged as more
vigorous (Factor IV) when they were interacting
with female subjects. There was an
interaction between experimenter sex and
subject sex that approached significance on
Factor II, which may suggest that female
experimenters seemed more competent with
female than with male subjects.


Discussion 


From the principal components analysis de- 
scribed in Table 1, we may conclude that 
impressions of experimenters are largely 
structured in terms of four dimensions. 

The analyses of experimenter effects (Table
2) suggest that even with the limited sample
of behavior presented to judges, experimenters
were highly discriminable on all four dimensions.
The experimenter’s personality, then,
does make itself present in a very brief and
structured experimental situation, and the
basic premise for the occurrence of modeling
is supported. Further, the high discriminability
of experimenters suggests that to the extent
that an experimenter’s personality can
affect a subject’s responses, by whatever process,
it will be a potential source of variance
within and between experiments.

The data on experimenter-sex effects (Table
3) corroborate Rosenthal’s observations that
male experimenters were judged as “warmer”
than females. Male experimenters in the present
study were also judged to be more incompetent,
introverted, and apathetic. The
concept that emerges is that the female experimenter
acts colder and more formally and,
at the same time, is seen as more capable,
outgoing, and energetic.

Unavoidably, judgments were potentially 
confounded by the judges’ ideas of how males 
and females should behave; however, sex dif- 
ferences seemed to be counter to what we 
might intuitively expect these notions to be. 

One explanation is that the female high-quality
student is more select than the male
in terms of seriousness and competence. She
may also be more anxious to conform to
expected behavior as an experimenter, inasmuch
as this is probably perceived as a
predominantly male role.

Rosenthal’s finding that both male and 
female experimenters were friendlier to female 
subjects was not observed in the present data 
in terms of differences on the cold-warm 
dimension. Experimenters across sexes, how- 
ever, were more vigorous appearing with 
female subjects. The discrepancy between 
studies may be a function of the different 
experimenter samples, but it should be noted 
also that in the present procedure, in contrast 
to Rosenthal’s, judges were not aware of the 
subject’s sex, and ratings did not, therefore, 
contain this possible source of bias.

On only one of the four factors did an
interaction between experimenter and subject
sex approach significance, providing the suggestion
that female experimenters behave
more competently with female than with male
subjects, perhaps as a function of less anxiety.
Our conclusions about all of these sex ef- 
fects are tempered by the interactions of 
individual experimenters and subject sex in 
Table 2. The three significant interactions of
a possible eight suggest that there are con- 
siderable individual variations within male 
and female experimenter groups in the differ- 
ences they will manifest toward same- and 
opposite-sex subjects. From a methodological 
standpoint, this should make us somewhat 
wary in applying generalities about experi- 
menter sex differences or Experimenter X Sub- 
ject interactions. In a given experiment, it 
would seem to require more than one or two 
experimenters of each sex to adequately 
counterbalance for, or draw generalizations 
about, such effects. This caution pertains, as 
well, to the present data. 


STUDY II
Method 


As described previously, Study II was concerned 
with “modeling”: the relationship between charac- 
teristics of experimenters and responses of their sub- 
jects on tasks involving projection or description 
of self. About 6 weeks after the experimenters were 
rated for Study I, they each administered a projec- 
tive and a self-rating task to samples of subjects. 

Subjects came from the same introductory psy- 
chology course as the film observers of Study I, 
although different class sections of the course were 
solicited. There were 108 males and 130 females. 
Each experimenter ran between 18 and 21 males and 
between 16 and 20 females. (Differences in Ns were 
due to subjects not being available for given time 
slots and to cancellations.) Subjects were given the
tasks individually and, inasmuch as scheduling problems
would allow, the time periods in which
experimenters conducted their sessions and the order of
male and female subjects for each experimenter were 
counterbalanced. 

For the projective test, subjects were told: “The
purpose of this study is to see if there are ways in
which people judge personality from facial
characteristics.” They were shown three photographs each
of four different people, posed with smiling, serious,
and neutral expressions. With the three photographs
before him, the subject rated the person on the same
22-item inventory on which experimenters had been
judged in Study I. (The reader will recall that
experimenters had no previous acquaintance with this
inventory.)

For the self-rating task, after all of the photographs
were shown, the subject was asked to “rate
himself” on the same items.

The initial data analyses were directed to the ques- 
tion of whether there were differences among experi- 
menters in the responses obtained from subjects, 
both for the projective measure (the sum of the 
ratings of the three sets of photos) and for the 
self-ratings. This required 3 X 2 analyses of variance 
(unweighted-means analyses) for each of the 22 
traits, separately for male and female experimenter 
groups, for the three experimenters and sex of subject.
These analyses used the 22 traits and not the
four factors inasmuch as it was considered that the
factor structure for the subject’s self-ratings would
not be identical to that for judges’ ratings of
experimenters.

The next analysis assessed whether these differ- 
ences, if present, were attributable to reference 
modeling or identification modeling. Separate analy- 
ses were done for male and female experimenters 
administering to male and female subjects. For each 
of these four groups, a correlation coefficient was 
taken between two sets of difference scores: the 
difference between the mean judgments of any two 
experimenters in Study I for a given trait and the 
difference between the mean ratings made by the 
subjects of these experimenters on the same trait 
(for both the projective and the self-rating tasks). 
There were three pairs of experimenters and 22 traits
for each analysis, hence the N for each correlation was 66.
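
The difference-score correlation described above can be sketched as follows. This is a minimal illustration under stated assumptions: the input arrays are hypothetical means (one row per experimenter, one column per trait), and the function names `pairwise_differences` and `modeling_correlation` are invented for the example rather than taken from the study.

```python
from itertools import combinations

import numpy as np


def pairwise_differences(means):
    """Differences in trait means for every pair of experimenters.

    `means` has shape (n_experimenters, n_traits); the result stacks
    the per-trait differences for each pair into one vector.
    """
    means = np.asarray(means, dtype=float)
    pairs = combinations(range(means.shape[0]), 2)
    return np.concatenate([means[i] - means[j] for i, j in pairs])


def modeling_correlation(judged_means, self_rating_means):
    """Correlate differences in judged traits with differences in the
    self-ratings obtained by the same experimenters."""
    d_judged = pairwise_differences(judged_means)
    d_self = pairwise_differences(self_rating_means)
    return np.corrcoef(d_judged, d_self)[0, 1], d_judged.size


# Hypothetical data: 3 experimenters by 22 traits.
rng = np.random.default_rng(1)
judged = rng.normal(size=(3, 22))
r_direct, n = modeling_correlation(judged, 2.0 * judged + 1.0)
r_inverse, _ = modeling_correlation(judged, -judged)
```

With three experimenters there are three pairs, so 22 traits yield 66 difference scores per correlation. The three differences for a given trait are not independent, since any one of them is determined by the other two.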

Further observations, described in the following 
section, were made to establish the specific traits for 
which modeling of either type occurred. 


Results 


For scores on the photo ratings, there were 
no significant (p < .05) experimenter effects 
and no significant Experimenter X Subject 
Sex interactions across the 22 traits for either 
male or female experimenter groups. 

These effects did occur, however, on the
self-rating measure. For male experimenters,
there were five significant (p < .05) experimenter
effects of a possible 22 and three
significant interactions. Differences among
experimenters occurred for the traits conscientious,
spontaneous, moody, efficient, and
stern; interactions on outgoing, independent,
and sociable.

For female experimenters, there were eight 
significant experimenter effects and no inter- 
actions. Differences among experimenters oc- 
curred on the traits poised, ambitious, passive, 
lively, efficient, independent, stern, and open. 

Inasmuch as subjects responded differently
with different experimenters only on the
self-rating task, the analyses of modeling were
conducted solely for this measure. Correlations
between differences in mean judgments
for any two experimenters and corresponding
differences in mean self-ratings by their subjects
were calculated separately for male and
female experimenter groups administering to
male and female subjects.

The correlations for male experimenters
interacting with female subjects and female
experimenters interacting with male subjects
did not approach significance (.159 and
-.004, respectively). The correlations for
males with males and females with females
both reached significance at the .01 level,
and were in opposite directions (.418 and
-.320, respectively). These correlations were
derived partially from between-experimenter
and partially from within-experimenter
sources of variance. Thus, to obtain an additional
estimate, in all four cases, we averaged
the three correlation coefficients, each based
on Ns of 22. These averages were at about
the same magnitudes as the overall correlations:
.08 for male experimenters interacting
with females, .17 for female experimenters
interacting with males, and .443 and -.255
for male-male and female-female pairs,
respectively.

Thus, using both analyses, we find that dif- 
ferences between experimenters on the various 
traits were systematically related to differ- 
ences in their subjects’ self-ratings, but only 
for same-sexed experimenter—subject dyads, 
directly for male pairs and inversely for 
female pairs. 

Further observations were made to judge 
whether, for opposite-sexed dyads, there were 
admixtures of positive and negative relation- 
ships for different traits which would account 
for the null correlations. Instances were recorded
in which there were significant differences
(by t tests at p < .05) between the
mean self-rating scores obtained by any two 
experimenters on a given trait and significant 
differences, by the same criterion, between the 
mean judgments made of these experimenters 
on the same trait. 

There were four such cases for male experimenters
with female subjects: three with differences
in the same direction and one in the
opposite. For female experimenters with male
subjects, there were two cases, one in each
direction. Thus, it did not appear that the
absence of significant correlations for
opposite-sexed pairs was based on dual effects
which canceled each other out. The same
analyses were performed for the same-sexed
dyads to try to establish the specific traits on
which modeling occurred. For male dyads,
there were eight cases, all with differences in
the same directions. For female dyads there
were five cases, all with differences in the
opposite directions.

Three of these 13 cases were for the trait
efficient; two each for the traits poised,
independent, and lively; and one each for the
traits spontaneous, moody, outgoing, and
sociable. If we add the six cases for
opposite-sexed dyads, then there are six for the trait
efficient; two each for poised, [...]
and lively; and one each for conscientious,
spontaneous, moody, outgoing, sociable,
and stern.

In terms of the factor structure described
previously, 11 instances of modeling, of either
type, occurred for traits located in Factor II
(competence-incompetence), two each in Factor I
(warm-cold) and Factor III (introversion-extraversion),
and four for traits not included in
any factor.

There were no suggestions of systematic 
differences in traits modeled with male and 
female experimenters or subjects or any com- 
binations of these. 


Discussion 


One strong finding of this study was that 
different experimenters did obtain different 
responses from subjects on the self-rating in- 
ventory. Of the 44 analyses of experimenter 
main effects, 13 were significant at .05, which 
is more than five times the number that 
would be expected by chance. 
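
The chance comparison above is simple arithmetic and can be checked directly. The sketch below also computes a binomial tail probability, which goes beyond what the text reports and rests on an added assumption: that the 44 tests are independent, which is unlikely to hold exactly because the traits are intercorrelated. The function names are hypothetical.

```python
from math import comb


def expected_by_chance(n_tests, alpha):
    """Number of significant results expected if every null is true."""
    return n_tests * alpha


def binomial_upper_tail(n_tests, k, alpha):
    """P(at least k of n_tests are significant), assuming every null
    hypothesis is true and the tests are independent."""
    return sum(
        comb(n_tests, i) * alpha ** i * (1 - alpha) ** (n_tests - i)
        for i in range(k, n_tests + 1)
    )


expected = expected_by_chance(44, 0.05)    # about 2.2 effects
ratio = 13 / expected                      # observed relative to chance
tail = binomial_upper_tail(44, 13, 0.05)   # very small under independence
```

The ratio of 13 observed effects to roughly 2.2 expected is a little under six, which is consistent with the "more than five times" statement in the text.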

Experimenter effects were not present for 
the photo-rating task. Inasmuch as these 
photos were not preselected for neutrality on 
the dimensions rated, as were, for example, 
Rosenthal’s, this instrument may not have 
been an adequate projective measure. 

The self-rating data also provided evidence
of both identification and reference modeling,
but not in a very concise pattern. The suggestion
was that modeling of either sort occurred
only in same-sex experimenter-subject
dyads, which is plausible on the assumption
that subjects more readily use an experimenter
of the same sex as a standard for
self-evaluation.

Identification modeling, however, occurred 
with male and reference modeling with female 
dyads. In terms of the theoretical framework 
described earlier, we may conjecture that male 
subjects tend to identify with male experi- 
menters because the role of psychological ex- 
perimenter is generally associated with the 
male sex. In the absence of such identifica- 
tion, female experimenters are used by female 
subjects as a reference point for their self- 
ratings. 

The questions of whether these modeling- 
sex interactions hold in other experimental 
settings and whether they are predictive of 
behavior in comparable naturalistic situations 
(e.g., the family, the classroom) remain for 
further research. 

There were no suggestions of differences in 
kind of modeling as a function of trait. There 
was an indication that modeling of both kinds 
occurs most frequently for traits related to 
the “competence” dimension. 

Though these data do not go very far in 
clarifying the modeling process, they do pro- 
vide further evidence of its operation in the 
psychological experiment. Coupled with the 
findings on sex effects in Study I, they em- 
phasize anew the need for randomization and 
control of experimenter effects where appropriate
in psychological studies. This is a
means of increasing validity and replicability
of data, advocated by Rosenzweig in 1933
and again by Brunswik in 1947 and McGuigan
in 1963, which researchers have apparently
not heeded to any meaningful extent.


The present research investigated the distribution of rewards under conditions
of different total amounts of money. Results indicate that subjects divided
money equally between themselves and a co-worker with equal inputs, only
when the total amount of reward available was congruent with an internal
standard of fair pay; when the total amount was either substantially more or
substantially less than this amount, the subjects gave proportionately more to
themselves. These results are used as the basis for an argument that a more
complex theory of equity (one that is not based solely on interpersonal
comparisons of input/outcome ratios) is needed to integrate the findings of this
and previous studies.


Social scientists have shown increasing 
interest in the extent to which persons value 
the welfare of others relative to their own 
welfare. A major portion of this work has 
been concerned with the sense of equity. The 
primary theoretical treatment in this area, 
equity theory (Adams, 1965; Homans, 1961), 
has as its fundamental premise the idea that 
group members will be satisfied with a dis- 
tribution of rewards (outcomes) to the degree 
that such rewards are proportional to each 
member’s contribution to the group (inputs).
This premise strongly implies that the basic 
mechanism for the generation and mainte- 
nance of a state of equity is the social com- 
parison of one’s own input/outcome ratio to 
those of other group members. 

However, Weick (1966; Weick & Nesset,
1968) argued that this approach is somewhat
simplistic. He theorized that persons not only
compare their own inputs and outcomes with
those of other group members (“comparison
equity”), but they also compare them with
some internal standard derived from past
social experiences (“own equity”); further,
they use this standard in evaluating the
other’s input/outcome ratio as well (“other
equity”).

A somewhat different explanation of the
manner in which concerns with equity are
expressed was offered by Lane and Messé
(1971) as a possible interpretation of their
results. Their explanation proposes that persons,
when distributing rewards, only consider
the level of inputs of the other individual.
Accordingly, if the inputs of the other
surpass a given threshold value, rewards will
be distributed equally; if not, the persons
choosing will take proportionally more of
the total reward for themselves. Note that
this explanation is somewhat congruent with
Weick’s concept of other equity.

The present research is a direct test of
the three theoretical positions described
above. Using a procedure similar to that
developed by Leventhal (e.g., Leventhal &
Anderson, 1970; Leventhal & Lane, 1970;
Leventhal & Michaels, 1969), subjects were
asked to distribute rewards to themselves
and co-workers with whom they were paired.
Some subjects were given an amount of
money to distribute which, if divided equally,
would give each individual the reward that
other persons from the same population perceived
as reasonable. Other subjects were
given rewards to distribute that were either
double or one-half of this amount. This same
procedure was followed in three different
conditions in which the person choosing and
his co-worker had the same level of inputs,
but these levels differed absolutely.

If, as Adams (1965) and Homans (1961) 
implied, comparison equity is the only cri- 
terion by which distributions of rewards are 
evaluated, then subjects in the present re- 
search should divide rewards equally, regard- 
less of the absolute level of inputs and the 
total amount of money available, since they 
and their co-worker had equal inputs. This 
is so because the comparison-equity cri- 
terion disregards the absolute magnitude of 
input/outcome ratios and only focuses upon 
the degree to which these ratios are equal. 
There are a number of studies, conducted 
in a wide range of research settings, which 
apparently support the position that persons 
compare input/outcome ratios (comparison 
equity) when evaluating the degree to which 
they are satisfied with a distribution of 
rewards (e.g., Adams & Jacobsen, 1964;
Andrews, 1967; Clark, 1958; Jacques, 1961; 
Leventhal & Michaels, 1969; Patchen, 1961; 
Sayles, 1958). However, in these studies it 
is not possible to determine directly if sub- 
jects were evaluating payments on the basis 
of comparison equity, own and other equity, 
or both. But the possibility does remain that 
comparison equity is the sole criterion of 
satisfaction with a distribution of rewards. 
On the other hand, if persons take into
account the internal standards of own and
other equity, then results different from those
predicted solely from a comparison-equity
framework would be expected in the present
study. It seems reasonable that persons should
be more concerned that their own reward
meets their internal standard of fair pay
(own equity) than that the other person’s
reward does so (other equity). This follows
from, and is an extension of, the position
advanced by Adams (1965) and Homans
(1961) that there is more dissatisfaction
generated from inequitable underpayment
than from inequitable overpayment. If persons
are more concerned with own equity
than with other equity, then a subject in the
present study should give a larger portion
of the reward to himself than to his co-worker
when the total amount to be distributed
is insufficient to give both persons
the payments prescribed by the subject's
internal standard.

The results of Lane and Messé (1971) 
suggest two additional hypotheses relevant to 
own and other equity. They found that when 
subjects distributed a total amount of money 
that seemed to be an overpayment, given 
their own and their co-worker’s inputs, they 
tended to give themselves proportionately 
more money. This finding implies that persons 
will tend to keep any reward in excess of 
the amounts sufficient to satisfy their sense 
of own and other equity. Moreover, they 
also found that subjects tended to divide 
the money equally when the total amount of 
reward seemed commensurate with inputs. 
This suggests that when the amount of money 
available is just sufficient to satisfy own and 
other equity, persons will behave accordingly 
and, with equal inputs, divide rewards 
equally. 

The threshold explanation offered by Lane 
and Messé (1971) is a third orientation 
toward equity theory, and it generates a 
different set of predictions concerning how 
rewards will be distributed in the present 
research. If the threshold interpretation is 
valid, then persons in the low-inputs condi- 
tion should give themselves more money than 
their co-workers, and persons in the high- 
inputs condition should divide rewards 
equally, regardless of the total amount of
money available for distribution. 

Evidence in support of the threshold po- 
sition is provided by the results of Lane and 
Messé (1971). In this study, persons were 
given choices between distributions of rewards 
that were more or less equal and ones that 
maximized their own payments. Irrespective 
of their own input levels, subjects tended to 
choose the equal alternatives when inputs of 
the co-worker were high, but they tended 
to maximize their own reward when the 
inputs of the co-worker were low. Further, 
as they noted, the results of other studies 
(e.g., Leventhal & Michaels, 1969) can be
interpreted within this framework. 

In summary, predictions about the distributions
of rewards based solely on the
concept of comparison equity state that equal
inputs should generate equal outcomes, 
regardless of the absolute magnitude of the 
total reward. On the other hand, the theo- 
retical position advanced by Weick that 
persons judge their own and the other’s 
input/outcome ratios against a socially de- 
rived internal standard and that they are 
more concerned about their own welfare 
generates the following three hypotheses: 
(a) when inputs are equal and the total 
amount of money available is congruent with 
the internal standard of fair pay, subjects 
should distribute rewards equally; (b) when 
inputs are equal and the total amount of 
money is less than the internal standard, 
subjects should give themselves proportion- 
ately more of the total reward; and (c) when 
inputs are equal and the total amount of 
money is greater than the internal standard, 
subjects also should give themselves propor- 
tionately more of the total reward. Finally, 
the position that only the inputs of the other 
are evaluated in terms of some threshold 
leads to the prediction that when the other’s 
inputs are high, subjects should distribute 
rewards equally, but when they are low, 
subjects should give more of the total rewards 
to themselves. 
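
The three sets of predictions summarized above can be laid out side by side. The following Python sketch is only an illustration of the stated predictions for the case of equal inputs; the function name `predicted_split`, the position labels, and the outcome strings are hypothetical labels introduced here and are not terms used in the study.

```python
# Illustrative encoding of the predictions stated in the text for the
# case of equal inputs. All names here are hypothetical labels.

def predicted_split(position, sufficiency, other_inputs):
    """Return 'equal' or 'self_more' for a subject dividing a reward.

    position:     'comparison', 'own_other', or 'threshold'
    sufficiency:  'insufficient', 'sufficient', or 'oversufficient'
    other_inputs: 'high' or 'low' (the co-worker's input level)
    """
    if position == "comparison":
        # Equal inputs imply equal outcomes at any reward magnitude.
        return "equal"
    if position == "own_other":
        # Equal division only when the total matches the internal standard.
        return "equal" if sufficiency == "sufficient" else "self_more"
    if position == "threshold":
        # Only the other's inputs matter, not the size of the reward.
        return "equal" if other_inputs == "high" else "self_more"
    raise ValueError(f"unknown position: {position}")


# Print the predictions of each position for a low-inputs co-worker.
for suff in ("insufficient", "sufficient", "oversufficient"):
    row = [predicted_split(p, suff, "low")
           for p in ("comparison", "own_other", "threshold")]
    print(suff, row)
```

Laid out this way, the sufficient-reward cell with low inputs is where the own-equity/other-equity position and the threshold position disagree, which is the contrast the design relies on.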
METHOD


Subjects and Recruitment 

Subjects were 79 male students at Michigan State 
University who were randomly selected from ap- 
proximately 500 respondents to an advertisement in 
the student newspaper. The advertisement stated 
that respondents would be paid for their participation 
in “motivation research.” 


Input and Reward Conditions 


Three conditions of inputs were used. In one
condition (high inputs), subjects worked for 1½
hours, responding to an open-ended questionnaire 
dealing with problems in industrial relations. A 
second group of subjects (those in the low-input 
conditions) worked for 30 minutes on a shortened 
version of the same questionnaire. In a third condi- 
tion (explicit low inputs), subjects also worked for 
30 minutes, but they were told before they began 
that based on their responses to questions asked 
of them at the time they were telephoned and 
asked to participate, they were expected to produce 
low-quality answers to the questionnaire.3


3 This last condition was included since it is possible
that subjects in the low-input condition would
not have perceived their inputs as actually being low. 
A pretest was conducted to determine the
standard of fair pay for the type of work
the subjects performed in this study. Members of
an introductory social psychology class were asked
what they thought was fair pay for participating
in psychological experiments of varying lengths.
The mean responses of 91 males in the class for
the 30-minute and 1½-hour situations were $1.23
and $2.98, respectively. Therefore, each subject in
the high-inputs, sufficient-pay condition was given
$6 to divide between himself and his co-worker,
while each subject in the high-inputs, insufficient-
and oversufficient-pay conditions were given and
$12, respectively. Likewise, subjects in the low-inputs
and explicit-low-inputs, sufficient-pay conditions
were given $2.50 to divide, and subjects in the
insufficient- and oversufficient-pay conditions were
given $1.25 and $5, respectively.
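
The sufficient-pay totals appear to be roughly twice the pretest mean of fair pay, since each subject divided the money between two persons. The short Python check below illustrates that arithmetic; the doubling rule and the helper name `two_person_standard` are inferences made here for illustration and are not stated as the authors' procedure.

```python
# Mean fair pay per person from the pretest, in dollars (from the text).
fair_pay = {"low": 1.23, "high": 2.98}
# Totals given to subjects in the sufficient-pay conditions (from the text).
sufficient_total = {"low": 2.50, "high": 6.00}


def two_person_standard(mean_fair_pay):
    """Total needed to pay subject and co-worker the mean fair amount."""
    return 2 * mean_fair_pay


for level in ("low", "high"):
    needed = two_person_standard(fair_pay[level])
    print(level, round(needed, 2), sufficient_total[level])

# Low-inputs conditions: the insufficient and oversufficient totals
# reported in the text are half and double the sufficient total.
low_totals = {"insufficient": 1.25, "sufficient": 2.50, "oversufficient": 5.00}
print(low_totals["insufficient"] * 2 == low_totals["sufficient"])
print(low_totals["sufficient"] * 2 == low_totals["oversufficient"])
```

Under this reading, the amounts actually used ($2.50 and $6) are the doubled means rounded to convenient sums.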


Procedure 


Subjects responded to the industrial relations
questionnaire in groups of 8 or 10 persons. Actually,
only half of the participants in each group were
subjects, and the remainder were confederates. In
every group, the actual subjects sat at one table,
and the confederates sat at a second table immediately
adjacent to the first.

After all participants, both subjects and confederates,
wrote for the appropriate amount of time,
the group was told that each person seated at
Table 1 was going to determine the amount of
pay he and one person at Table 2, selected at
random, were going to receive. The experimenter 
indicated that the subjects sitting at the table who 
were to determine the pay were selected totally by 
chance. The experimenter pointed out that while 
each subject knew he was paired with one of the 
four (five) persons at the other table, he would 
never know exactly which one. 

Each subject was then given the amount of money 
dictated by the experimental design, two envelopes 
(one marked “My pay” and the other, “His pay”) 
and a one-item questionnaire that asked how he
divided the money. The total amount of money
that each subject had to divide (the same for 
every subject in the group) was announced to the 
group. The subjects were told that to insure that 
their choices were confidential each was to enter 
one of the cubicles that lined the room, close the 
door, put the amount they wished to give themselves
and the others in the appropriate envelopes,
seal the envelopes, and indicate on the questionnaire
how they distributed the rewards. Meanwhile, the 
remaining subjects (the confederates) remained 
seated at Table 2. 

After each subject entered his cubicle and divided 


4 The students reported what they felt was fair
compensation in each of six situations which ranged 
from just showing up to participating in a study that 
lasts for 3 hours. Mean fair pay per situation ranged 
from $1.10 to $5.71, and the differences between 
situations were highly reliable (F = 321.28).
TABLE 1


MEAN PERCENTAGE OF TOTAL REWARD THAT SUBJECTS
ALLOCATED FOR THEMSELVES, CLASSIFIED BY
INPUT CONDITION AND SUFFICIENCY
OF REWARD


                       Sufficiency of reward
Input condition    Insufficient   Sufficient   Oversufficient
High                   63.3          50.4           55.4
Low                    57.6          50.0           58.9
Explicit low           69.5          51.1           66.7


the money as instructed, he returned to the main 
room and handed the sealed “His pay” envelope and 
the questionnaire to the experimenter. Each subject
left immediately afterward, having been instructed 
not to open his “My pay” envelope until he had 
left the building. To make the situation as realistic
as possible, the experimenter distributed the “His 
pay” envelopes to the confederates after the last 
subject had left, and they too left the building and 
went separate ways. There were two groups run in 
each experimental condition. 


RESULTS 


The amounts of money in the “His pay” 
envelopes were used to determine the per- 
centage of the total reward that each subject 
gave to himself.5 Table 1 presents the means
of these percentages classified by experimental 
conditions. 
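
The dependent measure described above follows directly from the envelope contents. As a minimal sketch, assuming the subject's share is simply the total minus the amount found in the "His pay" envelope, the computation could look like this in Python. The function name `percent_kept` and the example amounts are hypothetical and are not data from the study.

```python
def percent_kept(total, his_pay):
    """Percentage of the total reward the subject kept for himself,
    inferred from the amount found in the "His pay" envelope."""
    if total <= 0 or not 0 <= his_pay <= total:
        raise ValueError("his_pay must lie between 0 and a positive total")
    return 100.0 * (total - his_pay) / total


# Hypothetical divisions, for illustration only.
print(percent_kept(6.00, 3.00))   # an equal split of $6
print(percent_kept(2.50, 1.00))   # subject keeps $1.50 of $2.50
```

Expressing the measure as a percentage is what allows cells with different totals to be compared in a single analysis.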

Although the subjects completed the indus- 
trial relations questionnaire in a group set- 
ting, they divided the money individually in 
their own cubicles. However, it was possible, 
although unlikely, that the group-adminis- 
tration procedure used influenced results. To 
make sure that this was not the case, an experimental
group was included as a nested
factor in a 3 (input condition) × 3 (sufficiency
of reward) × 2 (experimental group)
unweighted-means analysis of variance of the
percentage of the total reward that the subjects
allocated to themselves. The results of
this analysis are summarized in Table 2.

As predicted, the effect of experimental 
groups was insignificant. In fact, using a pro- 
cedure suggested by Walker and Lev (1953, 


5 Obviously, this was a potentially more accurate 
measure of the distribution of rewards than was 
each subject’s self-report. However, it is interesting 
to note that only one subject reported a division 
that was different from the way he actually distrib- 
uted the rewards. 
pp. 207-208), groups were found to be not
different at a significant level (F = 5.21, df 
= 61/9, p < .01). For this reason, a pooled 
error term was used to test for the significance 
of the remaining effects. 
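
The pooled error term and the F ratios reported in Table 2 can be checked with a few lines of arithmetic. The Python sketch below assumes the usual df-weighted pooling of the two error mean squares; the within-groups df of 61 is taken from the "df = 61/9" reported above, and the helper name `pooled_ms` is a hypothetical label.

```python
# Mean squares and degrees of freedom as reported in Table 2.
ms = {"input": 354.63, "sufficiency": 1165.51, "interaction": 114.74}
ms_groups, df_groups = 40.09, 9        # experimental groups / A x B
ms_within, df_within = 203.86, 61      # subjects within groups


def pooled_ms(ms_a, df_a, ms_b, df_b):
    """Pool two error terms by weighting each mean square by its df."""
    return (ms_a * df_a + ms_b * df_b) / (df_a + df_b)


ms_error = pooled_ms(ms_groups, df_groups, ms_within, df_within)
print(round(ms_error, 2))

# Each effect is tested against the pooled error mean square.
for name, value in ms.items():
    print(name, round(value / ms_error, 2))
```

The pooled value agrees with the 182.80 on 70 df shown in the table, and the three ratios reproduce the tabled F values of 1.94, 6.38, and .63.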

As Table 2 indicates, the effect of suffi- 
ciency of reward was significant. To further 
explore the nature of this effect, Dunnett’s 
tests (Winer, 1962, pp. 89-91) were per- 
formed to compare sufficient reward to in- 
sufficient and oversufficient rewards across 
input conditions. These tests revealed that 
when compared to sufficient-reward condi- 
tions, subjects allocated proportionately more 
money to themselves in the insufficient-re- 
ward condition (t = 3.42, p < .005) and in 
the oversufficient-reward condition (t = 2.58, 
p < .025).


DISCUSSION 


The results in respect to the effects of dif- 
ferences in the sufficiency of rewards strongly 
support the own-equity–other-equity position
of Weick, which states that persons compare 
input/outcome ratios to a socially derived 
internal standard. Thus, a theory developed
solely on the basis of comparison equity, or 
one developed on the basis of evaluation of 
the other’s input level, could not have ac- 
counted for the findings of the present re- 
search. 

This point is further supported by an ex- 
amination of some of the studies that have 
produced results that were interpreted as 
nonsupportive of equity theory. For example, 
Lawler (1968) found that subjects overpaid 
by circumstance tended to produce the same 
amount of the same quality of work as sub- 
jects who were equitably paid. Adams (1968) 


TABLE 2


SUMMARY OF ANALYSIS OF VARIANCE OF THE
PERCENTAGE OF THE TOTAL REWARD THAT
SUBJECTS ALLOCATED TO THEMSELVES


Source of variation              df       MS        F
Input condition (A)               2     354.63    1.94
Sufficiency of reward (B)         2    1165.51    6.38*
A × B                             4     114.74     .63
Pooled error                     70     182.80
  Experimental groups/A × B       9      40.09     .20
  Subjects within groups         61     203.86


* p < .005.


interpreted these findings in terms of differ- 
ences in comparison individuals. That is, he 
stated that subjects overpaid by circumstance 
should compare themselves with persons over- 
paid in similar situations and not with per- 
sons who were equitably paid, a supposition 
that may or may not be true. A theory of 
equity that includes the concepts of own and 
other equity would predict the results Lawler 
obtained, even if the overpaid-by-circum- 
stance subjects compared themselves with 
equitably paid subjects. This is so because, as 
the results of the present study indicate, per- 
sons should not feel dissatisfied with receiving 
more than they deserve as long as their com- 
parison person was being treated equitably. 

It should be emphasized, however, that the 
results of the present research suggest that 
comparison equity is also an important determinant
of the distribution of rewards. Subjects
did seem to compare their own input/outcome
ratios with those of the persons with whom they
were paired. If subjects were only concerned
with own and other equity, then they should
have allocated a larger portion of the total 
reward to themselves in the insufficient-re- 
ward conditions than they did. Likewise, if 
comparison equity were not salient, they 
should have allocated a larger proportion of 
the excess money to themselves in the over- 
sufficient-reward conditions. 

Moreover, the results of the present re- 
search do not support the threshold explana- 
tion advanced by Lane and Messé (1971). For 
example, if such a threshold were relevant to 
the manner in which rewards were distributed, 
then subjects in the low- and explicit-low— 
sufficient-reward conditions should have given 
proportionately more of the rewards to them- 
selves. However, the results indicate that sub- 
jects in these conditions divided the rewards 
equally. Thus, this interpretation, although it 
accounted for their results, seems too sim- 
plistic to have any general validity. In addi- 
tion, the theoretical position advocated by 
Weick also adequately accounts for their find- 
ings. 

It is interesting to note that there was little,
if any, difference between responses (in
terms of percentages) in the three input conditions.
More importantly, there was no interaction
between input conditions and sufficiency
of reward. These findings support
the idea that the internal standard is a relative
one, that is, the standard changes as a function
of differences in the input levels of all
relevant persons.

It could be argued that the subjects’ behavior
in the present study was not a function
of concerns with equity of any type but
rather a function of demand characteristics
generated by the procedure used to test the
hypotheses. For example, it is possible that
by asking subjects to distribute rewards instead
of doing so himself, the experimenter
“communicated” that he wished them to divide
the money unequally. On the other hand,
since it must have been obvious to the subjects
that the experimenter was interested in
how they distributed rewards, they might have
felt constrained to do the “right thing” and
conform to the norm of equity by dividing
the money equally.

However, there are a number of considera- 
tions that seriously call into question any 
explanation of the results based upon demand 
characteristics. First, all subjects were in- 
duced to participate in the study by the offer 
of money. It seems likely that persons who 
were interested in making money would be 
little influenced by what they thought the 
experimenter wanted them to do. Second, the 
procedure used was specifically designed to
instill in the subjects a feeling that their actual
distribution behavior was truly private.
Thus, it seems unlikely that the subjects’ perceptions,
if any, of the experimenter’s expectations
would have affected their actual distribution
of rewards, since they were led to
believe that the experimenter would see only
the self-reports of their behavior.

Finally, and perhaps of greatest importance,
it is difficult to conceive of an interpretation
based upon demand characteristics that would
adequately explain all of the results. Within
each input condition, the only variation in
procedure was the amount of money that
subjects were given to divide. Given this, it
is difficult to see how there could have been
demand characteristics present that would
have produced unequal distributions of insufficient
and oversufficient rewards and equal
distributions of sufficient rewards. On the
other hand, an explanation based upon own
and other equity can readily account for these
results.

In summary, it is evident that no one concept
of equity can explain completely the behavior
of persons in this and past research.
There is a great deal of empirical support
for the proposition that persons, to some ex- 
tent, evaluate the degree to which they are 
satisfied by comparing their own input/out- 
come ratios with those of relevant others. 
However, there also is evidence, especially the 
results of the present research, which indi- 
cates that (a) persons evaluate both their own 
and the other’s ratios in terms of some in- 
ternal standard that varies as a function of 
inputs, and (b) concerns with own equity are
stronger than concerns with other equity. 
Thus, it appears that the determinants of sat- 
isfaction with the distribution of rewards in a 
situation are more complex and varied than 
at first thought.
HETEROGENEITY-HOMOGENEITY OF EXPOSURE SEQUENCE
AND THE ATTITUDINAL EFFECTS OF EXPOSURE 1


ALBERT A. HARRISON 2 AND RICK CRANDALL 3


University of California, Davis 


Two experiments tested Berlyne’s two-factor (positive habituation and tedium) 
theory of the effects of repeated stimulus exposures. In Experiment I, stimuli 
were shown in varying frequencies under conditions of maximally heterogene- 
ous exposure sequences, in which no stimulus followed itself, and maximally 
homogeneous sequences, in which each stimulus followed itself until all
exposures were completed. Results showed an “exposure effect” (p < .001) and an
attenuation of this effect under conditions of homogeneous presentation
sequences (p < .001). In Experiment II, under conditions where maximum
homogeneity of exposure sequence was combined with ratings prior to the
interpositioning of other stimuli, increased exposure led first to increased then
decreased favorability (p < .025). Implications for exposure, two-factor, and
response-competition hypotheses are discussed.


Location of a stimulus on the novelty- 
familiarity dimension has long been consid- 
ered an important stimulus property, but the 
present pattern of theoretical expectations 
and empirical findings is somewhat confusing. 
On the one hand, the “novelty group” (e.g., 
Berlyne, 1960, 1966, 1970; Cantor, 1968) 
has stressed the high “hedonic value” or 
favorability of unusual, novel, low-frequency 
stimuli, while the “frequency group” (e.g., 
Harrison, 1969; Matlin, 1970; Zajonc, 1968) 
has stressed the high “affective ratings” 
or favorability of common, familiar, high- 
frequency stimuli. 

A partial resolution of some apparently dis- 
crepant findings is to be found in the methods 
used for assessing stimulus favorability. For 
example, increased self-exposure to a stimulus 
cannot be taken to imply a more favorable 
attitude toward it (Harrison, 1968; Harrison 
& Hines, 1970; Saegert & Jellison, 1970). 
But there remain incompatibilities among dif- 
ferent studies using similar measures of ver- 
bally expressed preference. While some recent 
studies have obtained a positive exposure- 
favorability relationship (Harrison, Tutone, & 
McFadgen, 1971; Zajonc, Swap, Harrison, & 


1 This research was supported by National Science 
Foundation Grant GS-2791 to the senior author. 

2 Requests for reprints should be sent to Albert A. 
Harrison, Department of Psychology, University of 
California, Davis, California 95616, 

3 Now at the University of Michigan. 
preferences for low-frequency stimuli or a
curvilinear inverted-U relationship (Berlyne,
1970; Cantor, 1968; Saegert & Jellison,
1970).

In an attempt to reconcile such inconsistencies,
Berlyne (1970) has examined differences
in stimulus properties (e.g., simple-complex)
and experimental procedures (e.g., order of
presentation). Typically, researchers obtaining
the positive monotone function presented
stimuli in heterogeneous sequences in which
several stimuli were interspersed, while researchers
obtaining an eventual decrease in
favorability over exposures presented stimuli
in homogeneous sequences in which a given
stimulus repetitiously followed itself. In a
test of this hypothesis, Berlyne found that
high-frequency stimuli presented in homogeneous
sequences declined in judged pleasantness
more than high-frequency stimuli presented in
heterogeneous sequences.

Exposure sequence effects were interpreted
by Berlyne in terms of a “two-factor” theory,
which posits two effects of stimulus repetition.
First, there is positive habituation, which
increases as the organism becomes oriented
to the stimulus. Positive habituation effects
are similar to “reduction of response competition”
effects (e.g., Harrison et al., 1971;
Matlin, 1970) in that exposure reduces uncertainty
and conflict, thereby decreasing
displeasure. The second factor, tedium, also


increases over exposures and results in an 
overexposed stimulus losing pleasantness. Sev- 
eral variables affect the strength of these two 
factors, including the sequence of stimulus 
presentations. Berlyne suggested 


one would expect monotonous repetition to 
strengthen the tedium factor, while the appearance 
of several stimuli in the sequence might well miti- 
gate this factor and allow the positive habituation 
factor to gain the upper hand [Berlyne, 1970, p. 
281]. 


The purpose of the present research was 
both to test further Berlyne’s two-factor 
theory and to bridge the methodological gap
between “exposure” and “novelty” studies by
using procedures and assessment techniques 
similar to those used in earlier exposure re- 
search (e.g., Zajonc, 1968). Specifically, an 
attempt was made to influence exposure- 
favorability functions by varying the hetero- 
geneity-homogeneity of the exposure se- 
quences and by varying the point in the 
experiment that the favorability ratings were
made. 

In the usual exposure studies, each stimulus 
presentation lasts about 2-4 seconds. In an 
experiment in this series that is not reported 
in detail here, we assigned 40 subjects to each 
of four conditions. Subjects in each condition 
saw two stimuli each in the frequencies of 0, 
1, 3, 9, and 27 for 4 seconds per exposure. 
The conditions represented four degrees of 
stimulus sequence homogeneity ranging from 
a condition in which no stimulus followed 
itself to a condition in which each stimulus 
followed itself for the required number of 
presentations. Ratings obtained afterwards 
showed the usual exposure-favorability monotone
function with the overall means for the
0, 1, 3, 9, and 27 stimuli being 2.61, 2.81, 3.08,
3.13, and 3.54, respectively (F = 39.49,
df = 4/576, p < .001). Exposure sequence,
singly or in combination with other variables, 
had no effect. However, it follows from the 
two-factor theory that even 27 successive 
brief exposures of a complex stimulus such 
as the ideographs might not give “tedium” a 
chance to appear. In all the current research, 
15-second exposure times were used to give 
both factors a chance to appear. 
EXPERIMENT I
Method 


Subjects. Eighty volunteer subjects from intro- 
ductory psychology classes were used. They received 
class credit for their participation. 

Stimuli. Stimuli were 10 of the Chinese ideographs 
used in earlier exposure research. Prepared as
35-millimeter slides, they were projected automatically
onto a screen. 

Procedure. Subjects were assigned, 40 each, to the 
two experimental conditions. After subjects were instructed
simply to observe the ideographs, stimuli were presented
two each in the frequencies of 0, 1, 3, 9, and
27, for 15 seconds per exposure. In the high- 
heterogeneous condition, no stimulus followed itself. 
In the high-homogeneous condition, each stimulus 
followed itself until the required number were 
presented. For example, in the high-homogeneous 
condition, the subject saw each “27” stimulus 27 
times in a row. 

The design was counterbalanced to insure that 
each stimulus appeared in each frequency an equal 
number of times. In the high-homogeneous condition, 
there was also a counterbalancing of the order in 
which blocks of slides were shown. For example, 
an equal number of subjects saw the “27” stimulus 
in every position of a random ordering of all fre- 
quency blocks. After the exposure phase, subjects 
were asked to rate the ideographs on 7-point good- 
bad or bad-good scales, half of the subjects in each 
condition receiving each form. 
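
The two kinds of exposure sequence described above can be illustrated with a short sketch. The Python code below is not the authors' procedure: the stimulus labels, the function names, and the greedy ordering rule used to build a sequence in which no stimulus follows itself are all assumptions made for illustration, and the counterbalancing of stimuli and block orders is not modeled.

```python
import heapq

# Exposure frequencies from the text: two stimuli at each of 0, 1, 3, 9, 27.
FREQUENCIES = (0, 1, 3, 9, 27)


def assign_counts(stimuli):
    """Map ten stimulus labels to exposure counts, two per frequency."""
    return {s: FREQUENCIES[i // 2] for i, s in enumerate(stimuli)}


def homogeneous(counts):
    """Each stimulus follows itself until all its exposures are shown."""
    return [s for s, n in counts.items() for _ in range(n)]


def heterogeneous(counts):
    """Order exposures so that no stimulus immediately follows itself.

    Greedy rule (an assumption, not the published method): always show
    the stimulus with the most exposures left, skipping the one just shown.
    """
    heap = [(-n, s) for s, n in counts.items() if n > 0]
    heapq.heapify(heap)
    sequence, held = [], None   # held = stimulus just shown, kept off the heap
    while heap:
        n, s = heapq.heappop(heap)
        sequence.append(s)
        if held is not None:
            heapq.heappush(heap, held)
        held = (n + 1, s) if n + 1 < 0 else None
    if held is not None:
        raise ValueError("no arrangement without immediate repeats exists")
    return sequence


# Hypothetical labels standing in for the ten ideographs.
counts = assign_counts([f"S{i}" for i in range(1, 11)])
print(len(homogeneous(counts)), len(heterogeneous(counts)))
```

Both sequences contain the same 80 exposures (2 × (1 + 3 + 9 + 27)); only the ordering differs, which is the manipulation of interest.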


Results and Discussion 


Ratings were scored from 0 to 6, with
higher scores indicating greater goodness of
meaning (Figure 1), and an analysis of variance
was performed. While there were no overall
differences in ratings of the high-heterogeneous
and high-homogeneous stimuli (F < 1,
df = 1/72), there was a main effect for frequency
(F = 9.21, df = 4/288, p < .001,
tested against Subjects × Frequencies) and
a Frequency × Conditions interaction effect
(F = 7.09, df = 4/288, p < .001) showing
that in the high-homogeneous condition, the
attitude-enhancing effects of exposure were
severely attenuated.

The main effect supports the mere exposure 
hypothesis, and the interaction effect supports 
Berlyne’s two-factor theory. In the high- 
heterogeneous condition, the interspersal of 
stimuli presumably prevented tedium from 
overcoming the effects of positive habituation, 
while in the high-homogeneous condition, the 
absence of interspersed stimuli presumably 
FIG. 1. Mean affective ratings of Chinese ideographs
as a function of exposure frequency and
heterogeneity-homogeneity of exposure sequences.


allowed tedium to visibly impair the effects 
of positive habituation. 

Clearly, the high-homogeneous sequence 
impaired the favorability-enhancing effects of 
exposure, exactly as the two-factor theory 
suggests. But in the high-homogeneous condi- 
tion, there was no decrease in favorability as 
a function of repeated exposure, only an 
attenuation of the exposure effect. Berlyne 
(1970), on the other hand, reported that 
generally there was a decline in favorability 
over exposures and that this decline was 
faster under homogeneous as opposed to 
heterogeneous presentation sequences. In Experiment I,
we did not find the decline in the
favorability ratings of high-frequency stimuli 
that should result from a potent tedium factor. 


EXPERIMENT II


When Berlyne (1970) compared hetero- 
geneity and homogeneity of presentation se- 
quence, subjects rated after each presentation. 
In Experiment I, all stimuli were presented, 
then ratings were obtained. It is quite possible
that the separation of rating and exposure
phases has important effects on ratings of
stimuli presented in homogeneous sequences.
Our procedures of taking ratings afterward may
have resulted in interpositioned stimuli
“breaking up” tedium effects while not appreciably
impairing positive habituation effects.
In Experiment II, ratings were obtained
before additional stimuli were interspersed.


Method 
Using 40 subjects, Experiment II duplicated the
high-homogeneous condition of Experiment I. Identical
stimuli and procedures were used, except that
ratings were obtained on the last presentation of
each stimulus (e.g., first, third, ninth, or
twenty-seventh) during what would be the “exposure phase”
in Experiment I.


Results and Discussion 


Figure 2 presents an exposure-favorability 
relationship rather strikingly different from
those usually obtained with these stimuli
(F = 3.34, df = 3/108, p < .025, tested
against Subjects × Frequency). Results are
quite in accordance with Berlyne’s two-factor
theory. Under conditions of homogeneous
presentation sequences followed by immediate 
ratings, a few exposures led to enhanced 
ratings (presumably because of the ascendance
of the positive-habituation factor),
while further exposures led to lower ratings
(presumably because of the ascendance of
the tedium factor). Thus, under conditions
of long-exposure duration, complete homogeneity
of stimulus sequence, and immediate
favorability assessment, ratings accorded the
ideographs initially increased then decreased
as a function of repeated stimulus exposure.


OVERVIEW AND GENERAL DISCUSSION 


In Experiment I, the favorability-enhancing 
effects of mere exposure were reduced when 
stimuli were presented in homogeneous sequences.
In Experiment II, when ratings of
stimuli presented in homogeneous sequences 
were obtained prior to the interpositioning of 
other stimuli, the effects of tedium appeared 
to predominate as stimuli were increasingly 
exposed.

The present results have implications for 
the “mere exposure” hypothesis and for the 
two-factor and response-competition interpre- 
tations of the effects of stimulus repetition on 
favorability ratings. 

First, the present results suggest a certain 
limitation of the “mere exposure” hypothesis. 
Like Berlyne (1970) and Saegert and Jellison 
(1970), we have found an inverted-U rela- 
tionship between exposure and favorability 
ratings. However, instead of introducing the 
simplicity-complexity manipulation, we fol- 
lowed other procedures for making certain 
stimuli more tedious or boring than others. 
Apparently, mere exposure under repetitious 
conditions that maximize tedium and offer no 
alleviation by the interpositioning of addi- 
tional stimuli produces something other 
than the usual monotone exposure-favorability 
relationship. 

Second, our results are generally consistent 
with the Berlyne two-factor theory. How- 
ever, the present results also suggest certain 
limitations on the effects of tedium. In the 
early experiment briefly reported in the intro- 
duction, tedium did not appear, perhaps be- 
cause the exposure times were too brief. In 
Experiment I, in which stimuli were presented 
under conditions of complete heterogeneity or 
homogeneity for relatively lengthy exposure 
times, tedium appeared to impair, but not
reverse, the attitude-enhancing effects of ex- 
posure. Only in the last experiment, where 
lengthy exposure times in uninterrupted se- 
quences were coupled with ratings prior to 
the interpositioning of other stimuli, did in- 
creased exposure of the Chinese ideographs 
lead to first increased then decreased favor- 
ability ratings. 

The present results also have implications 
for the response-competition interpretation of 
the exposure effect. This interpretation sug- 
gests that repeated exposure to a stimulus 
reduces the negative affect elicited by the pres- 
ence of competing responses initially attached 
to that stimulus, through a process whereby 
one response becomes dominant. Not only are 
the interaction effect in Experiment I and the 
results of Experiment II not easily handled 
by this hypothesis, they seem to run counter 
to it. Reduction of response competition (and 
subsequent enhancement of affect) should be 
facilitated by homogeneous stimulus presenta- 
tion sequences in which intraserial competi- 
tion is minimized. However, there is now evi- 
dence that response competition and the 
concomitant negative affect vary as a function 
of a number of variables. The complexities 
of their interrelationships are not yet fully 
understood. Hence, it is presently difficult 
to fully evaluate the implications of the 
present findings for the response-competition 
hypothesis. 

In conclusion, we suggest that the present 
experiments provide further reconciliation be- 
tween the apparently discrepant findings of 
exposure and novelty studies. The overall pattern
of results in Experiments I and II offers
some support for the Berlyne two-factor 
theory, demonstrating some potential effects 
of both positive habituation and tedium. They 
also raise new challenges for the mere exposure
and response-competition hypotheses. Hopefully,
a parsimonious theoretical interpretation
tion will soon be developed that accounts not 
only for frequency, novelty, and response- 
competition findings but also for connotative 
satiation findings, which suggest that very 
fast stimulus repetition in homogeneous 
sequences leads to decreased polarity on 
evaluative scales (Amster, 1964). 
A cognitive model of motivation is presented in which ascriptions concerning
the causes of success and failure mediate between achievement outcomes and
subsequent achievement-related behavior. It is demonstrated that attributions
to effort, an internal or personal causal factor, magnify relative reward for
success and punishment for failure at achievement activities. Further, attributions
to lack of effort or bad luck, causal factors which may vary from
moment to moment, produce lesser decrements in the expectancy of success
following failure experiences than attributions to low ability or high task
difficulty. Ability and task difficulty, respectively, are perceived as fixed
person and task characteristics. The locus of control (internal versus external)
and stability (fixed versus variable) dimensions of causality are confounded in
the locus of control literature. It is also established that task difficulty is a cue
providing information concerning the efficacy of effort expenditure. Outcomes
at tasks of intermediate difficulty are believed to be most influenced by effort.
It is contended that attributions to effort play an important role in determining
the direction, magnitude, and persistence of achievement-oriented activity.


The basic paradigm of a cognitive theory
of motivation (see Baldwin, 1969) is that
mental events following the perception of a
stimulus determine the behavioral response
to that stimulus:


stimulus → cognitive activity → response


At least two distinct programs of research 
are subsumed within this paradigm. One ap- 
proach has attempted to specify the ante- 
cedents of an “expectancy,” and includes 
cognitive goal anticipations among the im- 
mediate determinants of behavior (Atkinson, 
1964; Rotter, 1954; Tolman, 1932). A second 
cognitive approach to motivation has focused 
upon emotional reactions rather than instru- 
mental acts, and has examined the affective 
consequences of either cognitive “reappraisal”
(Lazarus, 1968) or the labeling of internal 
arousal states (Nisbett & Schachter, 1966; 
Schachter & Singer, 1962). 

The data generated by these research pro- 
grams provide evidence that cognitive proc- 
esses influence both the expectancy of goal 
attainment and the affective response to a 
stimulus. It is therefore proposed that a cogni- 
tive theory of motivation should assume the 
following, more specific, form: 


stimulus ---> cognitive activity ---> expectancy ---> behavior
                                 ---> affect     ---> behavior


AN ATTRIBUTIONAL MODEL OF
ACHIEVEMENT MOTIVATION


A model of achievement-related behavior, 
in which causal ascriptions mediate between 
antecedent stimulus conditions and achieve- 
ment-oriented responses, further elaborates 
the above cognitive paradigm (see Weiner, 
Frieze, Kukla, Reed, Rest, & Rosenbaum, 
1971). It is assumed by Weiner et al. that 
individuals allocate the causes of success and 
failure to four elements: ability, effort, task 
difficulty, and luck (see Heider, 1958). These 
four elements are comprised within two 
causal dimensions: locus of control (internal 
versus external) and stability (fixed versus 
variable; see Table 1). Locus of control refers 
to the self versus environmental responsibility 
for an outcome (from Rotter, 1966), while 
the stability of the attribution connotes its 
perceived fluctuation over time. Table 1 indi- 
cates that ability and effort are perceived as 
internal (personal) causes of success and fail- 
ure, while task difficulty and luck are external 
(environmental) determinants of outcome. In 
addition, perceived ability and task difficulty 
remain relatively constant over time (fixed 
elements), while effort and luck may shift 
from moment to moment (variable elements). 
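
The two-by-two classification just described can be written down as a small lookup structure. The following is only an illustrative sketch: the class name `CausalElement`, the dictionary `ELEMENTS`, and the helper `elements_where` are hypothetical names introduced here, not part of the original model's notation. The locus and stability labels themselves are taken from the text.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class CausalElement:
    """One perceived cause of success or failure (hypothetical helper type)."""
    name: str
    locus: str      # "internal" or "external"
    stability: str  # "fixed" or "variable"


# The four elements and their positions on the two causal dimensions,
# as stated in the text.
ELEMENTS = {
    "ability": CausalElement("ability", "internal", "fixed"),
    "effort": CausalElement("effort", "internal", "variable"),
    "task difficulty": CausalElement("task difficulty", "external", "fixed"),
    "luck": CausalElement("luck", "external", "variable"),
}


def elements_where(locus: Optional[str] = None,
                   stability: Optional[str] = None) -> List[str]:
    """Return the names of elements matching the requested dimension values."""
    return sorted(
        element.name
        for element in ELEMENTS.values()
        if (locus is None or element.locus == locus)
        and (stability is None or element.stability == stability)
    )


if __name__ == "__main__":
    print(elements_where(locus="internal"))
    print(elements_where(stability="variable"))
```

Querying by one dimension at a time makes the later argument concrete: ability and effort share a locus but differ in stability, while ability and task difficulty share stability but differ in locus.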

It is contended that locus of control influ- 
ences the affective reactions to an event, with 
internal ascriptions magnifying emotional re- 
sponses. For example, the veridical or non- 
veridical belief that arousal symptoms such 
as rapid heart rate are due to oneself, rather 
than to an external agent such as a drug, 
produces greater reactivity to shock (Nisbett 
& Schachter, 1966), increases the liking of 
nude pictures (Valins, 1966), and enhances 
euphoria or anger in appropriate social situations
(Schachter & Singer, 1962). We think
that attributions of achievement outcomes to
internal causes also result in greater affect
(pride and shame) than attributions to environmental
factors. For example, success
ascribed to high ability should produce more
pride in accomplishment than success perceived
as due to an easy task. These two
factors differ in locus of control, while both
are perceived as relatively stable. There is
experimental evidence in the achievement
literature that ascriptions of success and failure
to effort, which is perceived as distinctly
subject to volitional control (see Leventhal
& Michaels, 1971), maximize both personal
pride and shame and rewards and punishments
allocated by others (Lanzetta &
Hannah, 1969; Leventhal & Michaels, 1971;
Weiner & Kukla, 1970). [...] premises and data,
we believe that affect in achievement-related
contexts is determined primarily by the control,
rather than the stability, dimension of causality.


TABLE 1
PERCEIVED DETERMINANTS OF SUCCESS AND FAILURE

                  Locus of control
Stability      Internal      External
Fixed          Ability       Task difficulty
Variable       Effort        Luck

We also believe that attributions to the
stability of the causal factors primarily influence
the changes in expectancy of success
following success or failure. For example,
failure ascribed to low ability or a hard task
(stable factors) should produce greater decrements
in the subsequent expectancy of success
than attributions of failure to a lack of
effort or bad luck (variable factors). This is 
intelligible because effort subsequently can be 
increased, just as luck may improve. Thus, 
ascriptions to unstable elements signify that 
future expectations may differ from the prior 
outcome. On the other hand, attributions
of failure to low ability or a hard task 
(factors differing in locus of control but 
identical in stability) are likely to produce 
similar decrements in the expectancy of goal 
attainment, for both ability and task difficulty 
are perceived as remaining relatively constant 
over time. In light of the above examples, it
appears to these authors that expectancy
changes are determined primarily by the
stability, rather than control, dimension of
causality.

The cognitive model previously outlined
therefore may be further elaborated, and
within achievement-related contexts is depicted as:

achievement outcome ---> causal ascriptions ---> stability ---> expectancy change ---> behavior
                                            ---> locus of control ---> affect ---> behavior


The diagram specifies a temporal order of 
events; thoughts (attributions of causality) 
precede expectancy change and affect and 
determine subsequent behavior. 

The research reported in this paper exam- 
ines some of the suppositions outlined above. 
The investigations pertain to the relationships 
between causal ascriptions and (a) affective 
reactions (Experiment 1), and (b) expectancy 
changes and performance intensity (Experiment
II). In addition, the proposed linkage
between antecedent stimuli and mediating
cognitions is examined by relating the perceived
difficulty of a task to inferences about
the causal significance of effort (Experiment
III).


EXPERIMENT I: EFFORT ASCRIPTIONS AND 
AFFECTIVE REACTIONS 


Three recent investigations (Lanzetta & 
Hannah, 1969; Leventhal & Michaels, 1971; 
Weiner & Kukla, 1970) demonstrated that 
when an individual attributes success to 
effort, rewards for achievement success are 
augmented. In a similar manner, punishments 
for failure are maximal when attributions for 
that failure are due to a lack of effort. 

In the experiment by Lanzetta and Hannah, 
subjects acted as “trainers” with the power 
to reward and punish a “trainee” for good or 
poor performance at a discrimination task. 
The trainer was given false information con- 
cerning the ability of his trainee and the 
difficulty of the task. The data reveal that 
trainees perceived as high in ability failing 
a supposedly easy task receive the greatest 
amount of punishment. Lanzetta and Hannah 
(1969) concluded that in this condition fail- 
ure is ascribed to a lack of effort. This causal 
attribution is “reacted to more negatively
. . . [and] motivate(s) behavior that might
effect a change in the responsible factor
[p. 251].”

4 Dotted arrows indicate weaker relationships.

In a related experiment by Weiner and
Kukla (1970), subjects were given information
about hypothetical students’ ability, effort
expenditure, and performance on an
exam. The subjects were to act as “teachers”
and reward and punish the pupils for their 
performances. The data clearly demonstrate 
that given high or low ability, attribution of 
success to effort heightens reward, and at- 
tribution of failure to lack of effort increases 
punishment. Greatest reward for success is 
given to students perceived as high in effort 
and low in ability, while most punishment 
for failure is dispensed to pupils perceived as 
high in ability but low in effort. Similar 
findings were reported by Leventhal and 
Michaels (1971). In addition, the same gen- 
eral pattern of results is displayed when 
subjects are asked to introspect about their 
personal feelings of pride and shame (Weiner 
& Kukla, 1970). 

In the Lanzetta and Hannah (1969), Leven- 
thal and Michaels (1971), and Weiner and 
Kukla (1970) experiments, effort presumably 
was perceived as an unstable factor which 
may be altered from moment to moment. Yet, 
effort also may have the conceptual status of 
a stable personality disposition. Individuals
may describe themselves, and others, as lazy,
industrious, etc. The present investigation 
extends the prior studies, demonstrating an 
effort ascription-affect contingency by exam- 
ining the relationship between perceived moti- 
vational dispositions regarding effort expendi- 
ture and the tendency to reward oneself for 
success and punish oneself for failure. 


Method 
Procedure 


Subjects were 63 American male children in the
fifth and sixth grades. They first were administered 
the Intellectual Achievement Responsibility (IAR) 
Scale (Crandall, Katkovsky, & Crandall, 1965). This 
self-report inventory consists of 34 items, each item 
depicting a successful or an unsuccessful achievement
outcome. The responses pair an internal against an 
external attribution. Further, the internal responses 
can be partitioned into ability and effort ascriptions. 
Thus, an individual’s total score on the IAR scale 
may be partitioned into four subscales representing 
the tendency to ascribe success to effort, failure to 
a lack of effort, success to ability, and failure to 
a lack of ability. The following are two typical items 
illustrating failure and success outcomes and ability 
and effort internal stems: 


When you can’t work a puzzle, would it probably 
happen 
a. because you are not especially good at work- 
ing puzzles, or 
b. because the instructions didn’t make sense? 


When you learn something quickly in school, is it 
usually 

a. because you paid close attention, or 

b. because the teacher explained it clearly? 


Individual experimentation followed the group IAR 
test administration. Subjects were given a set of 
achievement-related puzzles to solve, the task being 
to trace over all the lines on a puzzle without lifting 
the pencil from the paper or retracing a line (see 
Feather, 1961). Solvable and insolvable forms of 
these puzzles can be constructed which are not 
discriminable by the naive performer. Each subject
received a booklet of 10 solvable and 10 insolvable 
puzzles. The solvable and insolvable puzzles were 
equated for apparent level of difficulty by matching 
the number of lines on the puzzles. The task was 
“timed,” and failure was signaled by interruption 
of the experimenter following approximately a 30- 
second time interval. More time was permitted when 
needed to complete a solvable puzzle. 

On the subject’s desk was a bowl of poker chips, 
along with two panels of seven buttons. The panels
were labeled “win-take” and “lose-give back”; the 
buttons on each panel were numbered from 1 to 7. 
Following each successful task completion, the 
subjects were told to press the win-take button 
corresponding to the “number of chips you feel you 
deserve,” and to take that amount from the bowl. 
In a similar manner, following each failure, they 
were to press the lose-give-back button correspond- 
ing to the number of chips “you think you should 
return,” and to replace these chips in the bowl. The 
experimenter first demonstrated the button push and 
“give and take” behavior, as well as familiarizing 
the subjects with the puzzle task. The experimenter 
then removed herself from the visual field of the 
subject and recorded the self-reinforcement activity 
from behind a screen by noting a light activated 
by the button press. (See Kanfer & Marston,
1963, and Katz, 1967, for related self-reinforcement
methodologies.)


TABLE 2
RESULTANT SELF-REINFORCEMENT (SELF-REWARD FOR
SUCCESS MINUS SELF-PUNISHMENT FOR FAILURE)
AS A FUNCTION OF CAUSAL ASCRIPTION OF
SUCCESS TO EFFORT (M+) MINUS
ASCRIPTION OF FAILURE TO A LACK
OF EFFORT (M−)

                                   (M+) − (M−)
Resultant reinforcement      High     Medium     Low
Above median                  16        11         6
Below median                   8        10        12

Ten chips were provided for each subject at the 
initiation of the experiment. In addition, the initial 
two tasks were solvable, and there were never more 
than two consecutive insolvable puzzles. These precautions
were taken to decrease the possibility that
a subject would have less than seven chips when 
attempting an impossible puzzle. 


Results 


Scores on the IAR scale representing the
disposition to ascribe success to effort or 
motivation (M+) were related to self-reward 
for success, while scores representing the dis- 
position to attribute failure to lack of effort 
(M—) were related to self-punishment for 
failure. Only weak relationships in the
expected direction were found. Each subject’s
M− score was then subtracted from his M+ score
and related to resultant self-reinforcement (self-reward
for success minus self-punishment for failure).

A high score on the (M+) − (M−) index
indicates the disposition relatively to ascribe
success, rather than failure, to effort. Twenty-one
subjects (33%) received (M+) − (M−)
scores of +1 and comprised an intermediate
group. Thus, the independent variable had to
be subdivided into thirds.

Table 2 shows that there is a monotonic
relationship between resultant effort ascriptions
and resultant self-reinforcement. A
Kruskal-Wallis analysis of variance indicates
a significant effect (χ² = 8.02, df = 2,
p < .02).⁵ Thus, the greater the tendency to
attribute success, rather than failure, to effort,
the greater the self-reward for success relative
to self-punishment for failure. Further nonparametric
analyses with the Mann-Whitney
U test yield significant differences between
the high and low and middle and low groups
(z = 2.88, p < .005; z = 2.00, p < .05, respectively).
The ability responses on the IAR
scale were unrelated to any of the dependent
variables.

5 Nonparametric analyses were employed because
of two atypical scores among subjects in the middle
group.


Discussion


The data indicate that attributions of success
and failure to effort mediate between
achievement outcomes and relative rewards
and punishments for achievement behavior. 
In contrast to prior research, in this investi- 
gation, the relationship between effort attribu- 
tions and affect was demonstrated when em- 
ploying an idiographic analysis of individuals’ 
self-attributional dispositions. 
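
The scoring behind Table 2 can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' scoring procedure: the function names are hypothetical, and the grouping rule (scores above +1 are "high", scores of +1 are "medium", scores below +1 are "low") is inferred from the statement that the 21 subjects scoring +1 formed the intermediate group. The median split on resultant self-reinforcement treats values equal to the median as "below", which is also an assumption.

```python
from statistics import median
from typing import List, Sequence


def resultant_effort_index(m_plus: int, m_minus: int) -> int:
    """(M+) - (M-): success-to-effort score minus failure-to-lack-of-effort score."""
    return m_plus - m_minus


def effort_group(index: int, middle: int = 1) -> str:
    """Assign a subject to a third; `middle` is the intermediate score (assumed +1)."""
    if index > middle:
        return "high"
    if index < middle:
        return "low"
    return "medium"


def resultant_self_reinforcement(chips_taken: int, chips_returned: int) -> int:
    """Self-reward for success minus self-punishment for failure."""
    return chips_taken - chips_returned


def above_median(values: Sequence[float]) -> List[bool]:
    """True where a value lies strictly above the sample median."""
    cut = median(values)
    return [value > cut for value in values]
```

Cross-tabulating `effort_group` against `above_median` for all subjects would yield a table with the layout of Table 2.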

Prior explanations of the influence of effort 
ascriptions on reward and punishment have 
focused on the control aspect of reinforce- 
ment. Inasmuch as effort generally is per- 
ceived as modifiable, effort expenditure theo- 
retically is controllable with appropriate 
reinforcement techniques. High reward for 
exertion and high punishment for lack of ef- 
fort ultimately should increase “trying.” But, 
in the present investigation, the effort-rein- 
forcement linkage was displayed when effort 
or motivation apparently was conceived as a 
stable disposition. Thus, the augmented reward
and punishment may indicate a prevailing
cultural ethic that one “ought” to try
hard, rather than (or in addition to) being a
means of behavioral modification. If this explanation
has validity, then there is a similarity
in the value systems of achievement and
morality. There are intent-reinforcement 
bonds in both of these systems, which we sur- 
mise are only partly functional in character. 


EXPERIMENT II: CAUSAL ASCRIPTION,
EXPECTANCY CHANGE, AND
PERFORMANCE


The investigations most relevant to the 
supposition that the stability of the causal 
dimension influences expectancy shifts com- 
pare behavior over a series of trials in skill 
(internal control) versus chance (external 
control) situations (Phares, 1957; Rotter, 
Liverant, & Crowne, 1961). In these experi- 
ments the subjects indicated their expectancy 
of success for each outcome, following success
or failure at the prior trial. The data reveal 
that increments in probability of success fol- 
lowing a success, and decrements in proba- 
bility of success after failure, are more likely 
and of greater magnitude given the perception
of the outcome as ability rather than
luck determined. 

Rotter (1966) cited these data to support 
his contention that locus of control influences 
expectancy of success. However, investigations
comparing expectancy shifts given ability
(an internal, stable factor) versus luck (an 
external, unstable factor) causal ascriptions 
confound the causal dimensions of locus of 
control and stability. We believe that the 
differential expectancy shifts in the Phares 
(1957) and Rotter et al. (1961) studies are 
mediated by attributions to the stability, 
rather than the control, dimension. In Experi- 
ment II this supposition is tested by relating 
expectancy of success during repeated failure 
experiences to the four perceived causal 
determinants of success and failure. In addi- 
tion, the relation of causal ascriptions to 
performance intensity also is examined. 


Method 
Procedure 


Subjects were 39 males, enrolled in either of two 
Gymnasiums (high schools) in Bochum and Dorsten, 
Germany.⁶ They were given a digit-symbol substitution
task to complete, with the digits 1-6 and the highly 
similar symbols of L, 1, T, r, L, and J. After a 
1-minute familiarization period and approximately
250 practice substitutions, the subjects were presented 
five cards, each containing four rows of 16 randomly 
selected digits from 1 to 6, with underlying space for the 
symbol substitution. The task was to complete all the 
substitutions on a card within the allotted time period, 
which was said to be 1 minute. Continual failure was then
induced by interrupting the subjects after they completed 
a varying amount of more than three rows of the sub- 
stitutions. One of the dependent variables, speed of per- 
formance, was the time required to complete the initial 
three rows (75%) of the task. 

The experimenter also informed the subjects that: 


In this experiment we are not only interested in 
whether you can complete the task in the given 
time, but also your feelings about what caused 
your outcome. In general, the results of this kind 
of task are dependent upon four factors: effort
. . . one can get positive or negative results because
he worked hard or did not put forth sufficient
effort; luck . . . one can do well or poorly
just because of chance; ability . . . one may or 
may not have the capacity to do the task; and 
task difficulty ... one can succeed because the 
task is easy or fail because it is difficult. 


Following each trial the subjects allocated the 
perceived causes of their failure to these four factors. 


6 There is extensive literature indicating that in 
achievement-related contexts, similar data are gen- 
erated by American and German subjects. In Experi- 
ment I of this paper, the subjects are American 
school children, while in Experiments II and III the 
subjects are German high school students and 
German teachers, respectively.

The attributions were in percentage figures (e.g., the
outcome was ___% dependent upon ability) and
were required to total 100%. Following the causal 
judgments, the probability of success for the next 
trial was estimated. The probabilities were rated on 
a scale ranging from 0% to 100%, ascending in 
intervals of 10%. The scale was anchored at the 
extremes with the statements: “Certainly will not 
complete the task-Certainly will complete the task.” 

In sum, attributions to the four causal elements 
(independent variables), and speed of performance 
and subsequent probability of success (dependent 
variables) were obtained during five repeated failure 
experiences. The causal attributions were constrained 
to total 100%, speed of performance was the time 
required to complete 75% of a difficult digit-symbol 
substitution task, and subjective probability was 
ascertained with a simple rating scale.


Results


FIG. 1. Expectancy of success as a function of above versus below median
ascription to the four causal elements. (High ascription indicates that a lack
of ability, a difficult task, lack of effort, and bad luck are the perceived causes
of failure.)

The relationships between the retrospective
causal ascriptions and the prospective expectancy
judgments are shown in Figure 1.
Figure 1 reveals that expectancy of success
following failure is greater given high than
low ascription to effort and luck. This means
that individuals who perceive their failures
as due to lack of effort or bad luck do not
decrease their expectation of success as markedly
as those who relatively do not believe
that not trying or hard luck were responsible
for their failures. Ascriptions to ability and
task difficulty reverse this relationship: low
rather than high attributions to these stable
elements are associated with greater goal ex- 
pectancies. This means that persons who per- 
ceive that their failures are due to low ability 
or a hard task decrease their probability of 
future success more than those who relatively 
do not believe that low ability or a hard 
task caused their failure. Analyses of variance 
yield the following four main effects: ability, 
F <1; effort, F = 4.43, df = 1/37, p < .05; 
task difficulty, F = 9.59, df =1/37, p < .005; 
luck, F= 2.55, df = 1/37, p< .25. (There 
are only three degrees of freedom among 
these four comparisons.) None of the inter- 
action terms reached significance. 

To test the hypothesis that the stability of 
the causal judgments determines probability 
shifts, attributions to the stable elements 
were combined, and the expectancies of the 
subjects scoring above the median on the 
ability plus task difficulty ascriptions were 
compared to the expectancies of the subjects 
scoring below the median on ascriptions to 
these two factors. (Because of the enforced 
dependency among the causal judgments, this 
comparison is identical to the respective below 
versus above median attribution to the com- 
bined unstable elements of luck plus effort.) 
Figure 2 shows that there is a significant main 
effect due to the causal ascription (F = 9.26, 
df = 1/37, p < .005) and a significant Trials
× Attribution interaction (F = 9.77, df =
1/37, p < .005). High attribution to the
stable factors (or low ascription to the unstable
factors) produces greater decrements
in the probability of future success following
failure than does low ascription to the stable
factors (high attribution to the unstable factors).
Further, the probability disparities
between the groups differing in attributional
judgments increase as repeated failures are
experienced.

FIG. 2. Expectancy of success as a function of
above versus below median ascription to the fixed
attributional elements. (Corresponds, respectively, to
below versus above median ascription to effort plus
luck.)

FIG. 3. Increments in speed of digit-symbol substitution,
in seconds (Trial 1 minus Trial 2), as a
function of above versus below median ascription
following Trial 1 to the four causal elements, and
to a combination of the stable factors.

Inspection of Figure 1 indicates that com- 
paring the expectancy shifts of subjects high 
versus low in attributions to ability plus ef- 
fort (internal factors), or task difficulty plus 
luck (external factors), would virtually elimi- 
nate between-group probability differences. 
That is, combining elements over the locus 
of control dimension masks the linkage be- 
tween causal inferences and expectancy of 
success. 
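
The dependency among the causal judgments noted above can be made concrete. Because the four percentages must total 100, the share given to the stable elements (ability plus task difficulty) is always 100 minus the share given to the unstable elements (effort plus luck), so a median split on one is the mirror image of a median split on the other. The sketch below assumes no scores tie exactly at the median; the function names are hypothetical, and any attribution values passed in are for illustration only, not data from the experiment.

```python
from statistics import median
from typing import Dict, List, Sequence

Attribution = Dict[str, float]  # percentage assigned to each causal element


def validate(attribution: Attribution) -> None:
    """Enforce the constraint that the four judgments total 100%."""
    if sum(attribution.values()) != 100:
        raise ValueError("causal attributions must total 100%")


def stable_share(attribution: Attribution) -> float:
    """Percentage ascribed to the fixed elements."""
    return attribution["ability"] + attribution["task difficulty"]


def unstable_share(attribution: Attribution) -> float:
    """Percentage ascribed to the variable elements."""
    return attribution["effort"] + attribution["luck"]


def median_split(values: Sequence[float]) -> List[str]:
    """Label each value 'above' or 'below' the sample median (ties count as below)."""
    cut = median(values)
    return ["above" if value > cut else "below" for value in values]
```

Applying `median_split` to the stable shares and to the unstable shares of the same subjects gives exactly opposite labels, which is why Figure 2 needs only one of the two comparisons.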

Figure 3 shows the increments in speed of 
performance from Trial 1 to Trial 2 as a 
function of the causal attribution for failure 
at Trial 1. The figure indicates that the time 
taken to complete three rows of digits on 
Trial 1 exceeds the time required on Trial 2. 
Performance among all attribution groups 
increases as a function of practice at this 
substitution task. However, high ascription
of failure to low ability or to a difficult
task results in relatively slower performance, 
and thus in lesser increments in time to com- 
plete three rows of digit substitutions, than 
low ascription to these fixed factors. Con- 
versely, high ascription of failure to bad luck 
or lack of effort results in greater increments
in speed of performance than low attributions
to these factors. The correlations between the 
causal ascriptions following Trial 1 and the 
performance increases between Trials 1 and 2 
are as follows: ability, r= —.05, ns; task 
difficulty, r = —.38, p < .01; effort, r= .25, 
p < .10; and luck, r = .24, p < .10. 

The correlational figures indicate that high
ascriptions to task difficulty and ability
(stable factors) have opposite effects on speed
of performance than high ascriptions to effort
and luck (unstable factors). Figure 3 therefore
also includes the performance of individuals 
high versus low on attributions to the com- 
bined ability and task difficulty factors 
(which again corresponds to the comparison 
between subjects respectively low and high 
on the combined luck and effort factors). The 
correlation between the score on this com- 
bined index and performance increments be- 
tween Trial 1 and Trial 2 is —.43 (p < .005). 
That is, if one tends to ascribe failure to lack 
of ability or a hard task, rather than to bad 
luck or lack of effort, then performance speed 
is relatively retarded. Following Trial 2 there
are no systematic changes in speed of per- 
formance as a function of attributions for 
failure. 
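
For readers who wish to reproduce this kind of analysis on their own data, the two quantities involved can be sketched as follows. This is a minimal illustration, assuming that the performance increment is the Trial 1 completion time minus the Trial 2 completion time (as in the caption to Figure 3) and that the reported coefficients are ordinary product-moment correlations; the text does not state which coefficient was used, and the function names are hypothetical.

```python
from math import sqrt
from typing import Sequence


def speed_increment(trial1_seconds: float, trial2_seconds: float) -> float:
    """Seconds gained on the first three rows from Trial 1 to Trial 2."""
    return trial1_seconds - trial2_seconds


def pearson_r(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Product-moment correlation between two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)
```

A negative value of `pearson_r` between the combined stable-factor ascription and `speed_increment` corresponds to the pattern reported above: the more failure is ascribed to ability and task difficulty, the smaller the gain in speed.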


Discussion 


Considering the probability shift data, the 
similarity of the consequences of luck and 
effort ascriptions, as differentiated from the 
behavioral effects of ascriptions to ability and 
task difficulty, indicates that ascription of 
failure to variable versus stable factors produces
disparate probability shifts. If one believes
that failure was caused by a lack of
exertion or bad luck, then he does not expect 
subsequent failure to as great an extent as 
when the outcome is ascribed to a lack of 
ability or a hard task. We presume that similar
relationships will be observed given success
rather than failure outcomes. 

The consistency and strength of the em- 
pirical data reported in this experiment lead 
us to believe that the locus of control and 
stability dimensions of causality have been 
confounded in the pertinent locus of control 
literature. In the Phares (1957) and Rotter 
et al. (1961) investigations, differences in 
expectancy shifts given ability versus chance
instructions are displayed because these instructions
activate differential attributions to
the stability, rather than to the locus of
control, causal dimension.

The observed relationships between causal 
ascriptions and speed of performance reveal 
that causal attributions also influence intensity
of motivation. The clearest conclusion
from the data is that when individuals believe 
a task is overly difficult, they discontinue 
trying. However, it is also evident that attribution
of failure to unstable factors relatively
facilitates performance. The Luck ×
Performance linkage, which approximates the
Effort × Performance association, was unexpected,
and is not readily interpretable.
Perhaps any causal ascription for failure
which maintains “hope” will have a facilitating
effect, or at least not result in performance
decrements. But at this time we have no
explanation of why subjects who indicate that
luck was an important determinant of failure
work harder than subjects not citing luck
inferences. Further discussion of this relationship
is neglected in this paper, pending
additional empirical findings.


EXPERIMENT III: THE CUE FUNCTION
OF TASK DIFFICULTY


Empirical investigations (e.g., Atkinson,
1958) have demonstrated that motivation is
most aroused at tasks of intermediate difficulty,
where the probability of success (Ps)
approximates .50. It also appears that individual
differences in achievement concerns
interact with task difficulty in determining
performance; persons high in achievement
motivation especially are motivated when
Ps = .50 (see Heckhausen, 1967).

An attributional interpretation of the rela- 
tionship between task difficulty and perform- 
ance intensity suggests that task difficulty is 
a cue concerning the importance of effort as 
a determinant of outcome. At a very easy
task, one may feel that success does not
necessitate great effort. On the other hand,
one may think that trying hard at a very
difficult task is a “waste” of energy. Thus,
tasks of intermediate difficulty may be most
likely to elicit the prospective inference that
effort is the salient determinant of task outcome.
Individuals will then expend most effort
when Ps = .50.


Method 
Procedure 


Results 


The results of this experiment are shown 
in Figure 4. The relationships between task
difficulty and both effort attributions and effort
strategy are clearly curvilinear. The
greatest elicitation of beliefs in effort as an
outcome determinant, and the place where
subjects state they should try hardest to
maximize gain, is at tasks of intermediate difficulty.
Friedman analysis of variance of ranks
yields highly significant probability of success
effects for both the outcome determinant and
strategy questions (χ² = 27.1, df = 4, p <
.001; χ² = 35.6, df = 4, p < .001, respectively).

GENERAL Discussion 

It has been argued that causal attributions 
mediate between antecedent stimuli and 
achievement-related behavior. Because effort 
is an internal causal ascription with both 


variable and fixed properties, it is an espe- 
attribution in achievement- 


ties, 
Fig. 4. Judged importance of effort as a causal
factor, and strategy of effort expenditure to maximize
gain, as a function of perceived task difficulty
(percentage of successful others). (Low rank number
indicates that effort is relatively unimportant.)


at tasks of intermediate difficulty (see Weiner,
1970). The data reported in this paper,
considered in conjunction with prior research
(see Weiner et al., 1971; Weiner & Kukla, 1970),
suggest: (a) Approach to achievement
activities is mediated by attributions of
success to internal factors (effort), which
heighten positive affect and increase the
likelihood of undertaking subsequent
achievement tasks. (b) Persistence in the face
of failure is mediated by attributions of
outcome to unstable factors (lack of effort and
bad luck), which result in the maintenance of
the subjective probability of success,
continued commerce with the task, and
performance facilitation. (c) Tasks of
intermediate difficulty are most likely to
elicit effort attributions which increase
motivation. Individual differences in
sensitivity to effort cues, and personal
preferences for effort or internal attributions,
could be responsible for the observed
interaction between achievement needs and task
difficulty.
CONTROL OVER STRESS, LOCUS OF CONTROL,
AND RESPONSE TO STRESS¹


B. KENT HOUSTON²

Lazarus proposed that the less control a person judges himself to have in
a threatening situation, the more stressful it will be. Belief about control 
was manipulated by telling one group of subjects they could avoid shock by 
not making mistakes on a task and telling another there was no way of 
avoiding shock. Subjects in the latter group reported more anxiety but evi- 
denced less physiological arousal than the former group. Lazarus also asserts 
that general beliefs about helplessness increase the perception of threat. 
Contrary to expectation, external locus-of-control subjects (assumed to hold 
general feelings of helplessness) manifested less physiological arousal in stress 
than internal-control subjects. Results congruent with previous findings were
obtained between locus of control and task performance. 


This study was conducted to investigate 
two factors which Lazarus (1966) asserts in- 
fluence a person’s perception of threat in 
potentially stressful situations. One factor is 
the person’s belief about his capacity to 
counter or avoid threat in that situation, in 
other words, the person’s belief about his 
control over the potential threat. The more 
control a person judges himself to have in a 
situation, the less threatening the situation 
will appear to him. Conversely, the less con- 
trol a person judges himself to have in a 
situation, that is, the more helpless he feels 
himself to be, the more threatening the 
situation will appear. 

To support his proposition, Lazarus pre- 
sented anecdotal material and described two 
experiments. One experiment demonstrated 
that individuals prefer predictable punishment 
over unpredictable punishment (D’Amato & 
Gumenik, 1960). The other experiment 
(Pervin, 1963) indicated that it is less anx- 
iety arousing for subjects to administer shock 
to themselves than for the experimenter to 
do it, and that predictable shock is less anx- 
iety arousing than unpredictable shock. In 
two other experiments not cited by Lazarus
(Champion, 1950; Corah & Boffa, 1970), it
was found that under certain circumstances,
subjects exhibited less stress in a situation
in which they believed that they could
terminate an electric shock than if they
believed that they could not terminate it. In
none of these experiments, however, could
individuals control threat in the sense of
actually being able to counter or avoid the
aversive stimulus; so, to date, there has been
no direct experimental verification of
Lazarus’ proposition.

¹ Thanks are due to Ray Mulry for his suggestions
on design of the study, to David S. Holmes for his
generous assistance on the manuscript, and to Jay
Fogelman for aid in scoring and analyzing the data.

² Requests for reprints should be sent to B. Kent
Houston, Department of Psychology, University of
Kansas, Lawrence, Kansas 66044.

Lazarus’ formulation was tested in this 
study by experimentally manipulating the 
subjects’ belief about countering or avoiding 
threat and assessing whether this differentially 
affected the subjects’ anxiety. In one condition 
(unavoidable shock), the subjects were led 
to believe that there was no way of avoiding 
an electric shock which would occur randomly 
while they performed a memory task. This
condition was designed to encourage a person 
to feel helpless since he could not counter 
or avoid the threat. In another condition 
(avoidable shock), the subjects were led to 
believe that they could avoid an electric 
shock by not making mistakes on a memory 
task. This condition was designed to encour- 
age a person to feel that he possessed some 
control over the situation since he could 
counter or avoid the threat. It was predicted 
that the subjects in the unavoidable-shock 
condition would be more anxious than subjects 
in the avoidable-shock condition. 

Anxiety was evaluated in terms of self- 
report and physiological arousal. Self-report 
of anxiety was measured by Zuckerman’s
(1960) Affect Adjective Check List (AACL), 
and physiological arousal was measured by 
heart rate. 

The second factor that Lazarus (1966) 
asserts influences perception of threat is a 
person’s general belief about his control over 
the environment. Specifically, if a person gen- 
erally believes that he lacks control (or is 
helpless), this should increase perception of 
threat, while the general belief that he has 
control should decrease perception of threat. 
In this study, general feelings of helplessness 
and control over the environment were identi- 
fied by the use of Rotter’s Locus of Control 
scale (1966). The Locus of Control scale is 
a measure of the extent to which an individ- 
ual believes that reinforcements in his life are 
under his control and are contingent upon his 
behavior (internal locus of control), as op- 
posed to believing that reinforcements are not 
under his control and are not contingent upon 
his behavior (external locus of control). It
was predicted that external-control subjects, 
since they generally feel more helpless, would 
be more anxious in both threat-of-shock 
conditions than internal-control subjects. 

Independent of perception of threat, in- 
vestigators have been interested in the rela- 
tion between the internal-external control 
variable and task performance (Rotter & 
Mulry, 1965; Watson & Baumal, 1967). 
Typically, it has been found that an individ- 
ual’s performance is better in a situation in 
which there is congruence between his general 
belief about locus of control and his belief 
about the locus of control in that particular 
situation than is the case where there is in- 
congruence in the general and situational be- 
liefs. Therefore, in the present study, it was 
predicted that internal-control subjects would 
perform better in the avoidable-shock condition
where they thought that they had some
control over the situation, while external- 
control subjects would perform better in the 
unavoidable-shock condition where they did 
not think they had control over the situation. 
The task used to assess performance was a 
specially constructed digits backward test, 
which is described below. 

Watson and Baumal (1967) have hypothe- 
sized that the relationships between locus of 
control and performance in different types 
of situations are found because individuals in
incongruent situations become anxious, and 
the ensuing anxiety interferes with task 
performance. Unfortunately, the study by 
Watson and Baumal did not include a reliable 
measure of anxiety, independent of task 
performance, to properly evaluate this hypothesis.
Therefore, an ancillary purpose of
the present study was to test Watson and 
Baumal’s hypothesis. Since the subjects in 
the avoidable-shock condition were told that 
receiving or avoiding an electric shock was 
contingent on their performance on a memory 
task, the situation was congruent with the 
generalized beliefs of internal-control subjects 
and incongruent with the beliefs of external- 
control subjects. Therefore, on the basis of 
the hypothesis of Watson and Baumal, it 
would be expected that in the avoidable-shock 
condition, external-control subjects would be 
more anxious than internal-control subjects. 
Since the subjects in the unavoidable-shock
condition were told that there was no way of
avoiding an electric shock which would occur 
randomly while they performed a memory 
task, the situation was congruent with the 
generalized beliefs of external-control subjects 
and incongruent with the beliefs of internal- 
control subjects. Therefore, on the basis of 
Watson and Baumal’s hypothesis, it would 
be expected that internal-control subjects 
would be more anxious than external-control 
subjects in the unavoidable-shock condition. 

In addition to the two threat-of-shock 
groups, a nonstress group was included in the
design for two reasons. One was to allow a 
check on whether subjects in the two shock 
groups did indeed experience stress. The 
second reason was to provide information on 
how internal-control and external-control sub- 
jects would perform and how anxious they 
would become when not in a stressful situa- 
tion. It was predicted that external-control 
subjects would perform better than internal- 
control subjects in the nonstress condition 
since none of the subjects were led to believe 
that they had control over the events in the 
situation, and this would be congruent with 
the general beliefs of external-control sub- 
jects. Applying Watson and Baumal’s hy- 
pothesis, it was also predicted that external- 
control subjects would be less anxious than 
internal-control subjects in the nonstress
condition since the belief that the subjects had
no control over the events in the situation
would be congruent with the general beliefs
of external-control subjects.


METHOD 


Sixty-six males fulfilling an introductory psychol- 
ogy experimental requirement served as subjects in 
this experiment. The subjects were assigned randomly, 
in equal numbers, to the three treatment conditions: 
nonstress, avoidable shock, and unavoidable shock. 

Subjects were run in individual sessions. After
the subject was told that the research was
concerned with the relation between mood,
physiological response, and performance on a
verbal task, an EKG electrode was attached to
each of his wrists, and a ground electrode was
attached to his right upper arm. The subject’s
heart rate was recorded on an E & M physiograph
(Model DMP-4A).

The session was divided into three parts: rest, 
assessment of limit for digits backward, and experi- 
mental. The procedures for the first two periods 
were the same for all subjects. Differential
instructions given in the experimental period,
described below, distinguished the nonstress,
avoidable-shock, and unavoidable-shock
groups.


Rest Period 


The subject was asked to relax for a period of 
approximately 5 minutes, during which he completed 
Rotter’s Locus of Control scale (1966). At the end 
of the rest period, the subject was asked to complete 
the Today form of the AACL by checking those 
adjectives that described how he felt at that 
moment. Heart rate was recorded continuously 
during the rest period, but only the number of heart 
beats in a 30-second interval during which the sub- 
ject was completing the AACL was used as the index 
of the subject’s resting level. For subjects completing 
the AACL in less than 30 seconds, their heart rate 
for 30 seconds was extrapolated from the time they 
took to complete the AACL. 
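
The extrapolation just described can be illustrated with a minimal sketch. It assumes simple proportional scaling of the beat count, since the report does not state the exact rule that was used. The function name and the sample values are hypothetical.

```python
WINDOW_SECONDS = 30.0  # length of the scoring interval used in the study


def beats_in_window(beats_counted: int, seconds_observed: float) -> float:
    """Scale a beat count to a 30-second equivalent.

    Assumption: proportional (constant-rate) extrapolation. The report only
    says the 30-second value was extrapolated from the completion time.
    """
    if not 0 < seconds_observed <= WINDOW_SECONDS:
        raise ValueError("seconds_observed must be in (0, 30]")
    return beats_counted * WINDOW_SECONDS / seconds_observed


# Hypothetical subject who completed the AACL in 20 seconds,
# during which 24 beats were counted.
resting_level = beats_in_window(24, 20.0)
```

A subject who used the full 30 seconds is left unchanged by the function, so the same call can be applied to every subject's resting record.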


Assessment of Limit for Digits Backward 


After completing the AACL, the Digits Backward 
Test from the Wechsler Adult Intelligence Scale 
(WAIS) was given with standard instructions 
(Wechsler, 1955, p. 41). The subject’s limit was 
determined by the level at which he twice failed to 
repeat a set of specified number of digits, and then 
he was given two practice sets of digits, each one 
digit less than his limit. 


Experimental Period 


Subjects in the nonstress group were told that 
they would be given some additional sets of digits 
like the ones they had just done and then were given 
six sets of digits, each one digit less than their indi- 
vidual limits. Subjects in both the avoidable-shock 
and unavoidable-shock groups were told that the 
experimenter was now interested in studying
physiological response to electric shock, and
“shock” electrodes were placed on the index
finger of the left hand. Subjects in the
unavoidable-shock condition
were told that they would be given some additional 
sets of digits like the ones they had just done and 
that they would receive one or more painful electric 
shocks sometime during this period. The subjects 
were told that a shock machine was programmed to 
deliver shocks on a random basis and that there was 
no way of avoiding them. Subjects in the avoidable- 
shock condition were told that they would be given 
some additional sets of digits like the ones they had 
just done and that they might receive a painful 
electric shock after every mistake they made in 
repeating the digits. The experimenter explained that 
he would depress a pedal on the floor after each 
mistake the subject made, and the pedal activated 
a shock machine which delivered a shock on a 
random basis whenever the pedal was depressed. 
Therefore, the subject was told, he might or might 
not receive an electric shock after every mistake, 
but surely he would receive a shock after some, if 
not all, of his mistakes. The experimenter went on 
to say, however, that the subject could avoid ever 
getting shocked by not making any mistakes. 
(Actually, none of the subjects in either threat-of- 
shock group was ever shocked.) The number of sets 
of digits that the subjects repeated correctly 
(maximum of six) constituted the dependent measure
for task performance. 

At the end of the experimental period, the sub- 
jects were asked to complete the AACL according 
to how they felt while repeating the last few sets of 
digits. Heart rate was recorded continuously during 
this period, but only the data for the last 30 seconds, 
during which the subjects were repeating the last few 
sets of digits and for which the subjects indicated 
their feelings on the AACL, were used as the measure 
of heart rate during the experimental period. 


RESULTS AND DISCUSSION 
Control over Threat and Response to Threat 


Change scores (rest to experimental periods)
derived for both AACL and heart rate scores³
were used to determine whether the threat of
shock was effective in stressing the subjects
and to test Lazarus’ hypothesis that belief
about control over threat influences the
perception of threat. Mean change scores for
AACL and heart rate for the three treatment
groups are presented in Table 1. For both the
AACL data and the heart rate data, 3 × 2
analyses of variance were carried out comparing
treatment conditions (nonstress, avoidable
shock, and unavoidable shock) and locus of
control (external control and internal
control).⁴ In these analyses, the sum of
squares for treatments, which has two degrees
of freedom, was partitioned into two components
with one degree of freedom apiece. One
component reflects the expected relationship
between the treatment means (Winer, 1962,
pp. 65, 70). Specifically, it was predicted that
the mean AACL change scores and mean heart
rate change scores for the unavoidable-shock
condition would be greater than the mean for
the avoidable-shock condition, and both of
these would be greater than the mean for the
nonstress condition. The other component was
the remainder sum of squares indicating how
much of the treatments sum of squares was not
accounted for by the expected relationship
between the treatment means. For the AACL
data, the special comparison for treatment
means was significant (F = 7.50, df = 1/60,
p < .01), while the remainder sum of squares
was not significant (F = .15, df = 1/60, ns).
These results and inspection of the data in
Table 1 indicate that (a) the subjects in both
stress conditions reported more negative
affect than the subjects in the nonstress
condition, thus providing evidence that the
threat of shock was successful in inducing
stress in the subjects; and (b) that the
subjects in the unavoidable-shock condition
reported more negative affect than subjects in
the avoidable-shock condition.


TABLE 1

MEAN CHANGES IN AACL AND HEART RATE FOR
THE THREE TREATMENT CONDITIONS

                          Treatment condition
Change score    Nonstress    Avoidable shock    Unavoidable shock
AACL              2.05           4.27               5.36
Heart rate         .94           3.91               2.38


³ As a check for the potential operation of the
law of initial values (Lacey, 1956; Wilder, 1962),
heart rate scores for the rest period were correlated
with the change in heart rate from rest to
experimental periods. The correlation was significant
(r = −.37, df = 1/64, p < .01), so a covariance
adjustment was made on heart rate change scores for
initial scores (McNemar, 1962).

⁴ Since a high score on the Locus of Control scale
indicates a belief in external control, the subjects in
each condition who obtained a score above the
median were assigned to an external-control group,
and the subjects who obtained a score below the
median were assigned to an internal-control group.
The median scores for the subjects in the three
conditions are as follows: nonstress = 9, avoidable
shock = 9, unavoidable shock = 8. These medians are
congruent with the measures of central value for
the Locus of Control scale reported for various
subject samples by Rotter (1966).
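
The partition of the treatments sum of squares described above can be sketched as follows. This is an illustration under stated assumptions, not the author's computation. The group size of 22 follows from 66 subjects assigned in equal numbers to three conditions, and the means are the AACL values in Table 1. The linear comparison weights are hypothetical, since the report does not give the coefficients that were used. The sketch also ignores the locus-of-control factor and the error term, so it does not reproduce the reported F ratios.

```python
def partition_treatment_ss(means, weights, n_per_group):
    """Split the between-treatments SS into a planned comparison and a remainder.

    Assumes equal group sizes. `weights` are comparison coefficients and
    must sum to zero.
    """
    if abs(sum(weights)) > 1e-12:
        raise ValueError("comparison weights must sum to zero")
    grand_mean = sum(means) / len(means)
    ss_treatments = n_per_group * sum((m - grand_mean) ** 2 for m in means)
    comparison = sum(w * m for w, m in zip(weights, means))
    ss_comparison = n_per_group * comparison ** 2 / sum(w * w for w in weights)
    return ss_treatments, ss_comparison, ss_treatments - ss_comparison


# Mean AACL change scores from Table 1: nonstress, avoidable, unavoidable.
aacl_means = [2.05, 4.27, 5.36]
# Hypothetical linear weights for the predicted ordering of the means.
linear_weights = [-1, 0, 1]
ss_treat, ss_comp, ss_rem = partition_treatment_ss(aacl_means, linear_weights, 22)
```

Under these assumed weights most of the treatments sum of squares falls in the planned comparison and little in the remainder, which is the same pattern the reported F ratios describe.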

When the heart rate data were analyzed, 
the results indicated that neither the special 
comparison for treatment means nor the 
component for the remainder sum of squares 
was significant (F = 2.43, df= 1/60, ns; 
F = 1.98, df = 1/60, ns, respectively). How- 
ever, inspection of Table 1 indicates that 
heart rate increased more for the threat-of- 
shock groups than for the nonstress group. 
This relationship approached significance 
(F = 3.23, df= 1/60, p< .09), a finding 
which provides additional evidence that the 
threat of shock was successful in inducing 
stress in the subjects. Inspection of Table 1 
also indicates that, contrary to prediction, 
heart rate increased more for the avoidable- 
shock group than for the unavoidable-shock 
group, which is opposite to the relation found 
for the AACL data. However, this difference 
did not reach statistical significance (F = 
1.18, df = 1/60, ns). The greater increase in 
heart rate for the avoidable-shock than 
unavoidable-shock group was probably due to 
the effort that the subjects in the avoidable- 
shock group expended in trying to get the 
digits correct in order to avoid the threat- 
ened shock. In other words, the subjects in 
the avoidable-shock group exhibited greater 
physiological arousal than the subjects in the 
unavoidable-shock group because the subjects 
in the avoidable-shock group were making a 
substantial effort to achieve control over the 
situation. This interpretation is supported by 
the following considerations. Subjects in the 
avoidable-shock group were given the oppor- 
tunity to avoid shock, but they were not led 
to expect that it would be easy or definite 
that they could do so. They were in effect 
challenged to exercise control in the situa- 
tion. Also, the digits backward task was dif- 
ficult since it required performance near the 
limit of the subjects’ ability. Therefore, it 
would take considerable effort to perform well 
enough not to make mistakes and, hence, not 
to get shocked, as the subjects in the 
avoidable-shock group were encouraged to do. 
In a subsequent unpublished study (Houston, 
1971) in which subjects were questioned about 
their reaction to an avoidable-shock condition, 
over 40% volunteered how hard they had
tried to concentrate on, memorize, or remember
the digits in order to avoid the threatened
shock. The physiological arousal observed in 
the avoidable-shock group, then, appears to be 
due to (a) apprehension about shock, which 
also influenced the increase in heart rate for 
unavoidable-shock subjects and (b) substan- 
tial effort to achieve control over the situation 
by performing the task well, which probably 
did not occur in the unavoidable-shock group 
since the subjects in this condition had been 
led to believe that control over the threat 
was not possible. 

In summary, then, it was found that the 
possibility of receiving electric shock generally 
increased the subjects’ report of anxiety and 
level of heart rate. As predicted, the increase 
in report of anxiety was greater when the 
subjects thought that they were helpless to 
do anything to counter or protect themselves 
from the shock than when they thought that 
their actions could lead to avoiding the shock. 
However, the subjects increased more in 
physiological arousal when they thought they 
could avoid shock but the avoidant behavior 
was near the limit of their capacity than when 
they thought they were helpless to avoid 
shock. 

The results indicate that a subject’s re- 
sponse to a potentially threatening situation 
in which he may or may not have control is 
more complex than Lazarus (1966) proposed. 
Difficulty in achieving control, evidenced here 
by the effort required to exercise control, 
appears to be an important factor. When a 
person has to perform a difficult task to avoid 
an aversive event, a different pattern of re- 
sponse to the situation is found than might 
be expected if it were not difficult to make 
an avoidant response. For future theory and 
research, it appears necessary to not simply 
consider what effect control over a situation 
has on response to threat but to consider 
the effect of various kinds and conditions of 
control on response to threat. 


Locus of Control and Response to Stress 


Watson and Baumal (1967) hypothesized
that incongruence between subjects’ general
and situational beliefs about locus of control
creates anxiety which may lead to performance
decrement. On the basis of this hypothesis, it
was predicted that external-control subjects
would be less anxious than internal-control
subjects in unavoidable shock and nonstress
and that internal-control subjects would be
less anxious than external-control subjects in
avoidable shock. This prediction was tested
through the use of the interaction terms from
the 3 × 2 analyses of variance described in
the previous section. The interaction between
treatments and locus of control was not
significant for either the AACL data (F = .35,
df = 2/60, ns) or the heart rate (F = 2.18,
df = 2/60, ns). The results of the present
study therefore offer no support for Watson
and Baumal’s hypothesis. That is, subjects in
incongruent situations (e.g., external-control
subjects in avoidable shock) did not evidence
greater anxiety as measured by AACL or
heart rate increases than did subjects in
congruent situations (e.g., external-control
subjects in unavoidable shock).⁵

On the basis of Lazarus’ (1966) formula- 
tion, it was predicted that external-control 
subjects would be more anxious in both threat- 
of-shock conditions than internal-control sub- 
jects. This prediction derives from the possi- 
bility that external-control subjects generally 
feel more helpless than do internal-control 
subjects and therefore will perceive the pos- 
sibility of shock in both threat-of-shock 
conditions as more threatening than internal- 
control subjects. Since no threat is involved 
in the nonstress condition, there is no reason 
to expect differences in anxiety between 
external-control and internal-control subjects 
in this condition. Analyses nonorthogonal
to the 3 × 2 analyses of variance described
in the preceding section were made to test the
expected relationship between means (Winer,
1962, pp. 65, 70). Specifically, it was expected
that increases in AACL and heart rate would
be greater for external-control than for
internal-control subjects across both
threat-of-shock conditions but there would be
no differences between external-control and
internal-control subjects in nonstress. The
analysis for AACL change scores was not
significant (F = .05, df = 1/60, ns). Analysis
of heart rate change scores and inspection of
the data revealed a significant relationship
among the means (F = 9.47, df = 1/60, p < .01).
As expected, there was no significant
difference between external-control and
internal-control subjects in change in heart
rate in nonstress (F = .12, df = 1/60, ns).
However, contrary to what was expected, it was
found that internal-control subjects increased
significantly more in heart rate than
external-control subjects across the two
threat-of-shock conditions (F = 9.47,
df = 1/60, p < .01).

⁵ This statement takes into consideration the
possibility described earlier that increase in heart rate
in avoidable shock may reflect both apprehension
and effort. One would expect from Watson and
Baumal’s hypothesis that external-control subjects
who were in an incongruent situation in avoidable
shock would be more anxious than internal-control
subjects and therefore increase substantially in heart
rate. Even if the internal-control subjects, who were
in a congruent situation in avoidable shock, may
have increased in heart rate as a result of expending
effort in attempting to achieve control over the
situation, one would expect that the increase in
heart rate for external-control subjects would at
least approximate if not exceed the increase in heart
rate for internal-control subjects. Internal-control
subjects, however, increased significantly more in
heart rate than external-control subjects (F = 9.27,
df = 1/60, p < .01), which suggests that
internal-control subjects may not only have expended more
effort but were more aroused by threat than
external-control subjects.
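
The heart rate and locus-of-control comparisons reported in this section rest on two preparatory steps given in Footnotes 3 and 4: the covariance adjustment of heart rate change scores for resting level, and the median split on the Locus of Control scale. The sketch below is one plausible reading of those steps, not the author's procedure. It assumes a linear regression adjustment and leaves scores tied at the median unassigned, because the report does not say how ties were handled. All names and data are hypothetical.

```python
from statistics import mean, median


def adjust_for_initial(initial, change):
    """Remove the linear dependence of change scores on initial level.

    Assumption: adjusted = change - b * (initial - mean(initial)), where b
    is the least-squares slope of change on initial.
    """
    mean_initial = mean(initial)
    mean_change = mean(change)
    sxx = sum((x - mean_initial) ** 2 for x in initial)
    sxy = sum((x - mean_initial) * (y - mean_change)
              for x, y in zip(initial, change))
    slope = sxy / sxx
    return [y - slope * (x - mean_initial) for x, y in zip(initial, change)]


def median_split(scores):
    """Label scores above the median 'external' and below it 'internal'."""
    cutoff = median(scores)
    labels = []
    for score in scores:
        if score > cutoff:
            labels.append("external")
        elif score < cutoff:
            labels.append("internal")
        else:
            labels.append(None)  # tie at the median: handling not reported
    return labels


# Hypothetical resting beat counts and rest-to-experimental changes.
resting = [30, 34, 38, 42, 46]
change = [6, 5, 3, 2, 1]
adjusted = adjust_for_initial(resting, change)

# Hypothetical Locus of Control scores for five subjects.
groups = median_split([5, 7, 9, 11, 13])
```

The adjusted scores keep the original mean change but are uncorrelated with resting level, which is the property a correction for the law of initial values is meant to secure.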

It is clear, then, that these data do not 
support the prediction derived from Lazarus
(1966), that people who generally believe
they have control over the environment
perceive situations as less stressful than persons
who generally believe that they are helpless in 
interacting with the environment, at least 
when general beliefs regarding control are 
measured by Rotter’s Locus of Control scale 
(1966). The data indicate that while internal- 
control and external-control subjects do not 
differ in reports of anxiety in stressful situa- 
tions, internal-control subjects evidence sig- 
nificantly greater physiological response than 
do external-control subjects. 

A possible explanation for the difference 
between external-control and internal-control 
subjects in response to stress is the following: 
External-control subjects view forces outside 
themselves as being responsible for their fate 
and do not become very aroused when faced 
with threat because they resign themselves to 
the situation. Internal-control subjects
become highly aroused when threatened, but
they are reluctant to report anxiety; hence,
a significant difference in heart rate change
scores but no significant difference in AACL
change scores was found between
internal-control and external-control subjects in
the threat conditions. The interpretation that
internal-control subjects are defensive about
reporting anxiety is supported by two research
findings which indicate that internal-control
subjects are more defensive and report less
chronic anxiety than external-control
subjects. First, Tudor (1970) reported
correlations which indicate that
internal-control subjects scored significantly
higher than external-control subjects on both
the MMPI Denial scale (Little & Fisher, 1958)
and MMPI K scale. Second, Watson (1967)
reported correlations between the Locus of
Control scale and the Manifest Anxiety Scale
(Taylor, 1953), indicating that
internal-control subjects report fewer
characteristics of anxiety than do
external-control subjects. The failure to
confirm Lazarus’ proposition then may be due to
the possibility that the scale chosen to
measure general beliefs about control over the
environment also measures differences in the
use of defensive maneuvers. Differences in the
use of defensive maneuvers may have therefore
obscured the expected relationships between
general belief about control and response to
threat.


Locus of Control and Performance 


To test the predicted relations between 
locus of control and performance, a 3 X 2 
analysis of variance was performed on the 
digits backward data comparing treatment 
conditions (nonstress, avoidable shock, and 
unavoidable shock) and locus of control 
(external control and internal control; see 
Footnote 4). In this analysis, the Locus
of Control × Treatments interaction sum of
squares with two degrees of freedom was
partitioned into two components with one
degree of freedom apiece. One component
reflects the expectation that in the avoidable-
shock condition the internal-control subjects 
would perform better than the external-control 
subjects and that in the unavoidable-shock 
and nonstress conditions the external-control 
subjects would perform better than the 
internal-control subjects. The other compo- 
nent of the interaction sum of squares is 
the remainder which is not accounted for by 
the predicted relations. The predicted
interaction was significant (F = 4.68,
df = 1/60, p < .05), while the remainder was
not (F = 1.28, df = 1/60, ns). This result is
consistent with previous findings (Rotter & 
Mulry, 1965; Watson & Baumal, 1967) and 
provides support for the hypothesis that sub- 
jects will perform better in situations in 
which there is congruence between their be- 
liefs about locus of control in general and 
their beliefs about the locus of control in the 
particular situation in which they are working. 
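
The partition of the interaction sum of squares into a predicted component and a remainder can be sketched in the same way. The cell means below are hypothetical, since the report gives no cell means for the digits backward data. The cell size of 11 assumes that the median split divided each condition of 22 subjects evenly. The comparison weights are one hypothetical coding of the predicted pattern, chosen so that every row and column sums to zero, which makes the comparison a pure interaction component.

```python
def partition_interaction_ss(cell_means, weights, n_per_cell):
    """Split the interaction SS of a two-way layout into a planned
    comparison and a remainder. Assumes equal cell sizes."""
    rows, cols = len(cell_means), len(cell_means[0])
    # A pure interaction comparison has zero row sums and zero column sums.
    for i in range(rows):
        if abs(sum(weights[i])) > 1e-12:
            raise ValueError("each row of weights must sum to zero")
    for j in range(cols):
        if abs(sum(weights[i][j] for i in range(rows))) > 1e-12:
            raise ValueError("each column of weights must sum to zero")

    grand = sum(sum(row) for row in cell_means) / (rows * cols)
    row_means = [sum(row) / cols for row in cell_means]
    col_means = [sum(cell_means[i][j] for i in range(rows)) / rows
                 for j in range(cols)]
    ss_interaction = n_per_cell * sum(
        (cell_means[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(rows) for j in range(cols))

    comparison = sum(weights[i][j] * cell_means[i][j]
                     for i in range(rows) for j in range(cols))
    sum_sq_weights = sum(w * w for row in weights for w in row)
    ss_comparison = n_per_cell * comparison ** 2 / sum_sq_weights
    return ss_interaction, ss_comparison, ss_interaction - ss_comparison


# Rows: nonstress, avoidable shock, unavoidable shock.
# Columns: internal control, external control.
# Hypothetical mean numbers of digit sets repeated correctly (maximum of six).
digit_means = [[4.0, 4.5],
               [4.6, 3.8],
               [3.9, 4.8]]
# Hypothetical weights: internals better in avoidable shock, externals
# better in the nonstress and unavoidable-shock conditions.
predicted_weights = [[-1, 1],
                     [2, -2],
                     [-1, 1]]
ss_int, ss_pred, ss_remainder = partition_interaction_ss(
    digit_means, predicted_weights, 11)
```

Because the weights lie entirely within the interaction, the predicted component can never exceed the interaction sum of squares, and the remainder is what the predicted pattern leaves unexplained.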


CONCLUSION 


The results of this experiment reveal the 
following. (a) In general, the subjects found 
a threatening situation in which they had no 
control more anxiety provoking than one in 
which they had some control over the situa- 
tion. (b) Subjects became more physiologi- 
cally aroused (as measured by heart rate) 
when they thought they could exercise some 
control over the situation than when they 
thought they had no control. The greater 
arousal probably resulted from the effort that 
the subjects exerted in attempting to achieve 
control. (c) Subjects who characteristically 
see themselves as being in control of events 
in their lives (as measured by Rotter’s, 1966, 
Locus of Control scale) became more physio- 
logically aroused under stress than those who 
see control of events in their lives as being 
external. This finding may be attributed to 
the possibility that the Locus of Control scale 
measures defensiveness. (d) Subjects per- 
formed better in situations in which there 
was congruence between their beliefs about 
locus of control in general and their beliefs 
about the locus of control in the specific 
situation in which they were working. 

The results of this experiment suggest the 
following conclusions. (a) When investigating
what effect control over a situation has on 
response to threat, it is necessary to consider 
the effects of various kinds and conditions of 
control. (b) The Locus of Control scale not 
only measures beliefs about control of events 
in a person’s life but may also measure tend- 
encies for use of defensive maneuvers. (c) The 
Locus of Control scale predicts task per- 
formance under conditions of certain rein- 
forcement contingencies, but the data of this 
experiment offer no new evidence for the 
processes responsible.


ROLE TAKING) AND REACTIONS TO
FILM VIOLENCE ¹


Many conditions influence the likelihood that people will display open aggres-
sion after viewing film violence. The present experiment sought to determine
whether implicit aggressive verbalizations would facilitate aggressive reactions 
to movie violence. In a 3 × 2 factorial design, one variation in these implicit
verbalizations asked subjects, all made to be angry toward the experimenter’s 
confederate, either to (a) imagine themselves as the movie character who 
wins the film fight, (b) imagine themselves as the judge who watches this 
fighter, or (c) were not given any “imagine self” instructions. For the other 
variation, half of the subjects were to press a button every time they saw the 
film aggressor hit his opponent. The deliberately provoked men identifying 
with the fight victor were more aggressive to the confederate than either 
those taking the role of the judge or the control subjects. Within the group
identifying with the film aggressor, the more frequently the subjects pushed 
the button (presumably thinking “hit” each time), the greater was the 
number of shocks they later gave. Questionnaire measures suggested that the 
identification with the film aggressor had also engendered hostility toward the
experiment and experimenter. The role of demand characteristics is examined.


Portrayals of violence in the mass media 
obviously do not always produce aggressive 
reactions from the audience. Whether or not 
the people seeing (or reading about) this vio- 
lence will behave aggressively themselves de- 
pends upon a number of factors, some of 
which have been identified in previous re- 
search (see Bandura, 1965; Berkowitz, 1970). 
To single out just a few of these intervening 
conditions, the observer is more likely to at- 
tack an available target soon after encounter- 
ing the media event: if the witnessed scene 
had led him to think that aggression could 
have positive consequences for him and/or the 
observed violence was ethically justified, if he 
was angry with the available target person 
and/or was emotionally aroused or excited at 
the time (without feeling anxious or guilty), 
and if the possible target was semantically
associated with the victim of the observed vio-
lence or with aggressive behavior generally.
Berkowitz believes these factors (and pos-
sibly others as well) facilitate the aggressive
reaction but are not necessary for the response
to occur; they increase the chances that
people in the audience will act aggressively.


1 This experiment was carried out under Grants
MH17405 and MH17955 from the National Institute
of Mental Health to Leonard Berkowitz. The prob-
lem was formulated by Leonard Berkowitz, while
Charles W. Turner set up the procedure and collected
and analyzed the data.

2 Now at the University of Utah.

3 Requests for reprints should be sent to Leonard
Berkowitz, Department of Psychology, University of
Wisconsin, Madison, Wisconsin 53706.

Several processes undoubtedly contribute to
the violent aftereffects of media aggression.
These include learning, disinhibition, and re- 
sponse elicitation. Berkowitz’s research has 
been devoted primarily to the latter two proc- 
esses, but especially to the conditions govern- 
ing the elicitation of aggressive responses. 
His classical conditioning model (Berkowitz, 
1970) holds that the observed event, whatever 
else it does, often serves as a conditioned 
stimulus that evokes associated responses. If 
the witnessed scene is clearly aggressive in 
nature, that is, has aggressive meaning for 
the observer (Berkowitz, 1970), aggressive 
reactions will be elicited. Most viewers in 
naturalistic settings have only short-lived 
implicit reactions, such as aggressive ideas 
and images. However, if the facilitating condi-
tions are at work, there could be open
aggression as well.

The observer’s thoughts at the time he
encounters the media violence can affect the 
ease with which the depicted event will evoke 
aggressive reactions from him. Aggressive 
thoughts might facilitate these reactions, 
while ideas incompatible with aggression 
could lessen the probability of violent conse- 
quences. It is especially important to note 
if the viewer is thinking of acting aggres- 
sively himself. For example, if he has an 
aggressive “preparatory set,” expecting to 
perform the responses involved in the mea- 
surement of the aggressive aftereffects, the 
observed violence might evoke stronger im-
plicit aggressive responses than would other-
wise occur. He could be thinking that at the
conclusion of the film he would have an
opportunity to administer electric shocks to
his partner.⁴ In this case, the movie violence
might readily elicit strong anticipatory shock-
giving responses since both have the same
aggressive meaning and the viewer is set to
make these responses. Then, soon afterwards, 
when the time came to give the shocks, these 
internal ongoing reactions could add to the 
strength of the electrical punishment he would 
administer. 

There is another possibility as well. 
Whether or not the viewer expects to give 
shocks in the near future, he might imagine 
himself to be the film aggressor. Here, he 
covertly performs the actions he sees. As he 
symbolically takes the role of this aggressor, 
the movie violence elicits strong aggressive 
responses within him which then intensify his 
later attacks. This process involves more than 
close attention to the film character. In addi-
tion to watching this person, the observer
implicitly carries out his behavior. He thinks 
of himself as hitting the movie victim along 
with the film aggressor. Bandura, Grusec, and 
Menlove (1966) have reported some related 
findings, although they were concerned pri- 
marily with imitative learning. When young- 
sters watched a film showing an adult exhibit 
novel aggressive action, those children who 
had verbalized the model’s behavior as he per-
formed it were later better able than non-
verbalizing viewers to reproduce these novel
responses. Translating the actions into words
had facilitated their recall. The observer who
imagines himself to be the aggressive model
might be doing the same kind of implicit
verbalizing, leading to better memory for the
witnessed violence, and also easier elicitation
of these and other semantically associated
responses by the aggressive cues in the
environment.


4 The procedure followed in many of the experi-
ments in this area creates this kind of expectation in
the subjects and thus probably enhances the chances
that the screen violence will have openly aggressive
aftereffects.

We could refer to this implicit role taking 
as identification. Weiss (1969) believes that 
“identification” is the central concept in the 
interpretation of mass media effects, and sug- 
gests that when someone identifies with a 
fantasy character, “he participates vicariously 
in the events, feelings, and behavior that 
relate to the object of identification... 
[p. 98].” Of course, identification can be 
defined operationally in many different ways. 
Tannenbaum and Gaer (1965) studied this 
mechanism in terms of the extent to which 
an observer rated himself as similar to a 
particular film character. They found, as we 
would also expect, that the greater this iden- 
tification, the stronger were the viewers’ 
reactions to the movie character’s plight. The 
subjects who had regarded themselves as simi- 
lar to the person on the screen might have 
imagined themselves to be this individual as 
they watched the filmed action so that they 
vicariously experienced the events happening 
to him. 

In the present experiment, one group of 
deliberately provoked subjects was induced 
to have aggressive thoughts as they watched 
film violence by giving them two different 
kinds of instructions. They were asked to 
think of themselves as a particular character 
in the movie (who, it turns out, wins the 
filmed fight). We expected that this identifi- 
cation with the film aggressor would heighten 
the aggression-eliciting effects of his witnessed 
behavior. In addition, the men in this par-
ticular condition were required to make sym-
bolic aggressive responses as they watched
the movie aggressor’s punches (following a
procedure somewhat closer to the facilita-
tive symbolization manipulation studied by
Bandura et al., 1966, than the imagine-self
variation just described). The viewers were
encouraged to think “hit” with each blow
they saw. As a consequence, the screen vio- 
lence should also evoke comparatively strong 
aggressive reactions within them. Each of 
these two treatments—the imagine self and 
the implicit verbalization—should, we felt, 
lead to enhanced attacks upon the subjects’ 
tormentor at the conclusion of the movie, and 
the two should combine to make this group 
more aggressive toward the target person 
than any other group in the study. 


METHOD 
Subjects 


Seventy-eight male undergraduates enrolled in 
introductory psychology classes at the University of 
Wisconsin participated as subjects, and for their 
service received partial credit toward their course 
requirements. Other subjects had also been “run.”
Seven men who expressed a clear awareness of the 
experimental deceptions were eliminated from the 
experiment and replaced by other students. In addi- 
tion, seven other subjects were excluded because of 
breakdowns in the apparatus (five), or as a result 
of procedural errors (two). None of these rejected 
groups was concentrated in any particular experi- 
mental condition. (Thus, only one of the suspicious 
rejects was in the condition required to identify with 
the film aggressor and also explicitly asked to think 
of the observed hitting.) 


Procedure 


Six experimental conditions were established in a 
3 × 2 factorial design, with 13 subjects in each condi-
tion. Every subject was first deliberately insulted
by a peer and then was shown the 7-minute prize 
fight excerpt from the movie Champion, routinely 
used by Berkowitz in the Wisconsin research pro- 
gram. The first set of experimental treatments had 
to do with the nature of the identificand as the 
subjects viewed the film. Each person was to imagine
himself to be either (a) the film character named 
Dunne, (b) a judge watching Dunne, or (c) was 
given no specific identification requirement. The 
second set of treatments involved the type of re-
sponse made by the observer while watching the
movie: he was either to press a button whenever the
fight victor, Dunne, hit the loser, Kelly, or was not 
required to make any such motor response. 

The experimenter met each subject, and an 
experimental confederate posing as a fellow subject, 
in a waiting room outside the experimental room, 
and directed the two men to prearranged positions 
in separate booths inside the experimental room. The 
booths were open at the front but were situated 
so that the men could not see each other. The sub- 
jects were informed that the experiment would em- 
ploy mild and harmless electric shocks. The study 
ostensibly was concerned with relationships between
several physiological variables and various aspects of
“social learning and social perception” in complex 
situations which employ “negative reinforcement or 
punishment.” The subjects were given an oppor- 
tunity to withdraw from the experiment if they 
objected to working with electric shocks. One person 
who did object was dismissed from the experiment 
at this point and was replaced by another subject. 

The experimenter then connected two sham physio- 
logical devices to the subject’s nondominant hand 
and went through a pretense of adjusting his record- 
ing equipment which had been placed, supposedly, 
behind a screen out of the subject’s view. The experi-
menter described the first task for the two men as
dealing with creative problem solving under the
stress of punishment. One of the two men would
have 5 minutes in which to think of five ideas for a
promotional campaign to improve the popularity and
record sales of a recording artist. He was to do this
under the threat of punishment since the second
person would evaluate the problem solver’s answers
by giving him from one to five shocks for each idea.
One shock had to be given, and indicated a very
favorable judgment. The worse the idea, the greater
number of shocks that was to be administered, up
to a maximum of five. The subject was chosen,
apparently arbitrarily, to be the problem solver.

When a 5-minute problem-solving period had
elapsed, the experimenter attached the electrodes
to the wrist of the subject’s dominant arm. The
evaluation procedure was again explained to the two
men. The problem solver (the subject) was to read
the proposals aloud one by one. Immediately after
each idea, the other person, the evaluator, was to
administer one to five shocks as his evaluation of
that proposal. After making sure that the instructions
were understood, the experimenter indicated that the
two men were to proceed. Following a prearranged
schedule, the confederate gave each subject two
shocks for his first proposal, four for the second,
five for the third, three for the fourth, and four for
the fifth. These relatively negative evaluations were
designed to make the subject moderately angry with
his supposed fellow subject.

After removing the shock bracelet from the sub-
ject’s wrist, the experimenter explained that the next
part of the experiment was still concerned with
physiological reactions to complex stimulus situations,
but that it involved physiological correlates of “social
perception.” He said that this task would not be
concerned with the accuracy or quality of judgments
as the former part had been. It was also explained
that since the two men would have different jobs
in this phase, their instructions had been written
out for them. These written instructions established
the experimental conditions.


Experimental Manipulations 


The instruction cards producing the six conditions 
were randomly ordered and placed face down in a 
stack between the two men. At the appropriate time, 
the experimenter gave the subject the top card
without seeing its contents so that neither he nor
the confederate could determine which experimental 
treatment was applied to the subjects (the confed- 
erate was given a blank card). Each of the cards 
first indicated that the film which they were about 
to see depicted a championship match, “in which 
Kirk Douglas plays a fighter named Kelly” and his 
“opponent is named Dunne.” 
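
A randomly ordered stack of instruction cards of the kind described above could be prepared roughly as follows. This is only a sketch: the names `build_card_stack`, `IDENTIFICANDS`, and `THINK_HIT` are hypothetical, and the article does not say how the random ordering was actually produced.

```python
import random
from itertools import product

# The two manipulated factors of the 3 x 2 design described in the text.
IDENTIFICANDS = ("Dunne", "judge", "control")
THINK_HIT = (True, False)
SUBJECTS_PER_CELL = 13  # from the Procedure section


def build_card_stack(seed=None):
    """Return a shuffled list of (identificand, think_hit) instruction cards.

    Every one of the six cells appears exactly SUBJECTS_PER_CELL times, so
    dealing from the top of the stack fills all conditions equally while
    keeping the dealer blind to the order.
    """
    rng = random.Random(seed)  # a seed is optional and only aids replication
    cards = [
        cell
        for cell in product(IDENTIFICANDS, THINK_HIT)
        for _ in range(SUBJECTS_PER_CELL)
    ]
    rng.shuffle(cards)
    return cards
```

Shuffling a fixed deck, rather than drawing a condition at random for each subject, guarantees the equal cell sizes (13 per condition, 78 in all) that the design reports.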

For the self-imagining (or identification) condi- 
tions, one-third of the subjects were induced to think 
of themselves as Dunne. Another one-third of the 
subjects were induced to think of themselves as 
a watching judge in order to control for the effects 
of the covert role taking per se. The remaining sub- 
jects were not given any specific instructions for 
identifying with the filmed character: 


[Imagine self as Dunne instructions]: You are to
think of yourself as the character named Dunne, 
Kirk Douglas’ opponent in the film. Try to place 
yourself in his shoes during the fight and react 
the way he would react to the fight. 

[Imagine self as a watching judge]: Try to think
of yourself as a judge watching the fight whose 
job it is to watch Dunne specifically and to judge 
his performance. Place yourself in this judge’s 
shoes and try to react the way he would react to 
the fight. 


The control subjects received no specific imagine-self 
instructions. 

Orthogonal to the identification manipulation, one-
half of the men in each group were made to be
highly aware of the hits Dunne inflicted on his 
opponent so that they presumably would think “hit” 
with each blow. For these subjects the instruction 
card ended with this statement: 


To help focus your attention in observing the film, 
take the black switch connected to the brown cord 
lying on the table and hold it in your hand. Each 
time you see Dunne hit the other boxer, press the 
red button at the end of the cord. 


For the other half of the subjects, this paragraph 
was omitted from the instructions. In each condi- 
tion the hand switch was lying in a conspicuous 
position near the subject. If we assume that the sub-
jects verbalized “hit” to themselves every time
they pressed the button, this treatment is similar to 
the Bandura et al. (1966) “facilitative symbolization” 
treatment. 

The prize-fight scene from the movie Champion 
was then introduced. This was a brief story summary 
which implied that Kirk Douglas (Kelly) deserved 
the beating he was about to get, and which appar- 
ently serves to lower the observer's inhibitions 
against aggression (Berkowitz, 1970; Berkowitz & 
Rawlings, 1963). Both the introduction and the audio 
portion of the film were presented through earphones.


Dependent Measures 


The number of times each subject in the think 
“hit” condition pressed the button during the film
was automatically recorded. (Since the recorder
could be heard by the experimenter and the con- 
federate, they were no longer blind as to this specific 
variation.) None of the subjects receiving the 
button-pressing instructions failed to utilize the hand 
switch during the movie. 

When the film ended the experimenter explained 
that the third task for the two men would again 
be concerned with “the physiological concomitants 
associated with the effects of negative reinforcement 
on social performance.” This time, however (they 
were told), the roles would be reversed. The con- 
federate would now have to think up five ideas for 
a promotional campaign for a gas station. The 
experimenter also explained that this problem had 
earlier been found to be “easier” than the first one 
(the recording artist campaign) so that only 2 min- 
utes would be allowed for the five ideas. In actual- 
ity, this shortened time was established in order to 
reduce the time between the end of the film and 
the subject’s evaluation of his partner since the 
effect of the movie could be quite transitory. We 
also hoped that this change would make it less likely 
that the subject would use the number of shocks he 
had received earlier as a “base line” in shocking 
the confederate. 

At the end of the 2 minutes (during which the 
confederate supposedly had been thinking of his an- 
swers), the experimenter returned to the confederate, 
pretended to place a shock electrode on his wrist, 
and repeated the evaluation procedure exactly as 
it was presented earlier in the session. In all cases 
the confederate read the five following ideas which 
were to be evaluated by the subject: (a) advertising 
in newspapers and pamphlets; (b) giving out trading 
stamps; (c) lowering the prices; (d) making the gas 
station more modern and luxurious; (e) keeping 
the gas station open longer hours. 

The primary dependent measure was the number 
of shocks administered by the subject for each trial. 
The total duration of all shocks given by the subject 
was also recorded. 


Other Measures 


After the subject’s “evaluation” of his partner, 
he was given several questionnaires designed to assess 
his feelings about himself, the film characters, his 
partner, the experiment, and the experimenter. The 
first questionnaire was a modification of the Nowlis 
(1965) Mood Adjective Check List which employs 
24 items (e.g., “angry,” “rebellious,” and “forgiv-
ing”). The items are rated on a 5-step scale, ranging
from “definitely applies” to “definitely does not
apply.” Nowlis’ factor analysis of these items indi-
cated that they form eight three-item factors
(Nowlis, 1965), and the clusters were analyzed 
separately. 

In addition, a second unvalidated questionnaire 
was employed to assess the effectiveness of the identi- 
fication manipulation. Subjects were asked to rate 
their feelings toward the film characters and the
perceived similarity of the partner to the film char-
acter. Only one item, to be discussed later, produced
a significant result.

A final questionnaire asked the subjects to rate
their feelings toward the experimenter on five 11-
step items. When each subject completed this ques-
tionnaire, he was asked to place it in a sealed en-
velope addressed to the Psychology Department
office, so that his comments supposedly were anony-
mous. He was led to believe that negative evaluations
of the experimenter on the questionnaire could have
deleterious effects for this person.


TABLE 1

ANALYSIS OF VARIANCE OF NUMBER OF
SHOCKS GIVEN TO CONFEDERATE

Source                     df      MS        F
Identificand (A)            2     2.78      3.28*
Think “hit” (B)             1      .38
Confederate (C)             1     1.43
A × B                       2     1.42
A × C                       2     1.16
B × C                       1      .02
A × B × C                   2      .52
Subjects within groups     66      .85
Ideas (D)                   4    15.24     14.56**
A × D                       8    [illegible]
B × D                       4     1.13
C × D                       4      .70
A × B × D                   8     1.33
A × C × D                   8    [illegible]
B × C × D                   4      .58
A × B × C × D               8      .49
Subjects × D              264     1.05

* p < .05.
** p < .001.
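
The F ratios and error degrees of freedom in Table 1 can be checked by hand. The sketch below assumes that the error mean squares are .85 and 1.05 (the scan dropped some decimal points, so the .85 is a reading of the printed "85") and that the twelve between-subject groups come from the 3 × 2 design crossed with the confederate factor (df = 1, hence two confederates).

```python
def f_ratio(ms_effect, ms_error):
    """F statistic: effect mean square over the matching error mean square."""
    return ms_effect / ms_error


# Between-subjects error df: subjects minus the number of groups.
n_subjects = 78
n_groups = 3 * 2 * 2  # identificand x think "hit" x confederate (assumed 2)
n_ideas = 5

df_subjects_within = n_subjects - n_groups           # 66, as printed
df_subjects_by_ideas = df_subjects_within * (n_ideas - 1)  # 264, as printed

# Mean squares as read from Table 1 (decimal placement is an assumption).
f_identificand = f_ratio(2.78, 0.85)   # about 3.27; the table prints 3.28
f_ideas = f_ratio(15.24, 1.05)         # about 14.51; the table prints 14.56
```

The small gaps between the recomputed and printed F values are consistent with the mean squares having been rounded to two decimals before publication.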


Subject Suspiciousness


The subjects were interviewed at the end of the 
experiment about their suspicions of the experimental 
manipulations. In this debriefing session three pri- 
mary sources of suspicion were evaluated: (a) the 
subject was suspicious that his partner was not a
real subject; (b) the subject suspected that his part-
ner somehow did not have control over the number
of shocks that the subject received; (c) the subject
suspected that he was not actually shocking his
partner in the final evaluation session.⁵ Each subject’s
suspiciousness was then rated on a 0-2 scale. 


5 Our focus was on these sources of suspicion be- 
cause almost all of the subjects’ expressed doubts 
had to do with the confederate. There was very 
little questioning of the film or the identification in- 
structions, as if the cover story dealing with “social 
perception” and the effects of close attention to the 
movie had been almost completely accepted. Con- 
sistent with this, there was more suspicion in the 
control condition than in the two identification 
groups. Our impression also was that the subjects 
were not reluctant to voice their suspicions if they 
believed they had a good basis for their doubts and 
thought their interpretation was not silly.


Following the final interview, all subjects were 
fully informed of the nature of the deception, and
were asked to avoid discussing the experiment with
other students until the end of the semester. 


RESULTS
Electric Shocks


The main dependent measure, based on the
response that the subjects were set to per-
form, was the number of shocks given
to the confederate for each [idea]. [...]
[Since two confederates had]
been employed, the [confederate]
used was kept as a separate term in the
analysis. We can see, however, that this varia-
tion did not affect the outcome, and it will be
disregarded after this. The particular idea
evaluated did produce significant differences
in the number of shocks [...] (the
second proposal drew the [...]
while the third [...]
shocks). However, since [there was no]
interaction with any of the experi-
mental treatments, it will also be disregarded
from here on.
The only significant effect of real interest 
to us in Table 1 stems from the identification 
manipulation. Looking at the top half of 
Table 2 for the condition means, we see that 
the provoked subjects administered reliably
more shocks to their tormentor if they had
identified with the fight victor, Dunne, than
if they had imagined themselves to be the
fight judge or if they had not been given
any imagine-self instructions.


TABLE 2
MEAN SHOCK MEASURES IN EACH CONDITION

                                 Identificand
Measure                  Dunne       Judge         Control
Shock number
  Think “hit”            2.66[?]     [illegible]   2.26[?]
  No think               2.50[?]     2.45ab        2.32[?]
  Mean                   2.58b       2.33a         2.32a
Total duration (sec.)
  Think “hit”            4.80ab      3.16[?]       3.39[?]
  No think               5.29[?]     3.68[?]       3.83[?]
  Mean                   5.05b       3.42a         3.62a

Note. Cells sharing the same subscript are not significantly
different at the .05 level by the Duncan multiple-range test.
Each mean is based on 13 cases. The shock number refers to
the mean for each of the five ideas evaluated.


Because of the specific a priori hypothesis, we also examined
the differences among the six experimental 
groups. The Duncan multiple-range test 
yielded some support for our initial expecta- 
tion: The people identifying with Dunne who 
had been also encouraged to verbalize Dunne’s 
hits implicitly (by their button pressing) gave 
significantly more shocks to the confederate 
than any other group in the think “hit” condi- 
tion. As the table also suggests, the button- 
pressing requirement apparently dampened 
the aggressive aftereffects of the movie vio- 
lence in the judge and control groups. These 
latter men may not have had aggressive
thoughts when they pushed the button; it
was not they who were hitting someone (if
only in their fantasy). If this post hoc analy- 
sis is correct, the button-pushing implicit 
verbalization should lead to enhanced aggres- 
sion only in the case of the men thinking of 
themselves as Dunne (the movie aggressor). 
The total duration of all of the shocks given 
by the subject was also recorded. As with the 
number of shocks, the only significant effect 
was for the identification variable (F = 5.32, 
p < .01). Looking at the bottom half of Ta- 
ble 2, we see, as expected, that the longest 
shocks were given by the men identifying 
with Dunne. There were no reliable differences 
within the think “hit” variations. The sub- 
jects, thinking of themselves as Dunne, who 
were also required to press the button with 


i each hit gave somewhat longer shocks than 


the people in the two other think “hit” groups, 
but their shocks were inexplicably somewhat 
shorter than those administered by the sub- 
jects in the Dunne-identifying—no-think “hit” 
group.

Although the button-pushing manipulation
did not produce clear condition differences in 
aggression, other findings provide some tenta- 
tive support for our post hoc suggestion re- 
garding the importance of thinking of one’s 
self as hitting. The data summarized in Table 
3 indicate that the three groups required to 
do the button pushing paid equal attention to 
the film violence; they counted about the same 
number of blows. But, as our previously pro- 
posed reinterpretation of this manipulation 
implies, the number of button pushes was sig- 
nificantly and positively correlated with the 
number of shocks subsequently administered
to the confederate only in the Dunne identi-
fication condition. It was only when the men
imagined themselves hitting the victim that a
higher number of button pushes (and pre-
sumably, more thoughts of hitting) led to
the greater number of attacks upon the in-
sulting confederate.


TABLE 3

MEAN NUMBER OF BUTTON PUSHES AND CORRELATIONS
BETWEEN NUMBER OF BUTTON PUSHES AND SHOCKS

                      Identificand
Variable        Dunne     Judge     Control
No. pushes      65.31     65.31     69.23
r                .54*      .09       .02

* p = .05.
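
The within-condition correlations in Table 3 are ordinary product-moment correlations between each subject's button-push count and his shock count. A minimal sketch of that computation, and of the usual t statistic for testing a correlation, is given below; the function names are hypothetical, and the article does not state which test was applied.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def t_from_r(r, n):
    """t statistic on n - 2 degrees of freedom for a correlation r."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


# The Dunne-condition value from Table 3: 13 subjects pressed the button.
t_dunne = t_from_r(0.54, 13)  # roughly 2.13 on 11 degrees of freedom
```

With only 13 subjects per cell, a t of about 2.13 on 11 degrees of freedom lies close to the conventional .05 boundary, which fits the table's "p = .05" annotation.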


Questionnaire Measures 


Several questionnaire items, all completed 
at the end of the session after the shocks were 
given, corroborated the findings obtained with 
the electric shocks. As indicated earlier, some 
of the questions attempted to assess the effec- 
tiveness of the identification manipulation. 
One item yielded a significant main effect for 
the identification variation and no other sig- 
nificant terms: “Which of the film characters 
do you feel should have won the fight?” The 
condition means presented on line 1 of Table 
4 show the expected ordering, with the 
men asked to imagine themselves as Dunne 
expressing a reliably stronger preference for 
Dunne than the subjects in either the judge 
or control groups. Tangentially, we can also 
note the generally favorable disposition to- 
ward Dunne rather than his opponent, Kelly, 
in all three conditions; since this was a 15- 
step scale, all three means are definitely on 
the pro-Dunne side, suggesting that the movie 
introduction had indeed, as intended, cast 
Kelly in a poor light. 

The second line of Table 4 reports the re- 
sults with the only mood cluster revealing a 
significant main effect for identification. The 
people thinking of themselves as Dunne rated 
themselves as feeling definitely less forgiving— 
kindly—affectionate than the men identifying 
with the judge. They were also somewhat less 
forgiving (etc.) than the subjects in the con-
trol group. We cannot say for sure, of course,
whether this hostile mood in the Dunne con-
dition was created by the identification treat-
ment, the great number of shocks the men
had given, or a combination of these, but the
remaining results obtained with the question-
naire ostensibly intended for the Psychology
Department indicate that the hostility was
also directed toward the experimenter and
the experiment.


TABLE 4

IDENTIFICATION CONDITION MEANS ON QUESTION-
NAIRE ITEMS YIELDING SIGNIFICANT
IDENTIFICATION EFFECTS

                                         Identificand            Main effect
Questionnaire item                  Dunne     Judge     Control    F ratio
1. Who should have won the fight? a 1.96[?]   4.58[?]   4.16[?]    5.11**
2. Forgiving-Kindly-Affectionate b  6.73[?]   8.88a     7.96[?]    3.04**
3. How relaxed was E? c             5.46[?]   4.12[?]   4.19[?]    3.30**
4. How well did E instruct?         4.19[?]   2.58[?]   2.35[?]    5.64***
5. Experiment worthwhile?           4.65[?]   3.46[?]   3.46[?]    2.69*

Note. Each mean is based on 26 cases. Cells having a
common subscript are not significantly different by Duncan’s
multiple-range test at the .05 level. The F ratios are based on
df = 2/72.

a The lower the number, the more strongly the subject
believed Dunne, rather than Kelly, should have won the fight.

b For this mood cluster, the higher the score, the more the
items “definitely apply” to the subject’s mood.

c For this and the following items, the higher the score on
the [...], the more unfavorable the evaluation.

** p < .05.
*** p < .01.

As shown on line 3, the experimenter was 
rated as being less relaxed in the Dunne con- 
dition than in the judge and control groups. 
Remember, however, that the experimenter 
did not know the identification treatment to 
which the subjects had been assigned, so that 
he was unlikely to be more anxious in one 
condition than in another. The people identi- 
fying with the film aggressor were probably 
expressing their hostility toward the experi-
menter rather than describing his behavior.
This ill feeling is shown more clearly in the 
next two measures. The Dunne subjects said 
that the experimenter delivered his instruc- 
tions less well, and thought the experiment 
was a less worthwhile scientific endeavor, 
than the people in the judge and control con- 


ditions. 
DISCUSSION


Several considerations argue against any 
simple “demand characteristics” explanation 
of the shock results. Most important of all, 
the experimental variations were established 
by means of written instructions transmitted 
to the subjects, with the experimenter being 
blind to the particular instructions. But even 
if the experimenter learned what the condition 
was after it had been created, this probably 
would not influence the outcome. We pointed 
out earlier that our recording equipment
clicked each time a subject in the think “hit”
condition pressed his button, clearly telling
the experimenter that the subject was exposed
to this think “hit” treatment. Yet, even
though the experimenter was more definitely
aware of this particular variation, it led to
much weaker effects than the less patent
identification manipulation.

The questionnaire evaluations ostensibly
sent to the Psychology Department also go
against the demand interpretation. How can
we account for the negative ratings? For example,
suppose the unfavorable ratings of the
experiment and experimenter in the Dunne
condition were manifestations of a general
hostility produced by identification with the
movie aggressor. The unforgiving mood expressed
by the subjects in this group could
reflect this hostility. Assuming it existed, this
general hostility undoubtedly would have
dampened the subjects’ desire to comply with
the experimenter’s wishes (if they knew
what he wanted them to do). On the other 
hand, what if the unfavorable evaluations 
were the subjects’ reactions to the great num- 
ber of shocks they had administered? The ini- 
tial electrical aggression probably would have 
led to this later hostile condemnation of the 
investigation if the subjects felt guilty about 
their earlier behavior or thought their aggres- 
sion was unjustified (Brock & Pallak, 1969). 
But surely this guilt or dissonance would not 
have arisen if the subjects were merely com- 
plying with the experimenter’s “demands.” 

Then, too, there is no reason to think that 
the subjects in the Dunne condition had 
caught on to the experimenter’s deceptions or 
knew what he expected them to do. Only two 
subjects were discarded in the Dunne group 
because of their strong suspicions, as opposed
to two in the judge condition and three in the 
control group. Moreover, when the experi- 
menter rated the strength of the subjects’ 
suspicions on the basis of his postexperiment 
interviews, he found that the mean ratings 
were virtually the same in all three condi- 
tions: the means were .75, .82, and .84, on 
the 0-2 scale for the Dunne, judge, and con- 
trol groups, respectively. Even if the Dunne 
subjects had caught on, however, would they 
necessarily have been more aggressive? Earlier 
research in the Wisconsin program suggests 
that our college student subjects are generally
more interested in having the investigator be- 
lieve that they have a “good” personality than 
in confirming his hypotheses. Consequently, 
since we frequently think ill of people who act 
aggressively, the more subjects suspect that 
the researcher is observing their aggression, 
the less aggression they tend to exhibit.⁶ All
in all, the indications are that we probably 
obtained our results in spite of any suspicions 
rather than because of them. 

Accepting the findings, then, we believe 
they lend additional support to Berkowitz’s 
classical conditioning model of media vio- 
lence effects. There is no doubt that observed 
violence sometimes teaches a lesson. The wit- 


6 Data from a manuscript in preparation by
Berkowitz, Alioto, and Turner are illustrative. 
In this study angry male college students watched 
either our standard prize fight film or an equally 
long football movie after having been told that the 
filmed contest winner(s) had wanted to hurt the 
opponent (film has aggressive meaning) or that the 
contestants had a neutral attitude toward each other 
(nonaggressive meaning). Each subject was judged 
to be either nonsuspicious or aware of the experi- 
mental deceptions on the basis of his answers to a 
postexperimental interview. We found that the sub- 
jects’ awareness of the deceptions and purpose of 
the study was associated with a decreased number 
of attacks on the available target person: 65% of 
the men in the prize-fight-aggressive-meaning condi- 
tion who had any awareness of the deceptions and 
the purpose of the study gave fewer shocks than the 
mean given by the nonsuspicious subjects in this 
condition, while fully 867% of the similarly aware 
men in the football-aggressive-meaning group gave 
fewer shocks than their nonaware counterparts. In 
general, the more definitely aware the subjects were 
of the deceptions, the less aggressive they tended 
to be. The significant results obtained in this study 
seemed to emerge in spite of the subjects’ suspicions,
not because of them.
nessed event may say that aggression can be
rewarding and can persuade the viewers, at 
least momentarily, that aggression pays. Some 
might interpret the present findings in just 
this way; the subjects’ identification with the 
film aggressor had helped to remind them how 
gratifying it can be to attack one’s tormentor. 
The questionnaire results suggest, however, 
that the consequence was not this specific; the 
men imagining themselves to be the aggres- 
sive model also felt resentful generally and 
were critical of the experiment and experi- 
menter, as if a fairly diffuse aggressive ten- 
dency had been evoked in them. We believe 
that the identification with the observed ag- 
gressor had caused the viewers to react more 
impulsively to the depicted violence; the ag- 
gressive scene was a stimulus evoking the 
aggressive responses that the observers were 
disposed to make (Berkowitz, 1970). Factors 
independent of any lesson taught by the mod- 
els can influence the likelihood that aggressive 
responses will appear in open behavior, such 
as the label-mediated association between the 
available target and the observed aggression 
victim (cf. Berkowitz & Geen, 1967), the
viewers’ anger toward the possible target 
(Geen & Berkowitz, 1967), and even the gen- 
eral arousal level within the observers (Geen 
& O’Neal, 1969). The present study adds to 
this list: The screen stimulus is more apt to 
evoke overt aggression if the viewers imagine 
themselves to be the aggressive models as they 
watch their violent actions on the screen. 
RELIGION AND HUMANITARIANISM IN EYSENCK’S
T DIMENSION AND LEFT-RIGHT 
POLITICAL ORIENTATION 


JAMES DeFRONZO 1


Indiana University 


Religious and humanitarian values were measured in 118 students by means 
of fragmentation of Eysenck’s T dimension. The students’ left-right political 
orientations were also ascertained. It was hypothesized that high traditional 
religious orientation would be associated with political conservatism and that 
high humanitarian orientation would be associated with political radicalism. 
This hypothesis was supported. It was also found that the relative strength
of humanitarian and religious orientations within individuals was more closely 
associated with political orientation than either religion or humanitarianism 
alone. A slight negative relation was observed between religious and humani- 
tarian orientations. Theoretical background for the findings is discussed. 


In attempting to identify basic personality 
similarities between fascists and communists, 
Eysenck (1954) used factor analysis to discover
two orthogonal dimensions, radicalism-
conservativism (R factor) and tender-minded- 
ness—tough-mindedness (T factor). Eysenck 
hypothesized that fascists and communists 
would score at opposite ends of the R dimen- 
sion (the scores being derived from 26 of the 
items in Eysenck’s 40-item questionnaire), 
but would produce similar scores along the 
T dimension (the T score being derived from 
the remaining 14 items). Christie (1956) re- 
ported a study performed by Coulter which 
yielded higher tough-mindedness for both communists
and fascists when compared to the results
for political moderates. Eysenck (1954)
reported a study which indicated the com- 
munists were more tough-minded than con- 
servative, liberal, or socialist party members. 
Unfortunately, the “raw” data for Coulter’s 
study were not reported; however, item-by- 
item results were reported by Eysenck. In re- 
analyzing Eysenck’s data, Rokeach and Hanley 
(1956) were able to conceptually separate the 
14 items of Eysenck’s T dimension into 6 
items that seemed to focus on a “traditional 
religious” orientation and 8 items that seemed 
to center on “humanitarianism.” They then 
made the discovery that the communists had 
scored the most tough-minded of all four par- 


1 Requests for reprints should be sent to James 
DeFronzo, Box 109, Indiana University, 2101 Coli- 
seum Blvd., Fort Wayne, Indiana 46805. 


ties on the 6 religious items (most anti- 
traditional religious orientation), but the most 
tender-minded (most humanitarian) on the 8 
humanitarian items. When a total T score 
had been summed by Eysenck for the com- 
munists, their extremely and consistently anti- 
traditional responses to the religiously ori- 
ented questions outweighed their tender- 
minded responses to humanitarian questions 
and left them with a final summed score indi- 
cating that they had the highest level of 
tough-mindedness among the four sets of
subjects. 

In light of the results of the Rokeach and 
Hanley reanalysis of Eysenck’s data, Brown 
(1965) suggested that a similarity in tough- 
mindedness between fascists and communists 
(in terms of total T score) might be superficial
because a total T score cannot indicate
that the communists might be scoring “tough” 
on religion and “tender” on humanitarianism, 
while fascists might be scoring “tender” on 
religion and “tough” on humanitarianism.
Brown backed up his suggestion by remind- 
ing us of a serious methodological flaw in 
Eysenck’s research procedure: Eysenck per- 
formed the factor analysis which yielded the 
R and T dimensions on 750 conservative, 
liberal, and socialist party members and then 
later assumed unity of factors for communists 
and fascists. A factor analysis of Eysenck’s 
40-item-schedule results after administration 
to a population including equal proportions 
of fascists, conservatives, liberals, socialists, 
and communists might yield three factors
(radicalism, religion, and humanitarianism),
instead of two.

Using Brown’s suggestion as a starting 
point, I reviewed Eysenck’s (1954) data and 
noticed a rather obvious indication that increasing
levels of tough-mindedness on religious
items tended to be associated with a
right-to-left progression along the political 
spectrum, while the reverse relation was evi- 
denced between humanitarian items and the 
political dimension. This observation led me to 
the following questions: Is there a general 
positive relationship between humanitarian 
orientation and political radicalism, but a 
general negative relationship between a tradi- 
tional religious orientation and political radi- 
calism? What is the relationship between 
humanitarianism and traditional religiosity 
(as measured by Eysenck’s 14 T-dimension
items)? Is there a relationship between political
radicalism and the way individuals
are oriented to both humanitarianism and 
religion? 


HYPOTHESES 


My first hypothesis derives from the more 
recent reexamination of Eysenck’s data and 
proposes the following answer to the initial 
question asked above: 

1. There is a positive relationship between
humanitarianism and the tendency for political
radicalism, and there is a negative relationship
between traditional religiosity and
political radicalism.

My second hypothesis concerns the rela- 
tionship between humanitarianism and tradi- 
tional religiosity. The reanalysis of Eysenck’s 
data as presented by Roger Brown (1965) 
might lead one to expect a simple negative 
relationship. However, Allport (1954), in his 
investigation of racial prejudice in the United 
States, found a curvilinear relationship be- 
tween frequency of attendance at religious 
services and the level of racial prejudice in 
the population in that the highest level of 
racial prejudice was associated with irregular 
church attendance, while relatively low levels 
of racial prejudice were associated with no 
attendance and regular attendance. Allport 
concluded that many people simply use reli- 
gious services “instrumentally” to maintain 
a respectable image in the community (irregular
attendance) and consequently had no
use for the humanitarian elements of religious 
ideology; hence, they had higher levels of 
racial prejudice than both nonattenders (who 
may be alienated from institutional religion 
because of its lack of “serious” concern for 
humanitarianism) and the regular attenders 
(the religiously devout who accept and believe 
in the total ideology of their religion because 
of an “intrinsic satisfaction” with their reli- 
gious experience). Since the measure of hu- 
manitarianism includes two items concerning 
racial prejudice, I suggest a curvilinear rela- 
tionship between humanitarianism and reli- 
giosity in the following hypothesis: 

2. Both low and high levels of religiosity
will be associated with high levels of
humanitarianism, but intermediate religious
orientation will be associated with low
humanitarianism.

Finally, since I am assuming a positive 
relationship between political radicalism and
humanitarianism and a negative relationship
between traditional religiosity and political 
radicalism, I have also assumed that humani- 
tarian and traditional religious orientations 
must be, in some way, reflective of opposing 
social forces. Thus, I suggest that an indi- 
vidual’s political orientation is a function of 
the relative strengths of his humanitarian and 
traditional religious orientations. Or, in other 
words: 

3. Political rightist tendencies will be asso- 
ciated with a greater traditional religious 
orientation than humanitarian orientation, 
and political leftist tendencies will be asso- 
ciated with a greater humanitarian than 
religious orientation. 


METHOD 
Subjects 


The subjects were 118 introductory sociology stu- 
dents at a midwestern university who were admin- 
istered a brief questionnaire as part of a prepara- 
tion for the study of the sociology of religion and 
political sociology. Approximately 95% of the stu- 
dents were freshmen, and the rest were sophomores, 
juniors, and seniors. Approximately three-fifths of
the students were female, and one-fourth were 
Catholic, while the remaining three-fourths were 
Lutherans, Baptists, Methodists, and members of 
several other Protestant denominations.


Measurement


In order to measure the subjects with respect to 
their humanitarian and religious orientations, each 
was asked to answer a questionnaire containing the 
14 items originally identified by Eysenck as the 
tender-mindedness-tough-mindedness factor. Each of 
the questions was followed by a five-category re- 
sponse sequence: strongly agree, agree, undecided, 
disagree, strongly disagree. Depending on which re- 
sponse he checked, a subject received a score from 
0 to 4. The scores from the 6 religious orientation 
items were combined and averaged to give each 
individual a traditional religious orientation score. 
The same procedure was followed for the 8 humani- 
tarian items to obtain a humanitarian orientation 
score for each person. The 14 items² taken from
Eysenck were: 


— H-1 Colored people are innately inferior to white 
people. 

— H-2 War is inherent in human nature. 

— H-3 Persons with serious hereditary defects and 
diseases should be compulsorily sterilized. 

+ H-4 In the interests of peace, we should give up 
part of our national sovereignty. 

— R-1 Sunday observance is old-fashioned and 
should cease to govern our behavior. 

+ H-5 It is wrong that men should be permitted 
greater sexual freedom than women by 
society. 

— H-6 Conscientious objectors are traitors to their 
country and should be treated accordingly. 

— R-2 The laws against abortion should be abol- 
ished. 

+ R-3 Only by going back to religion can civiliza- 
tion hope to survive. 

— R-4 Divorce laws should be altered to make 
divorce easier. 

+ R-5 It is right and proper that religious education 
should be compulsory. 

− R-6 Men and women have the right to find out
whether they are sexually suited before mar- 
riage (i.e., by living together for a while be- 
fore marriage). 

+ H-7 The death penalty is barbaric and should 
be abolished. 

— H-8 The Japanese are by nature a cruel people. 
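
The scoring rule for these items can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sign for each item is read from the list above, the 0 to 4 response values follow the scoring footnote, and the names `ITEM_SIGNS`, `score_item`, and `orientation_scores` are hypothetical helpers, not part of the original study materials.

```python
# Response categories in the order given on the questionnaire.
RESPONSES = ["strongly agree", "agree", "undecided", "disagree", "strongly disagree"]

# Sign printed before each item ("-" scores strongly agree as 0, "+" as 4).
ITEM_SIGNS = {
    "H-1": "-", "H-2": "-", "H-3": "-", "H-4": "+", "H-5": "+",
    "H-6": "-", "H-7": "+", "H-8": "-",
    "R-1": "-", "R-2": "-", "R-3": "+", "R-4": "-", "R-5": "+", "R-6": "-",
}


def score_item(item, response):
    """Return the 0-4 score for one response to one item."""
    position = RESPONSES.index(response)  # 0 = strongly agree ... 4 = strongly disagree
    return position if ITEM_SIGNS[item] == "-" else 4 - position


def orientation_scores(answers):
    """Return (religious mean, humanitarian mean) for a dict of item -> response."""
    religious = [score_item(i, a) for i, a in answers.items() if i.startswith("R-")]
    humanitarian = [score_item(i, a) for i, a in answers.items() if i.startswith("H-")]
    return sum(religious) / len(religious), sum(humanitarian) / len(humanitarian)


if __name__ == "__main__":
    # Hypothetical respondent who strongly agrees with every statement.
    answers = {item: "strongly agree" for item in ITEM_SIGNS}
    print(orientation_scores(answers))
```

Averaging within each item set keeps both scores on the same 0 to 4 interval even though the two sets differ in length (6 religious items, 8 humanitarian items).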


A simple measure of political orientation was con- 
structed in the following manner. Before responding 
to the 14 items listed above, the students were asked 


2 A negative sign appearing before a statement
indicates the following response-scoring sequence:
strongly agree (0), agree (1), undecided (2), disagree
(3), strongly disagree (4). A positive sign preceding
a question indicates the reverse scoring sequence:
strongly agree (4), agree (3), undecided (2),
disagree (1), strongly disagree (0). The 14 items
are listed in the same order as they appeared on the
questionnaire, and each humanitarian question is preceded
by an H, while each religious item is preceded
by an R.
to circle one of five possible president-vice-president
slates as if they were voting in a national election. 
The slates contained candidates associated with dif- 
ferent positions along a right-left political continuum. 
Immediately following the 14 religious and humani- 
tarian orientation questions was a question requiring 
the respondent to circle his preference from among 
seven political parties. After this question, the sub- 
jects were again asked to select one of several slates 
of political candidates (slightly different from the 
first set of slates) as if they were voting in a na- 
tional election. The three political orientation ques- 
tions were: 


1. Please circle the ticket of your choice as if you 
were voting in a national election. Please be sure to
make one, and only one, choice and do not write in 
other candidates’ names or cross tickets by selecting 
a presidential candidate from one slate and a vice- 
presidential candidate from another. 


    President / Vice-president

(a) George Wallace / Curtis LeMay
(b) Richard Nixon / Henry Cabot Lodge
(c) Edmund Muskie / Abraham Ribicoff
(d) Eugene McCarthy / John Lindsay
(e) Julian Bond / David Dellinger


2. Please circle the political party of your choice 
as if you were voting to give it the power to imple- 
ment its particular political philosophy. 


American Nazi Party
American Independent Party
Republican Party
Democratic Party
Social Democratic Party
Socialist Labor Party
American Communist Party


3. Please circle the ticket of your choice as if you 
were voting in a national election. 


    President / Vice-president

(a) George Wallace / J. Edgar Hoover
(b) Richard Nixon / Mark Hatfield
(c) Edward Kennedy / Birch Bayh
(d) Julian Bond / David Dellinger


The most right-wing selection for each question 
yields a score of zero, while the most left-wing choice 
in each of the three questions yields scores of 4, 6, 
and 3, respectively. If an individual selected the two 
Wallace tickets and the American Nazi Party, he 
received the extreme conservative score, zero. If an 
individual circled the Bond ticket twice and the 
American Communist Party, he received the extreme 
radical score of 13. (An individual voting for the
Nixon tickets and the Republican Party received a
score of 4; an individual voting for the Muskie
ticket in the first question and the Kennedy ticket
in the last question and selecting the Democratic 
Party received a score of 7.) 
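
As an illustration, the political orientation score can be sketched as a sum of option positions. The text states only the extreme scores and two worked examples, so scoring each intermediate option by its position in the right-to-left ordering is an assumption (one that reproduces all four stated totals). The list and function names below are hypothetical.

```python
# Options for each question, ordered from most right-wing (score 0) to most left-wing.
# Slates are identified here by their presidential candidate.
SLATES_Q1 = ["Wallace", "Nixon", "Muskie", "McCarthy", "Bond"]
PARTIES_Q2 = [
    "American Nazi Party",
    "American Independent Party",
    "Republican Party",
    "Democratic Party",
    "Social Democratic Party",
    "Socialist Labor Party",
    "American Communist Party",
]
SLATES_Q3 = ["Wallace", "Nixon", "Kennedy", "Bond"]


def political_score(q1, q2, q3):
    """Sum the positional scores of the three choices (assumed scoring; range 0-13)."""
    return SLATES_Q1.index(q1) + PARTIES_Q2.index(q2) + SLATES_Q3.index(q3)


if __name__ == "__main__":
    # The four cases described in the text.
    print(political_score("Wallace", "American Nazi Party", "Wallace"))
    print(political_score("Nixon", "Republican Party", "Nixon"))
    print(political_score("Muskie", "Democratic Party", "Kennedy"))
    print(political_score("Bond", "American Communist Party", "Bond"))
```

Because the three questions have different numbers of options, the party question (maximum 6) contributes more to the total than either slate question (maximum 4 and 3).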
RESULTS
Hypothesis 1 


By sorting all the subjects into four politi- 
cal categories, right (scores of 0-3), right of 
center (4-6), left of center (7-8), and left 
(9-13), the mean religious and humanitarian 
orientation scores were obtained and are pre- 
sented in Table 1. 

In general, the mean scores progress in the 
predicted direction. The correlation coefficient 
for religious orientation and political radicalism
is −.25, which is significant at the .01
level (F = 7.73, df = 1/116, though only
6.25% of the political variation is explained).
The correlation coefficient between humanitarian
orientation and political radicalism is
.40, which is significant at the .001 level
(F = 21.13, df = 1/116, and 16% of the political
variation is explained).³ Hypothesis 1,
then, is supported. 
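
The link between each correlation and its F ratio can be sketched with the standard identity F = r²(N − 2)/(1 − r²), with df = 1/(N − 2). The function name `f_from_r` is hypothetical, and because the reported correlations are rounded to two decimals, values computed this way will be close to, but not always identical with, the F ratios given in the text.

```python
def f_from_r(r, n):
    """F ratio (df = 1, n - 2) for testing a Pearson correlation r based on n cases."""
    r_squared = r * r
    return r_squared * (n - 2) / (1 - r_squared)


def variance_explained(r):
    """Proportion of variation explained by a linear relationship with correlation r."""
    return r * r


if __name__ == "__main__":
    n = 118  # number of subjects in the study
    for r in (-0.25, 0.40):
        print(r, round(variance_explained(r), 4), round(f_from_r(r, n), 2))
```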


Hypothesis 2 


It was hypothesized that a curvilinear rela- 
tionship existed between religious orientation 
and humanitarian orientation. In order to test 
this hypothesis, both a correlation coefficient
(which tests for a linear relationship) and a
correlation ratio (which tests for curvilinearity)
were computed.⁴ The correlation coefficient
between religious orientation and humanitarian
orientation is −.20, which is
significant at the .05 level (F = 4.62, df =
1/116, and 4% of the variation in humanitarian
orientation is explained). The correlation
ratio $\eta^2_{HR}$ indicates that 5.2% of the variation in humanitarian
orientation is explained by religious orientation.
Since the additional amount of variation ex- 
plained by the correlation ratio is only 1.2% 


3 The multiple correlation $R^2_{P \cdot HR}$ (the amount of
variation in political orientation explained by both
humanitarian and religious orientations) is 19.1%.

4 Before testing Hypothesis 2, the religious and
humanitarian orientation scores for each individual
were averaged by dividing the total religion score for
each person by 6 (the number of religious orientation
questions) and by dividing the total humanitarian
score for each person by 8 (the number of
humanitarian orientation questions). Thus, the averaged
religious and humanitarian scores for each
individual both ranged along the same interval (0-4).

5 The correlation ratio $\eta^2_{HR}$ between humanitarian
orientation and religious orientation was obtained by
dividing the subjects into eight sets on the basis of
their religious orientation scores (Set 1 included
all subjects with religious orientation scores in
TABLE 1


MEAN RELIGIOUS AND HUMANITARIAN ORIENTATION
SCORES FOR SUBJECTS IN EACH OF FOUR
CATEGORIES OF POLITICAL ORIENTATION


Value           | Right | Right of center | Left of center | Left
Religion        | 2.119 | 2.345           | 1.874          | 1.606
Humanitarianism | 1.833 | 2.274           | 2.323          | 2.874
n               | 10    | 57              | 41             | 10


of the total humanitarian variation, Hypothesis
2 is not supported. The relationship between
religious orientation and humanitarian
orientation, as indicated by the data, is best 
described as linear, negative, and slight. 
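
The correlation ratio used to test Hypothesis 2 can be sketched as the between-set sum of squares divided by the total sum of squares, after sorting subjects into eight sets of width .5 on their religious orientation scores. This is a minimal sketch, not the author's computation: the function name, the handling of scores that fall exactly on a set boundary, and the example data are all assumptions.

```python
def correlation_ratio(x, y, width=0.5, n_sets=8):
    """Eta squared of y on x, with x grouped into n_sets intervals of the given width."""
    groups = {}
    for xi, yi in zip(x, y):
        # Assumed boundary rule: a score on a boundary goes to the higher set,
        # except the maximum score, which stays in the last set.
        key = min(int(xi / width), n_sets - 1)
        groups.setdefault(key, []).append(yi)

    grand_mean = sum(y) / len(y)
    ss_total = sum((yi - grand_mean) ** 2 for yi in y)
    if ss_total == 0:
        raise ValueError("y has no variation, so the correlation ratio is undefined")
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
    )
    return ss_between / ss_total


if __name__ == "__main__":
    # Hypothetical averaged scores (0-4) for six subjects.
    religious = [0.3, 0.4, 1.8, 2.1, 3.6, 3.9]
    humanitarian = [3.0, 2.5, 2.0, 2.2, 1.5, 2.0]
    print(round(correlation_ratio(religious, humanitarian), 3))
```

The evidence for curvilinearity is the amount by which this ratio exceeds the squared correlation coefficient; in the data reported above that increment is 5.2% − 4% = 1.2%.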


Hypothesis 3 


In testing Hypothesis 3, two statistical 
tests were employed. First, the subjects were
divided into those who had a higher religious
orientation than humanitarian scores (this set
of subjects were designated R subjects) and 
those who had higher humanitarian orienta- 
tion than religious orientation scores (this set 
of subjects were designated H subjects). The 
subjects were then dichotomized (on a con- 
ceptual basis consistent with the categories 
used in Table 1) into rightists (political ori- 
entation scores in the interval 0-6) and left- 
ists (political orientation scores in the interval 
7-13). The results are presented in Table 2. 

This analysis lends support to Hypothesis 3. 
A second analysis was performed in the fol- 
lowing manner. Each subject’s averaged reli- 
gious orientation score was subtracted from 
his average humanitarian orientation score, 


the interval 0-.499; Set 2 = .500-1.000, ..., Set 8
= 3.500-4.000). The mean humanitarian orientation 
score was computed for each set and used to con- 
struct the correlation ratio. 


TABLE 2


POLITICAL ORIENTATION AS A FUNCTION OF THE
RELATIVE STRENGTHS OF RELIGIOUS (R)
AND HUMANITARIAN (H) ORIENTATIONS
IN SUBJECTS


            Political orientation
Subjects  | Right | Left | No. cases
R         | 82%   | 18%  | 54
H         | 41%   | 59%  | 64
No. cases | 69    | 49   |


Note. χ² = 18.8, df = 1, p < .001. Total N = 118.
and the resulting difference score was run
against political orientation. Through this 
procedure a correlation coefficient of .46 was 
obtained between the difference scores and 
political radicalism, which is significant at the 
.001 level (F = 31.00, df = 1/116, and 21% 
of the political variation is explained, more 
than either religious orientation or humanitarian
orientation alone). This analysis also
supports Hypothesis 3. 
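
Both analyses of Hypothesis 3 can be sketched as follows. This is an illustration only: the example subjects are hypothetical, the function names are invented for the sketch, and assigning a subject with exactly equal scores to the H set is an assumption, since the text does not say how ties were handled.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def classify(humanitarian, religious, political):
    """First analysis: R or H subject, crossed with rightist (0-6) or leftist (7-13)."""
    group = "R" if religious > humanitarian else "H"  # ties go to H (assumption)
    side = "right" if political <= 6 else "left"
    return group, side


def difference_score(humanitarian, religious):
    """Second analysis: averaged humanitarian score minus averaged religious score."""
    return humanitarian - religious


if __name__ == "__main__":
    # Hypothetical subjects: (humanitarian mean, religious mean, political score).
    subjects = [(1.0, 3.0, 1), (2.0, 2.5, 4), (3.0, 1.5, 8), (3.5, 1.0, 12)]
    diffs = [difference_score(h, r) for h, r, _ in subjects]
    politics = [p for _, _, p in subjects]
    print([classify(h, r, p) for h, r, p in subjects])
    print(round(pearson_r(diffs, politics), 3))
```

The difference score uses the full range of both orientations, which is one reason it can explain more political variation than the dichotomized R versus H classification.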


INTERPRETATION 


Durkheim (1965) observed that religion 
performs a key role in society in maintaining 
social solidarity by “sacredizing” the social
structure. Marx (Marx & Engels, 1964) em- 
phasized the essentially conservative nature 
of the function of religion in his characteriza- 
tion of religion as the “opiate of the people.” 
My findings with regard to Hypothesis 1 indi- 
cate that a traditional religious orientation 
toward matters such as Sunday observance, 
divorce, religious education, sex behavior, 
abortion, and the dependence of the welfare 
of civilization on religion is significantly 
associated with political conservativism. 

On the other hand, one should expect to 
find that people who have a highly favorable 
orientation to mankind in general (including 
low racial prejudice, a relatively high evalua- 
tion of “human nature” and the importance 
of human life, and a positive orientation 
toward equality of treatment) will react 
against any “injustices” of the status quo by 
opting for social change. My results indeed 
indicate that high humanitarian orientation is 
a good predictor of political leftist orienta- 
tion. Further, it should be noted that though 
political conservativism tends to be positively
associated with a traditional religious orienta- 
tion, and political radicalism tends to be 
positively associated with a humanitarian ori- 
entation, humanitarian orientation explains 
about two and one-half times as much political 
variation as religious orientation. 

Analysis of the data yielded a slight but 
statistically significant negative relationship 
between religious and humanitarian orienta- 
tions. Allport (1967) had found a curvilinear 
relationship between attendance at religious 
services and racial prejudice (relatively high 
racial prejudice for irregular attendance; 
relatively low prejudice for no attendance and
regular attendance), which led me to expect 
that I would find a similar relationship be- 
tween religious and humanitarian orientations. 
It should be noted, though, that racial preju- 
dice was only one of several components of my 
measure of humanitarian orientation and, 
more importantly, that Allport used as his 
independent variable religious behavior (at- 
tendance) while I used religious attitudes. 

In testing Hypothesis 3, results were ob- 
tained which indicated a significant relation- 
ship between the relative strengths of the 
humanitarian and religious orientations in the 
individual subjects and their political tend- 
encies. That is to say, I found evidence that 
it is not the absolute strength of either the 
religious or humanitarian orientation that af- 
fects political orientation, but rather the 
strength of humanitarian orientation relative 
to the strength of traditional religious orientation.
Hence, just as institutional religious
values and humanitarian values seem to op- 
pose each other in the cultural system, so too 
they seem to constitute conflicting elements 
socialized into the subjects. The results lead 
me to conclude that when humanitarian orien- 
tation is stronger than traditional religious 
orientation, one should expect political leftist
tendencies; when humanitarian and religious
orientations are about equal in strength, political
moderation; and when traditional religious
orientation is the stronger, political
conservativism.
A series of four experiments was carried out in an attempt to understand a
curvilinear relationship found between the frequency of stimulus exposure and 
affective ratings. The inverted-U function was suggestive of satiation effects. 
Preliminary experiments indicated that stimulus discriminability may work to 
attenuate the relationship between exposure and affect, but further experiments 
using a variety of stimuli failed to confirm the initial findings. The possibility 
was examined that for some stimuli, satiation effects occur early in the exposure 
series, and if they occur at all for other stimuli, they do so only after a very 
large number of exposures. It was not clear what
properties of stimuli make them vulnerable to satiation effects.


Recent studies have indicated that the re- 
peated exposure of a stimulus object to an 
individual seems to be a sufficient condition 
for the enhancement of his attraction to the 
object (Harrison, 1968; Johnson, Thomson, 
& Frincke, 1960; Matlin, 1970; Zajonc, 
1968). These studies have been limited to
semantic or attitudinal ratings made in response
to repeatedly presented nonsense
words, Chinese ideographs, or faces of men.


But the older literature on the effects of re- 
peated exposure has been primarily concerned 
with aesthetic judgments. The results of these 
older studies show a pattern that is strikingly 
consistent with the data collected more re- 
cently on neutral stimuli, especially in one 
important respect: the most pronounced attitudinal
effects are obtained when exposure is
manipulated for novel or unfamiliar objects.
Thus, for instance, American students’ liking 
for Oriental music was increased more by 
virtue of repeated exposure than was their 
liking for jazz (Meyer, 1903; Moore & Gilli- 
land, 1924). Mull (1957) was successful in 
making the music of Schoenberg and Hinde- 
mith more enjoyable after several exposures 
to subjects not familiar with that form of 
music. Similar effects have been obtained in 
the area of art appreciation (Krugman & 
Hartley, 1960; Maslow, 1937). 


1 This research was supported by National Science 
Foundation Grant GS-3119. 

2 Requests for reprints should be sent to R. B.
Zajonc, Research Center for Group Dynamics, Insti- 
tute for Social Research, University of Michigan, 
Ann Arbor, Michigan 48106. 
The finding that the exposure effect is
readily obtained for a novel stimulus, but not 
for a familiar one, is consistent with the fact 
that the relationship between exposure and 
frequency seems to be logarithmic. The most 
pronounced effects are obtained early in the 
exposure series. After the individual has had 
considerable experience with the stimulus,
each successive increment in affect requires
an increasingly greater number of exposures.
The possibility also exists that the
exposure effect is curvilinear and that following
an “excessive” number of exposures
affect may begin to decline. In fact, Maddi 
(1961) in studying the effects of environmen- 
tal homogeneity in humans, found such a 
curvilinear relationship between affect and 
familiarity, with the highest affective reaction 
obtaining for moderately familiar objects. 
Maddi (1968) argued that the linear effects 
of exposure which have recently been found 
are probably limited to meaningless objects. 
This assertion is contradicted to some extent 
by a clearly monotone relationship between 
log frequency and liking found in experiments 
that used photographs of men’s faces—stimuli 
that can hardly be considered meaningless, 
even though the photographs were taken from 
a school yearbook (Zajonc, 1968). Neverthe- 
less, the exposure effect which obtained for 
the photographs of graduating seniors was 
weaker than one found for nonsense words or 
Chinese ideographs. The view that exposure 
effects are curvilinear is also consistent with 
the optimum discrepancy hypothesis (McClel-
land, Atkinson, Clark, & Lowell, 1953) and
Berlyne’s (e.g., 1967) theory of responses to
novel and complex stimulation. 


The present paper reports a curvilinear re- 
lationship obtained as a function of exposure 
and a series of experiments utilizing a variety 
of stimuli designed to analyze its antecedents.


EXPERIMENT I 


Studies on exposure effects in aesthetic 
judgments carried out during the first part 
of the century did not always utilize proper 
experimental controls. The present experiment,
which had as its purpose exploring the effect 
of repetition on aesthetic judgments, utilized 
a design in which frequency of presentation 
was manipulated without confounding it with 
stimuli or order effects. 


Method 


Subjects. Seventy-two subjects from the University 
of Michigan paid subject pool were employed in the 
experiment. They were all males and for the most
part undergraduates. All subjects who were familiar 
with any of the stimuli were excluded from analysis, 
and they were replaced by naive subjects. The sub-
jects were paid $1.25 per hour for taking part in
the study. 

Stimuli. Color reproductions of 12 contemporary 
nonrepresentational paintings from the late 1950s 
were exposed. The reproductions were the size of a 
postcard and were obtained from an art supply 
dealer. Reproductions of the following paintings 
were used: Boille, Peinture; Baumeister, Diver and 
reflection; Cassinari, Composition; Hofman, Sorcier; 
Le Moal, Racines; Marca-Relli, Midi d'été; Marini, 
Le cheval; Masson, Esprits animaux; Schneider, 
Peinture 95 B; Soulages, Composition; Sugai, Ono; 
and Zack, Oil. 

Procedure. Each subject was run individually. 
Upon arriving at the laboratory, the subject was 
told that the experiment consisted of two parts and 
that he would learn about each part separately. He 
was first told that he would be shown some repro- 
ductions of paintings and that his only task was to 
pay close attention. Following the exposure series, 
the subject was asked to rate each painting on a 
7-point scale from “I like it” to “I dislike it.” The 
purpose of the experiment was then explained, and 
the subject was asked not to discuss it with his 
friends and acquaintances for a few weeks. 

The reproductions were shown for a period of 
about 2 seconds each. There was a 4-second interval 
between each presentation. The 12 reproductions 
were counterbalanced against six frequencies (0, 1, 
2, 5, 10, and 25), such that each painting occurred 
equally often in each of the six frequencies, but for 
different subjects. There were thus two stimuli shown 
in each frequency, constituting a form of internal
replication. The position of each stimulus in the
series of 86 presentations (i.e., two stimuli in each
of the six frequencies) was randomly determined.
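
The counterbalancing described above can be sketched in a few lines. This is only an illustration under stated assumptions: the cyclic rotation rule, the function names, and the seed are hypothetical and are not taken from the original procedure; only the six frequencies, the 12 paintings, and the two-paintings-per-frequency constraint come from the text.

```python
import random

# Values stated in the Method section.
FREQUENCIES = [0, 1, 2, 5, 10, 25]
N_PAINTINGS = 12


def assign_frequencies(rotation):
    """Map each painting index to an exposure frequency.

    Assumption (not from the source): paintings are taken in pairs and
    pair k receives FREQUENCIES[(k + rotation) % 6], so that six
    rotations place every painting in every frequency exactly once.
    """
    n = len(FREQUENCIES)
    return {p: FREQUENCIES[(p // 2 + rotation) % n] for p in range(N_PAINTINGS)}


def exposure_series(rotation, seed=0):
    """Build one randomly ordered exposure series for a given rotation."""
    assignment = assign_frequencies(rotation)
    series = [p for p, freq in assignment.items() for _ in range(freq)]
    random.Random(seed).shuffle(series)
    return series


# Two paintings at each frequency: 2 * (0 + 1 + 2 + 5 + 10 + 25) presentations.
series = exposure_series(rotation=0)
print(len(series))
```

Whatever rotation is chosen, the series length is fixed by the design, which is why the text can speak of a single series of 86 presentations.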


Results 


Figure 1 shows the results of the experi- 
ment which were analyzed by analysis of vari-
ance with frequency and replications as the
main factors. It is clear from the figure that 
the effect of frequency is curvilinear, with the 
extreme frequencies showing least positive 
ratings. The effects of frequency were signifi-
cant at the .001 level (F = 3.76, df = 5/355).
Trend analysis has shown the quadratic com- 
ponent to be significant at the .001 level 
(F = 11.93, df = 1/355), while the linear 
component was also statistically reliable 
(F = 6.08, df = 1/355, p < .05). These two 
components account for 95% of the variance 
associated with the frequency manipulation, 
with the quadratic component accounting 
for 63%. 
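
The percentages quoted above can be checked from the reported F ratios alone. The sketch below assumes that the overall frequency effect has 5 degrees of freedom (six frequency levels) and that the overall test and both trend components share one error term; neither assumption is stated explicitly in the text, and the function name is hypothetical.

```python
def share_of_effect(f_component, f_effect, df_effect, df_component=1):
    """Fraction of an effect's sum of squares carried by one contrast.

    With a common error mean square, SS = F * df * MS_error, so the
    error term cancels in the ratio SS_component / SS_effect.
    """
    return (f_component * df_component) / (f_effect * df_effect)


# F ratios as reported in the Results section.
F_FREQUENCY, F_QUADRATIC, F_LINEAR = 3.76, 11.93, 6.08
DF_FREQUENCY = 5  # assumption: six frequency levels minus one

quadratic = share_of_effect(F_QUADRATIC, F_FREQUENCY, DF_FREQUENCY)
linear = share_of_effect(F_LINEAR, F_FREQUENCY, DF_FREQUENCY)

print(f"quadratic share: {quadratic:.1%}")
print(f"linear + quadratic share: {quadratic + linear:.1%}")
```

Under these assumptions the quadratic share comes out close to the 63% reported, and the two components together close to the 95% reported; the small discrepancy is what one would expect from F ratios rounded to two decimals.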

The effects of replication were also reliable 
(F = 14.91, df = 1/71, p < .001). The means 
associated with this effect are as follows. In
each frequency category, one painting was
shown first and one second. The first item
was rated, on the average, at 2.89, while the
second item received a higher average rating
(3.45). Thus, stimuli appearing later in the
series were, on the whole, liked better, indi-
cating perhaps a generalized exposure effect.

Fig. 1. Favorability ratings of paintings as a
function of frequency of exposure.

There were strong differences in the ratings 
of the various paintings (F = 13.01, df =
11/852, p < .001), ranging from 1.58 to 4.15.
Because of these differences, the stimuli were 
divided into two sets of six on the basis of 
the ratings they received by those subjects 
who had not seen them previously. The curves 
of the stimuli initially liked and of stimuli 
initially disliked are also shown in Figure 1. 
It appears that both are of the inverted-U 
type, with the disliked paintings showing a
maximum at lower frequency than the liked
paintings, and having a more pronounced
quadratic component. Hence, the initial reac-
tion to the stimulus does not fully explain the
shape of the frequency curve.


EXPERIMENT II


The stimuli used in the above experiment 
were readily discriminable from one another,
and the subjects had no difficulty in distin-
guishing them even when they had seen them
very infrequently. The sharp differences
among aesthetic judgments of the paintings,
in themselves, suggest a great deal of discrimi-
nability between the stimuli. The question
must be raised whether discriminability of
these stimuli was responsible for the down-
turn of the exposure curve at high frequencies,
and whether, in general, stimulus discrimina-
bility sets limits on the exposure effect, such
that enhancement of attraction as a function
of exposure can be obtained only or primarily
for stimuli that are difficult to discriminate.
It is obvious that the very same manipula-
tions which enhance affective ratings also en-
hance recognition. The experiment reported in
the monograph on exposure effects (Zajonc,
1968) which employed Turkish-like words, ex-
cept for its dependent measures, was identi-
cal in procedure to one carried out by Solomon
and Postman (1952) and another by Zajonc
and Nieuwenhuyse (1964), in which the main
concern was the effect of stimulus repetition
on recognition threshold.

An analysis of the psychological process 
implicated in the exposure effect suggests the 
following features (Harrison, 1968; Zajonc, 
1968). Upon the first presentation of a novel 
stimulus, the individual, by definition, has 
no fixed (conditioned or unconditioned) re- 
sponses that the stimulus is able to elicit. 
Aside from the rather general orienting re- 
action that takes place, the stimulus has no 
ability to elicit a particular response because 
there has been no previous experience with 
the stimulus. However, the stimulus is not 
totally devoid of similarity to stimuli with 
which the individual has had commerce in 
the past. By virtue of this similarity, gen- 
eralized response tendencies are elicited, and 
the greater the number of known stimuli to 
which the newly exposed stimulus is similar, 
the greater is the number of generalized re- 
sponse tendencies that are elicited. Some of 
these tendencies may be incompatible with
one another, and all compete for emission. A
mild state of response competition or response
conflict takes place, and the individual at-
tributes his feelings to the stimulus viewed.
When asked, he says “I don’t like it,” “It does 
not please me,” or “It isn’t pretty.” Subse- 
quent exposures succeed in eliminating some 
of the generalized response tendencies that 
have originally been elicited, recurrent re- 
sponses that are elicited are strengthened, and 
eventually a fairly steep response hierarchy is 
built up, such that now one response domi- 
nates all others and has the highest likelihood 
of being elicited. 

The two assumptions involved in this 
account of the exposure effect were supported 
by experimental evidence from a series of 
studies by Harrison (1968), Matlin (1970), 
and Zajonc (1968). The first assumption is 
that exposure reduces response competition, 
and it was substantiated by a drop in galvanic 
skin response associated with repeated expo- 
sures (Zajonc, 1968), by a reduction of 
latencies of association responses made to 
previously frequently exposed stimuli (Har- 
rison, 1968; Matlin, 1970) and by a reduc- 
tion of errors made on a repeated association 
test (Matlin, 1970). The second assumption
holds that response competition is negatively 
related to liking. Stimuli generating different 
degrees of response competition were rated 
in experiments by Harrison and by Matlin, 
and in both series of experiments there was 
a clear indication that liking for the stimuli 
increases as response competition is reduced. 

The responses that take part in this com- 
petition could very well be categorizing or 
encoding responses. Increased recognition is, 
in fact, one form of reduction in such com- 
petition of categorizing or encoding responses. 
It is possible, therefore, that in comparison
with previously used materials (nonsense syl- 
lables, Chinese ideographs, etc.), the paintings 
were extremely discriminable, and hence there 
was very little response competition to reduce. 

The question remains why there was a
downturn in affect when the paintings were
presented more than twice. This drop in affect
is consistent with the optimal discrepancy 
hypothesis (McClelland et al., 1953) and 
other theorizing which extols the virtues of 
moderation. These theories, however, are for 
the most part various restatements of the 
empirical phenomenon, observed more inform- 
ally and anecdotally than systematically. The 
most convincing results in this respect are 
those of Haber (1958), who had subjects rate 
the pleasantness of water temperature. 

In the context of the experimental para- 
digms used in the study of the exposure ef- 
fect, one could suggest the following possibil- 
ity. To the extent that repeated exposure fa- 
cilitates encoding and discrimination, there 
exists some number of exposures which is suf- 
ficient for the establishment of perfect identi- 
fication, categorization, and discrimination of 
the stimulus. Once this has been accomplished, 
successive exposures contribute nothing to the 
recognition process. But the experimental pro- 
cedure requires the subject to view other stim- 
uli as well, some of which are seen extremely 
infrequently. If the subject is imposing a 
recognition task on himself and if he is 
struggling to establish proper encoding of 
all the stimuli in the series, then reexposure 
of well-categorized stimuli can only be frus- 
trating because it blocks the subject from 
achieving encoding of infrequent stimuli. 

The present experiment is therefore con-
cerned with the effect of stimulus discrimina-
bility on the relationship between repeated 
exposure and liking. Experiment I was re- 
peated with a new set of stimuli which were 
minimally discriminable from each other with 
the expectation that a considerably more 
pronounced exposure effect would be obtained. 


Method 

Subjects. The subjects were 36 male undergraduates 
enrolled at the University of Michigan. They were 
paid $1.25 per hour for their participation. 

Stimuli. For the purpose of the present experiment, 
a postcard reproduction of a nonrepresentational 
painting (Tauromachie) by Singier (1948) was cut 
into 12 equal segments, each measuring 48 × 20 milli-
meters. These segments were mounted on 3 × 5 inch
index cards and shown to the subjects in their 
horizontal orientation. 

Procedure. The procedure was otherwise identical 
to that used in Experiment I.


Results 


The results of the experiment are shown in 
Figure 2, where, for the purpose of compari- 
son, data from Experiment I are also included. 
While the general trend is also of the type 
found in Experiment I, we note that the drop
in affective ratings begins at much higher
frequencies. Moreover, the drop is rather
small. In Experiment I the affective ratings
of stimuli seen 25 times were actually lower
than those never seen. This is no longer the
case. While the quadratic component ac-
counted for over 60% of variance in Experi-
ment I, only 26% of variance is accounted
for by it in the present experiment.

Fig. 2. Favorability ratings of similar and distinct
paintings as a function of frequency of exposure.

The lack of clear discriminability among 
the stimuli was also associated with a lack 
of differentiation in the ratings, which were 
now found to be quite homogeneous across 
all the stimuli. The best liked piece showed 
a mean of 3.50, while the least liked piece 
had an average of 2.97, which is to be com- 
pared with 4.15 and 1.58, respectively, in the 
previous experiment. The expectation that the 
satiation effect would be attenuated by de- 
creasing stimulus discriminability has thus 
been substantiated. 


EXPERIMENT III 


These results suggest that the relationship 
between exposure and affect may indeed be 
curvilinear. It is possible that the previous 
studies which found a monotonically increas-
ing trend as a function of frequency dealt
with only part of the entire function. Had a 
frequency manipulation been carried out far 
enough, a drop in affect would eventually have 
been obtained. 

The limiting case for the linear relation- 
ship between frequency and affect can be
taken, for the moment, to be the one involving 
neutral and novel stimuli which are fairly 
similar to one another. Increasing discrimina- 
bility among the stimuli must reduce the 
effects of exposure, even to the point of 
producing what looks like a satiation phe- 
nomenon. To explore this possibility further, 
two experiments were conducted, one utilizing 
a comparison with stimuli previously used 
(Experiment III) and one using an entirely 
new set of stimuli (Experiment IV). In 
Experiment III faces of men were used to 
compare the results thus obtained with those 
from a previous experiment (Zajonc, 1969), 
in which faces of men that were not well 
discriminable were used. 

In the original monograph on the exposure
effect (Zajonc, 1968), an experiment was re-
ported which showed a monotone positive
relationship between logarithm of frequency
of exposure and affect. Used in this experi-
ment were Xeroxed photographs of men taken
from a college yearbook. The photographs
were not well reproduced, and the faces, it
appeared, were not altogether easy to dis-
criminate from one another. The present
experiment also utilized photographs of men,
but ones that were extremely discriminable.
We should expect that an increase in dis-
criminability will reduce the exposure effect.


Method 


Subjects. Again, subjects were 36 paid volunteers 
from the University of Michigan paid subject pool, 
and they were paid $1.25 per hour for taking part 
in the experiment.

Stimuli. The stimuli were 12 photographs of men 
which differed in several respects. Some of the faces 
were of young men, others were of middle-aged 
men, and some were of old men. Some photographs 
were taken with hats or berets, others without hats.
Some men wore glasses, some did not. The apparent
racial background of the faces also differed. The
attire varied from photograph to photograph; some
were taken in shirtsleeves, others in vests, still
others in sweaters, jackets, or suitcoats. Some photo-
graphs were posed en face, others were profiles or
partial profiles. Some men were smiling when photo-
graphed, others looked morose. To facilitate dis-
criminability further, 4 of the photographs were
mounted at the top of a 3 × 5 inch index card,
4 were mounted in the middle, and 4 on the bottom. 
Preliminary tests showed that the faces were easily 
discriminable. 

Procedure. The same procedure as in the previous 
experiments was followed. 


Results 


The results of the present experiment are 
shown in Figure 3, together with data reported 
in the 1968 monograph (Zajonc, 1968). The 
earlier experiment utilized Xeroxed photo- 
graphs of graduating seniors, and they were 
reproduced in that monograph as they were 
shown to the subjects, although somewhat 
reduced in size. 

It is evident from Figure 3 that, contrary 
to expectations, a frequency effect was again 
obtained, without any suggestion of a satia- 
tion effect. The effect is significant at the .05 
level (F = 2.88, df = 5/175). The curves
from the two different studies are nearly 
identical. 

Fig. 3. Favorability ratings of similar and dis-
tinct photographs as a function of frequency of
exposure.

3 This data was collected by Christine Linder,
whose help in this respect we appreciate.

While pretest data showed differences in
discriminability between the stimuli from
Experiment III and the one reported in the 
monograph, there were unfortunately equal 
ranges of difference in liking. In the present 
experiment the best liked face had a mean 
of 4.19, and the least liked face had a mean 
of 2.50. The comparable figures from the 
previous experiment with photographs are 
4.01 and 2.01. As a result of this ambiguity, 
an experiment was carried out in which the 
discriminability of stimuli was previously 
well and independently established. 


EXPERIMENT IV 


In 1962, Goss, Nodine, Gregory, Taub, and 
Kennedy carried out a study of paired- 
associate learning in which two lists of non- 
sense syllables were used. The nonsense syl- 
lables were matched for a number of attributes 
including meaningfulness across the lists, but 
the lists differed in internal similarity. Thus, 
for instance, the list characterized by high 
internal similarity contained nonsense sylla- 
bles that were each other’s mirror images, 
such as HUW and WUH, for example. There
were such instances both among the stimulus 
items and among the responses paired with 
the stimuli. The items in the lists character- 
ized by low internal similarity were distinct 


in every respect. The lists were carefully 
chosen, and they were used in a series of 
experiments with consistent results. Of inter- 
est, for present purposes, is the difficulty 
encountered in learning these lists. Thus, for 
example, in one experiment, pairs constructed 
of similar stimulus sets and similar response 
sets required 44.7 trials to a criterion of four 
perfect repetitions, while lists constructed of 
distinct stimulus sets and distinct response 
sets were learned in 16.5 trials on the average. 
Because the discriminability of these stimulus 
lists was clearly documented, they seemed 
ideally suited for the study of the limiting 
effects of stimulus discriminability on the 
exposure effect. 

It was previously shown by Harrison 
(1968) and by Matlin (1970) that a reduc- 
tion of response competition which occurs as 
a result of repeated exposures is reflected in 
the reduced latencies of association responses 
made to the exposed stimuli. It was therefore 
expected in the present experiment that (a) 
since stimuli which are initially difficult to 
discriminate from one another have more to 
gain (in the reduction of response competi- 
tion) by repeated exposure than stimuli which 
are initially distinct, a stronger exposure 
effect should obtain with stimuli that are 
initially hard to distinguish from one another; 
(b) these effects should be manifested not 
only when they are measured by affective 
ratings but also when they are measured by 
the latency of association responses. Since in 
most of the previous experiments the perma- 
nence of exposure effects was never examined, 
measures of stability of exposure effects were 
collected over a 1-week period; such a retest 
would also show whether a satiation effect, if
obtained, would last over a period of time.


Method 


Overview of the design. During the first session, 
16 subjects were run in each of two conditions, 
labeled “similar” and “distinct.” Liking measures and 
word-association response latencies were obtained for 
all subjects. During the second session, 1 week later, 
each subject was retested in exactly the same manner. 
This design allowed for between-group comparisons 
at Session 1, to discover the immediate effect of 
the similarity manipulation, and within-group com- 
parisons, to check for systematic changes in the 
relationship between frequency and liking (or 
latency) between Sessions 1 and 2.

The initial instructions consisted simply of the
following statement: “In a moment you are going to
see a series of slides projected on the wall in front
of you. Please watch them carefully. When all the
slides have been shown I'll tell you what to do next.”

Subjects. Subjects were 32 female undergraduates
at the University of Michigan, who fulfilled a re-
quirement in an introductory psychology course by
taking part in the experiment.

When subjects were first contacted, they were told
that to meet the requirement, they would be par-
ticipating in two different experiments since each
one would require less than ½ hour of their time.
Appointments were made at that time for them to
appear at the lab for the initial experiment, and
1 week later for (presumably) the second one.

Similar treatment at Session 1. Sixteen subjects
were presented with eight stimuli from the following
two lists of CVC trigrams: JEX, XEJ, ZEX, XEZ, and
HUW, WUH, QUW, WUQ. Following Goss et al. (1962),
“similarity” was defined as having two letters in
common. Pairs of corresponding trigrams from the
two lists (e.g., JEX and HUW) were permanently
treated as a set, such that both members of a set
received the same number of exposures for a given
subject (note that this does not mean that they
necessarily appeared one after the other). Four expo-
sure levels were used: 2, 5, 10, and 25. Exposure
frequency and stimulus sets were counterbalanced
according to a Latin square design, such that each
set of two trigrams appeared in each frequency
equally often but for different subjects.
Each subject viewed each stimulus, projected from
a 35-millimeter Nikkormat slide projector, for 3
seconds; 2 seconds elapsed between exposures. This
procedure continued, according to a predetermined
random order, until all stimuli had been presented.
At this point all subjects were tested for liking and 
response latencies. 
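
A Latin square counterbalancing of the kind described for the similar treatment can be sketched as follows. The cyclic construction, the position-by-position pairing of the two lists beyond the one pair named in the text (JEX with HUW), the function names, and the seed are all assumptions made for illustration; the source does not specify which square or which random order was used.

```python
import random

# Exposure levels and trigram lists as given in the Method section.
LEVELS = [2, 5, 10, 25]
LIST_A = ["JEX", "XEJ", "ZEX", "XEZ"]
LIST_B = ["HUW", "WUH", "QUW", "WUQ"]
# Assumption: "corresponding" trigrams are paired by list position.
SETS = list(zip(LIST_A, LIST_B))


def latin_square(n):
    """Cyclic n x n Latin square: each value occurs once per row and column."""
    return [[(row + col) % n for col in range(n)] for row in range(n)]


def schedule(row, seed=0):
    """Randomly ordered slide sequence for one row of the square."""
    square = latin_square(len(LEVELS))
    slides = []
    for set_index, pair in enumerate(SETS):
        freq = LEVELS[square[row][set_index]]
        for trigram in pair:
            # Both members of a set receive the same number of exposures.
            slides.extend([trigram] * freq)
    random.Random(seed).shuffle(slides)
    return slides


for row in range(len(LEVELS)):
    print(row, len(schedule(row)))
```

Each row of the square yields 2 × (2 + 5 + 10 + 25) slides, and across the four rows every set of two trigrams appears once at every exposure level.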

Testing. Eight subjects were asked to rate each
trigram in the JEX series on a 102-millimeter line
from “most pleasing” to “least pleasing.” They were 
then shown each trigram from the HUW list and 
asked to respond to it with “the first thing that 
came into their minds.” (Each stimulus was shown 
for 10 seconds, while the experimenter recorded the 
latency of response by using a Hunter timer.) Half 
of this group did the ratings first, then the associa- 
tions; half were tested in reverse order. 

The other eight subjects rated the HUW list and 
gave associations to the JEX series. The order in
which they were tested was likewise alternated. 

The liking scales were printed in a standard order 
for all subjects in a questionnaire format. 

Distinct treatment at Session 1. This treatment
was exactly the same as the first, except that the
16 subjects were exposed to “distinct” stimuli:
WUH, GEC, YOV, QAS, and KEZ, BOF, JID, NAX. Tri-
grams were arranged into sets of two, corresponding
to pairings of the similar series, but the pairing was 
done at random. The sets were constant over the 
various exposure levels. 

Session 2 procedure, both conditions. When the
subject returned for the subsequent session, she was
given the same tests (and in the same order) for 
liking and latency that she had taken the week 
before. Unlike Session 1, this test, of course, was 
not preceded by the repeated exposure of the stimuli. 
The purpose of the experiment was then explained. 


Results 


Liking. Results for the affective ratings of 
the trigrams are shown in Figure 4. It is clear, 
first of all, that neither the group exposed to 
similar trigrams nor the group exposed to the 
distinct trigrams manifests a decrement in
affect at the high frequencies. The quadratic 
trend expected to characterize the data on 
distinct trigrams was not obtained. There 
seems to be a drop at 10 exposures in the 
distinct treatment, but the ratings rise again 
at 25 exposures. There is, however, a slight
difference in the slopes of the two curves be-
tween the two groups. Subjects viewing simi-
lar stimuli show a somewhat steeper frequency
function than those given distinct stimuli,
and this is so at Sessions 1 and 2. The results
of the experiment were analyzed by analysis
of variance in which the treatments were a
between-subject factor and frequency and the
time of testing were within-subject factors.
Only frequency was shown to have a signifi-
cant effect (F = 2.66, df = 3/90, p < .05).

Since the graphed results offered some sup- 
port for the hypothesized difference between 
the treatments (i.e., a perfect frequency effect 
for the similar group but not for the distinct 
group), separate one-way analyses of vari- 
ance were performed on the data from each 
treatment at Session 1 separately, even though 
there was no significant interaction between 
frequency and treatment, The frequency effect 
was found to be statistically significant 
only for the group viewing similar trigrams 
(F = 3.53, df = 3/45, p < .05).

Figure 4 also suggests that there was a great 
deal of stability over the 1-week test-retest 
period. The average test-retest correlation in 
the similar group was equal to .64, while 
that in the distinct group was equal to .76. 

Fig. 4. Favorability ratings of similar and distinct nonsense syllables as a
function of frequency of exposure.

Latencies. The results for word association
latencies are presented in Figure 5. The graph
offers moderate support for the prediction
that latencies at Session 1 would decrease as
a function of frequency of exposure. How-
ever, it is obvious that this pattern changes
considerably over the 1-week test-retest
period, and that the nature of the change
is different for the two treatments. The analy-
sis of variance bears this out statistically:
there is a significant effect for test-retest
(F = 4.41, df = 1/30, p < .05); for the in-
teraction between test-retest and frequency
(F = 2.93, df = 3/90, p < .05); and for the
interaction between test-retest and treatments
(F = 4.18, df = 1/30, p < .05).

Average test-retest correlations were com-
puted separately for the two treatments, as
was done for the liking ratings. These were
.40 for the similar group and .46 for the
distinct group—substantially lower than the
correlations for the liking ratings.

Fig. 5. Latency of word associations to similar and distinct nonsense syllables
as a function of frequency of their exposure. (Panels: Similar Trigrams and
Distinct Trigrams; ordinate: latency in seconds; abscissa: frequency.)

Figure 5 shows that, on the average, word 
association latencies decreased from Session 1 
to Session 2 at each frequency level in the 
distinct treatment, while they increased on 
the average for two of the four frequency 
levels in the similar treatment. This difference 
is partly explained by the following process. 
Although subjects were not instructed at 
Session 2 to recall their associations from 
Session 1, it was reasonable to expect that 
many would attempt to do so. This was a 
more difficult task for the similar stimuli
than for the distinct trigrams. In fact 31 of
64 associations remained the same over 1
week in the distinct group, but only 19 of
64 remained the same in the similar group
(χ² = 3.97, p < .05). Moreover, there was a
slight indication that intralist confusion errors
were more frequent in the similar condition.
This suggests that associations for stimuli
in the distinct group were more clearly or
firmly established than associations in the
similar group, and thus were more easily
elicited at Session 2. It does not explain,
however, the Session 1-Session 2 change in
shape of the relationship between frequency
and latency.
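
The chi-square value reported for the repeated associations can be reproduced from the two counts given in the text. The sketch below assumes a 2 × 2 table (group by same/changed association) and a Yates continuity correction; the source does not say which form of the test was used, and it is the corrected form that matches the reported figure. Treating the 64 associations per group as independent observations follows the analysis as reported and is not an endorsement of that choice. The function name is hypothetical.

```python
def chi_square_2x2(a, b, c, d, yates=True):
    """Pearson chi-square for the 2 x 2 table [[a, b], [c, d]].

    With yates=True the continuity correction subtracts n/2 from
    |ad - bc| before squaring (floored at zero).
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Counts from the text: 31 of 64 unchanged (distinct), 19 of 64 (similar).
same_distinct, total_distinct = 31, 64
same_similar, total_similar = 19, 64

corrected = chi_square_2x2(
    same_distinct, total_distinct - same_distinct,
    same_similar, total_similar - same_similar,
)
uncorrected = chi_square_2x2(
    same_distinct, total_distinct - same_distinct,
    same_similar, total_similar - same_similar,
    yates=False,
)
print(round(corrected, 2), round(uncorrected, 2))
```

The corrected statistic rounds to the 3.97 given in the text; without the correction the same counts give a somewhat larger value.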

It cannot be asserted on the basis of the
above results that evidence has been obtained 
for a curvilinear relation between exposure 
and liking. Although there was some difference 
in the slopes of the curves between the similar 
and the distinct conditions (shown only by 
means of liberal statistical procedures), the 
predicted downturn at high frequencies did 
not materialize. In fact, the highest affective 
rating obtains for the highest frequency in 
both treatments. The frequency effect for the
similar treatment is strong and linear. But
the curve for the distinct stimuli rises quickly
from Frequency Level 2 to 5, and in line with
the expectations falls from 5 to 10. But,
contrary to the hypothesized inverted-U rela-
tionship, the curve veers upward at 25 expo-
sures. There is no ready explanation for these
results. Had the shape of the curve for the
distinct stimuli begun with a rise at the low
frequencies and shown a downturn at 25
exposures, one could regard the discrepant 
point at 2 exposures as perhaps experimental 
error, and due to unreliability associated with 
the small number of exposures. With a small 
number of exposures, there is a possibility 
of confusing one stimulus with another be- 
cause the subject had forgotten it. But when 
the stimulus is rated as most attractive when 
it is presented 25 times, confusion and for- 
getting are out of the question. The stability 
of the affective ratings over the 1-week period 
further attests to the fact that the data are 
not accidental, and that the exposure effect is 
not trivial.

The results on latencies are perplexing.
These results are consistent with the response
competition theory when data at Session 1 
are considered. There is a general drop in 
latencies as a function of repeated exposure, 
and although there are some irregularities in 
the curves, the overall trend seems to be the 
same in the two treatment groups: a decline 
in latencies with frequency of exposure. What
we fail to observe, however, is a difference in 
latencies between the distinct and similar 
stimuli with frequency held constant. It would 
clearly be expected that because the distinct 
stimuli produce less response competition than 
the similar stimuli, they should produce a 
lower latency of association responses than 
similar stimuli. This is not the case at Ses- 
sion 1. This difference does appear, however, 
at Session 2. One way of explaining these 
results is to suggest that the latency measure, 
while generally reflecting response competi- 
tion, fails to do so unequivocally in the 
present experiment. For, while there was per- 
haps some confusion between the stimuli in 
the similar list, the subject was able to come 
up with an association just on the basis of the 
first two letters of the trigram. Moreover, she 
felt no compulsion to produce unique
associations. Thus, one subject in the similar
treatment gave the following four associations:
JEX-JUNE; XEZ-JEW; ZEX-SEX; and XEZ-SEX.
Another subject in this treatment responded
as follows: JEX-JET; XEJ-JET; ZEX-SEX; and
XEZ-ZEX. But when asked to give word
associations a week later, the subjects did feel that
they should produce the same associations
that they had offered a week earlier, and hence the
distinct list gained an advantage at Session 
2. This tendency to reproduce the earlier 
association response may also be the factor 
which obliterated the effects of frequency 
during the retest. 


Discussion 


An effort was made in the present series
of experiments to understand a “satiation”
effect found with one type of stimulus: abstract
paintings. The account assumed stimulus
discriminability to play a major role in
setting limits on the exposure effect. Indeed, 
when discriminability of paintings was ma- 
nipulated, the data showed a displacement of 
the frequency curve consistent with the hy- 
pothesis that the exposure effect holds only 
for stimuli that are hard to discriminate from 
one another. But the manipulation of stimu- 
lus discriminability with two other types of 
stimuli failed to reveal satiation effects. Why? 

First, it is possible that the procedure used 
with faces and nonsense syllables did not 
utilize a sufficient number of exposures. Thus, 
it is possible that satiation eventually obtains 
with any stimulus, provided that exposures 
are extended to a sufficiently large number. 
But we have evidence from other work (see 
Zajonc, Swap, Harrison, & Roberts, 1970) 
that there is no satiation even with as many as 
81 exposures of Chinese ideographs. More- 
over, it is not immediately apparent why 
satiation should occur at 2 exposures with 
rather handsome paintings but fail to appear 
even at 25 exposures when photographs of 
men’s faces, or worse, nonsense syllables, are 
used. Needless to say, it is still more surpris- 
ing that 81 exposures of a Chinese ideograph 
also do not produce a downturn in the ex- 
posure curve. 

The second possibility is that the curvi- 
linear exposure effect found with abstract 
paintings is an exception to the general case, 
which is the positive log function previously 
found on many occasions, and that it is due 
to some as yet unknown causes. Perhaps the 
curve obtained with abstract paintings repre- 
sents a composite of two types of individuals, 
those whose liking grows with exposures but 
reaches an asymptote rather quickly, and
those whose liking is initially unaffected by
repeated exposures but declines rapidly as
exposures reach a certain level. In fact, all
exposure curves reported in the literature are
averages of numerous observations, and the
average curve is not always fully representative
of individual data. The F ratios reported
for the frequency effects are not overwhelming.
But on the other hand, it is not exactly
surprising that there is variation among the 
individual exposure curves because the indi- 
vidual data are subject not only to the fre- 
quency effect, but they are also influenced 
by the particular stimuli, the sequence of 
their presentation, anchor and contrast effects, 
and other momentary as well as more chronic 
effects dependent on the individual’s state. 
Considering that these effects must be over- 
come, and considering that the manipula- 
tion producing the enhancement of affective 
ratings consists of mere exposure, having no 
external reinforcement or incentive to rely
on, the psychological process underlying the 
exposure phenomenon must be fairly robust. 

Further analysis of Experiment I reveals 
that in this case at least—which is the 
strongest evidence for an inverted-U effect— 
the average curve is a fair representation of 
the individual reactions. To examine these 
individual reactions, adjacent frequency levels 
were collapsed thus forming a three-point 
curve for each subject with each point based 
on four observations. The largest number of 
subjects (36%) exhibited an inverted-U curve
as a function of repeated exposures. Of the 
remaining subjects, 23 (32%) produced 
curves with a monotone negative slope; 14 
(19%), with a monotone positive slope; and 
9 (13%), with a U shape. An explanation 
based on a composite effect of averaged 
positive and negative functions could only 
apply to at most 50% of the subjects in 
Experiment I.
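
The breakdown of individual curve shapes can be checked with a few lines of arithmetic. The sketch below is only illustrative: the counts 23, 14, and 9 come from the text, whereas the 26 inverted-U subjects and the total of 72 are not reported and are inferred here from the stated percentages. Under that assumption, the two monotone groups together make up about 51% of the subjects, which agrees with the rough ceiling of 50% given above for a composite explanation.

```python
import math

# Counts of individual curve shapes in Experiment I.
# 23, 14, and 9 are reported in the text; the 26 inverted-U subjects
# (and hence N = 72) are inferred from the percentages, an assumption.
counts = {
    "inverted-U": 26,
    "monotone negative": 23,
    "monotone positive": 14,
    "U-shaped": 9,
}


def pct(part, whole):
    """Percentage of `whole`, rounded half up to an integer."""
    return math.floor(100 * part / whole + 0.5)


total = sum(counts.values())
shares = {shape: pct(n, total) for shape, n in counts.items()}

# A composite of averaged positive and negative functions can only
# involve subjects whose own curves are monotone.
monotone = counts["monotone negative"] + counts["monotone positive"]
monotone_share = 100 * monotone / total

for shape, share in shares.items():
    print(f"{shape}: {counts[shape]} subjects ({share}%)")
print(f"monotone curves: {monotone} of {total} subjects ({monotone_share:.1f}%)")
```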

While discriminability of stimuli had a 
significant effect on the shape of the exposure 
curve when abstract paintings were used, it 
did not seem to affect reactions to faces of 
men or to nonsense syllables in this way. 
Perhaps material which is of greater interest 
can lead to the sort of effects about which 
speculations were made above. That is, a
form of reactance (Brehm, 1966) or a form
of frustration arises when a well-known stimulus
is presented repeatedly while the individual
is eager to become acquainted with
all the stimuli in the series, especially with 
those which he has seen only once or twice, 
and this reactance or frustration affects the 
judgments of frequently exposed stimuli. 

Experiments I and II were designed to differentiate between alternative
interpretations of the contrast effect. Experiment I examines and rejects the
hypothesis that the effect is specific to rating scales, since the effect also occurs
when subjects are asked to make comparative judgments. Experiment II leads
to the rejection of the hypothesis that the effect is purely semantic (i.e.,
specific to situations in which verbal labels are applied to attitudes); when
subjects are asked to write items that would be endorsed by people with whom
they agree or disagree, they write much more extreme items for those with
whom they disagree than for those with whom they agree. In fact, the items
subjects write for people with whom they disagree are not endorsed by those
people, the reason most often given being that the items are too extreme.
Experiments III, IV, and V demonstrate that the differences are so clear-cut that
such attitude statement writing could be used as an indirect measure of
attitude. Judges were able to categorize writers’ attitudes rather accurately, but
accuracy was no greater when the contrast effect was explained to them than
when it was not. The implications of these findings for intergroup conflict are
discussed, particularly the implication of a positive feedback effect that will
lead to exacerbation of group differences.


The contrast effect in social judgment 
(Hovland, Harvey, & Sherif, 1957) refers to 
an individual’s tendency to exaggerate the 
discrepancy between his own attitudes and the 
attitudes represented by opinion statements 
endorsed by people with opposing views, at 
least when these people are held in low esteem 
(see Manis, 1966, pp. 71-78, for a review of 
the literature documenting this effect). To 
the degree to which this tendency is strong 
or prevalent, it will exacerbate the conflict 
between opposing groups; for if members of 
each group believe that statements of mem- 
bers of the other group represent more ex- 
treme attitudes than they in fact do, each 
group will believe it to be more difficult to
compromise with the other than it in fact
would be; further, the belief that members
of the other group hold extreme attitudes
will lead to a lowering of esteem for that
group, which in turn will strengthen the
contrast effect. Thus, this effect is both
destructive and self-perpetuating.

1 This research was supported in part by National
Institute of Mental Health Grant MH-12972 and in
part by National Science Foundation Grant GS-1116.
The authors are especially indebted to Judy Boylan
for her assistance in data collection and analysis.

2 Requests for reprints should be sent to Robyn M.
Dawes, Oregon Research Institute, P. O. Box 3196,
Eugene, Oregon 97403.

While the above statements seem intuitively 
reasonable, it is important to ask exactly 
what is meant by the phrase “tendency to 
exaggerate,” or, equivalently, to ask how the 
authors who have investigated this tendency 
have established its existence. The purpose 
of the present series of experiments is to 
examine alternative interpretations that may 
be placed on previous findings, to collect data 
meant to discriminate between these interpre- 
tations, and finally to develop and demon- 
strate the validity of an indirect measure of 
attitude resulting from the earlier studies. 
Then, the implications of the contrast ef- 
fect for intergroup communications will be 
reassessed. 

In the past, the method used for evaluating
the contrast effect has been that of asking
subjects to rate the attitude expressed by a
particular opinion statement. These ratings
have been made with respect to a dimension
predetermined by the experimenter (e.g.,
pro-anti), and they have all involved categorical
judgment; that is, the subject is to
place the attitude expressed by a particular 
statement in one of several categories on the 
specified dimension. This procedure involves 
two serious problems. 

1. The procedure assumes interpersonal 
comparability of categorical judgment. For 
example, when a pro individual places a pro 
statement in a +2 category, whereas an anti 
individual places it in a +3 category, it is 
assumed that the categories have the same 
meaning for the two individuals but that the 
attitude expressed appears different to them. 
It could, however, just as easily be assumed 
that the pro individual and the anti individual 

assess the attitude equivalently but use the 
pro and anti categories of rating scales in a 
different manner. This duality between as- 
suming differential judgment of stimuli and 
assuming differential use of category bounda- 
ries is illustrated in Figure 1. Part A of 
Figure 1 illustrates the data interpreted ac- 
cording to the assumption that the categories 
have the same meaning to both pro and anti 
judges, whereas these judges believe that a 
statement expressing an opposing attitude 
represents a more extreme position than does
a statement with which they agree. Part B of 
Figure 1 illustrates the data interpreted ac- 
cording to the assumption that the judges do 
not believe that the statements represent dif- 
ferent extremities of attitude, but each judge 
allows the extreme categories to cover a 
broader range of the attitudes with which he 
disagrees than of those with which he agrees. 
This latter interpretation is compatible with 
the current vogue of labeling one’s political 
opposition “extremist.” (For a fuller discus- 
sion of the duality between interpreting cate- 
gorical judgments in terms of perception of 
stimuli and interpreting them in terms of 
differential use of category boundaries, see 
Coombs, 1963, or Dawes, 1963.) 
2. The procedure requires judgment with 
respect to an attribute specified by the experimenter,
rather than one chosen by the subject.
It is perfectly possible that if the subject 
were asked to judge which attitude statements 
are similar, or which he responds to in the 
same manner (e.g., with approval or anger), 
the position of the statements on the specified 
attribute would not be related to such judg- 
ments. In other words, the procedure involves 
assuming, rather than demonstrating, that the 
specified attribute is important in the judge’s 
perception of attitude.

In the context of psychophysics, the phi- 
losopher Nelson Goodman (1951) pointed out 
20 years ago that procedures which involve 
judgment with respect to an attribute speci- 
fied by an experimenter beg the most impor- 
tant perceptual question, which is: What are 
the relevant perceptual dimensions? Hastorf,
Richardson, and Dornbusch (1958) make
much the same point in the domain of interpersonal
perception when they argue that this
phenomenon should be studied by
asking the perceiver to choose his own attributes
and categories in describing [...] according
to traits selected by the experimenter.
The fact that the experimenter himself is
a perceiver and a social judge as well as an
investigator argues that he will choose important
attributes, but even then it is not clear
that a judgment concerning the position of
stimuli on these attributes will predict other
behavior of the judge toward these stimuli.
(Of course, the judgment will nevertheless be
made, if only because the investigator has
asked for such a judgment, and most subjects
are polite enough to acquiesce even in tasks
that they consider meaningless.)

Furthermore, the procedure of specifying 
an attribute evaluates the contrast effect only 
within a semantic framework; that is, it is 
concerned only with how judges label statements
with respect to certain semantic concepts
such as pro-anti. It does not follow that
judges who label statements differently—even 
judges using categories in the same way— 
perceive reality differently, or that one or the 
other has distorted it. 

Experiment I examines the contrast effect
by using a comparative, rather than a categorical,
evaluation technique. Comparative
judgments have previously been used to evaluate
attitude (see Ager & Dawes, 1965; Kelley,
Hovland, Schwartz, & Abelson, 1955), but
not in the manner used in this experiment.
Experiment II examines the effect in a situation
in which the attribute of judgment is
not specified by the experimenter. Experiments
III, IV, and V develop the indirect
assessment measure derived from the previous
experiments.

[Figure 1, panel a, Statement Interpretation; panel b, Category Interpretation. Each panel places an anti statement and a pro statement on an anti-pro scale running from -3 to +3, once for the pro's judgment and once for the anti's judgment.]

FIG. 1. Alternative interpretations of the contrast effect.


EXPERIMENT I 


The technique used in this experiment is 
one adapted from a perceptual judgment ex- 
periment of Coombs (1958). In Coombs’ 
experiment, subjects were asked to make 
judgments about which achromatic color 
patches were best representative of what they 
meant by the label “gray.” Sometimes the 
patches were both quite dark and sometimes 
both quite light, but the crucial judgments 
in the experiment were of pairs consisting of
one light and one dark patch; in such pairs,
one of the members lay on one side of what 
the subject considered an ideal gray, whereas 
the other member lay on the other side. 
(Coombs termed such pairs bilateral pairs.) 

A judgment of which member of a bilateral 
pair is more representative of gray can be 
conceptualized as an extremity judgment. If 
the dark member appears more extremely 
dark than the light member appears extremely 
light, then the subject will say the light 
member is more representative; if the light 
member is perceived as more extreme, then 
the dark one will be judged as more repre- 
sentative; “extremity” is, of course, defined 
with respect to the subject’s conception of 
ideal grayness. 

The present study makes use of such extremity
judgments to study the contrast effect.³
Consider two statements, one pro and
one anti, that are judged equal in extremity 
by some hypothetical (and probably non- 
existent) neutral judge. If an individual who 
is pro tends to exaggerate the discrepancy 
between his position and that expressed by 
the anti statement, he will judge the anti 
statement to be more extreme; conversely, if 
an individual is anti, he should judge the pro 
statement to be more extreme. However, in
addition to differences in categorical judg- 
ments of statements, pros and antis should 
also comparatively rate the statements dif- 
ferently, irrespective of categorization. If the 
contrast effect were due solely to differences 
in categorical judgments, then no such differ- 
ences of extremity judgments should be found 
when the comparative method is used (see 
Figure 1b). 

It is not possible to obtain statements that 
are precisely equal in extremity as judged by 
a neutral individual; further, it was hypothe- 
sized that there will be differences in the 
perception of opinion statements—even by 
neutral individuals—that exist over and above 
the hypothesized contrast effect. However, it 
is possible to test the hypothesized effect on 
a statistical level. That is, there should be
a tendency for pro subjects to judge anti
statements as more extreme and anti subjects
to judge pro statements as more extreme when
making extremity judgments of bilateral
opinion statements.

3 It should be noted that only the technique has
been borrowed from Coombs’ (1958) experiment;
the hypothesis tested in this experiment is
unrelated to that tested by Coombs.


Method 


The opinion statements used in this experiment 
all expressed attitudes toward the war in Vietnam. 
There were nine pro statements (henceforth referred 
to as “hawk” statements) and nine anti statements 
(“dove” statements). These statements, displayed 
below, can be classified according to three levels of 
extremity—here labeled “extreme,” “moderate,” and 
“mild.” The basis for this classification was a series 
of judgmental studies involving both categorical 
judgments and unilateral extremity judgments. The
final statements, chosen from an initial pool of 48,
have the property that virtually no subject when
asked to judge hawk statements alone or dove statements
alone judged a statement in the moderate
category to be more extreme than one in the extreme
category, or one in the mild category to be more
extreme than one in either the moderate or the
extreme category.⁴ No bilateral extremity judgments
were collected during the item selection procedure.

The subjects used in this experiment were 100 paid 
University of Oregon students, who were solicited 
through the campus newspaper during the fall of 
1967 and the winter of 1968. The newspaper advertisement
specifically asked for hawks and doves on
the Vietnam issue, and subjects were accepted in the
experiment until there were 50 of each type; the
terms hawk and dove were not further defined since
their use around campus was quite common, but the
newspaper advertisement did specify that some feeling
about the Vietnam war was necessary for participation
in the experiment. (If any completely
neutral subjects lied in order to participate in the 
experiment, their responses should simply obscure the 
hypothesized effect.) 

The subjects were presented with a questionnaire 
consisting of 135 pairs of the items displayed above. 
There are 153 possible pairs, but for reasons that 
were clearer at the time than they are now, 18 pairs 
consisting of two hawk statements or two dove 
statements were omitted. There were two forms of 
the questionnaire; these forms had the same random 
order of the 135 pairs, but the order of the two
statements within each pair was different on the
two forms. The subjects were given the following 
instructions: 


This questionnaire is designed to measure how
extreme various statements about the war in
Vietnam appear to be. There are 135 parts to this
questionnaire, and in each part you will be presented
with two statements. Your job is to tell us
which of the statements you judge to be more
extreme.

For example, listed below are two statements
about the war in Vietnam. Place a check in front
of the more extreme statement.

Example

___ A. Any and all military means, including
nuclear weapons, should be used to stop the
threat of Communism in Vietnam.

___ B. The war in Vietnam is a costly but necessary
expense for the United States.

Given the above pair, most people would judge
Statement A to be more extreme.

In the above example, both statements favored
fighting in Vietnam, and hence the choice of which
statement was more extreme was reduced to a
choice of which statement was more prowar. In
the pairs that follow, however, sometimes one
statement will be prowar and one statement will
be antiwar. Judgments about these pairs may be
more difficult, since what must be judged is
whether the pro statement is more pro than the
anti statement is anti, or vice versa; that is, extremity
per se must be judged. Please place a check
in front of the more extreme statement in every
pair. Even if at first glance the two statements
seem to be about equally extreme, place a check
in front of the one that appears to be the more
extreme to you.

4 A more detailed account of the item selection
procedure is available on request from the authors.
The procedure is omitted here because of the high
face validity of the result.

HAWK STATEMENTS
Extreme
E1. Only fools and cowards can fail to see that we must defeat the Communists in Vietnam.
E2. Any and all means are justified in blocking the expansion of Communism in Vietnam.
E3. No American can in good conscience oppose our efforts in Vietnam; those who do so are, in a word, cowards.

Moderate
M1. Successful fulfillment of our commitment to Vietnam is necessary if other nations are to respect our national honor.
M2. As long as the Communists persist in their aggression against South Vietnam, America must resist that aggression.
M3. Peace can come to Vietnam only if United States military intervention is maintained.

Mild
m1. Although mistakes are being made by the United States in Vietnam, we seem to be following a reasonable course there.
m2. The advantages of our Vietnam policy somewhat outweigh its disadvantages.
m3. Our presence in Vietnam is a necessary evil.

DOVE STATEMENTS
Extreme
E1. United States intervention in Vietnam is ethically immoral, internationally illegal, and in violation of our own constitution. We should get out now.
E2. We should get out of Vietnam now, without concern for “face” or prestige, or any other of those nonsensical abstractions with which we justify our barbarity.
E3. Our involvement in Vietnam is the reflection of a power-mad obsession on the part of a few individuals in our government and is totally without justification.

Moderate
M1. The United States interference in Vietnam has done little good and much harm.
M2. There is very little justification for the lives lost in Vietnam.
M3. Our Vietnam policy is not well thought out, nor is it motivated by any real interest in the people of Vietnam.

Mild
m1. The American people should become more aware of the cost of our actions in Vietnam.
m2. The disadvantages of our Vietnam policy somewhat outweigh its advantages.
m3. The Vietnamese problem should be handled by some international body such as the U. N., not by the United States alone.
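
The pair counts given in the Method section can be verified by enumeration. The sketch below is illustrative only; the statement labels (HE1, Dm3, and so on) follow the convention of Tables 1 and 2, and the variable names are ours. It shows that 18 statements yield 153 possible pairs, of which 81 are bilateral (one hawk and one dove statement) and 72 unilateral, so that omitting 18 unilateral pairs leaves the 135 pairs of the questionnaire. Which 18 pairs were omitted is not stated and is not assumed here.

```python
from itertools import combinations

# Labels follow the tables: H/D for hawk/dove, then E/M/m for
# extreme/moderate/mild, then the order of the statement (1-3).
LEVELS = ["E", "M", "m"]
hawk = [f"H{level}{i}" for level in LEVELS for i in (1, 2, 3)]
dove = [f"D{level}{i}" for level in LEVELS for i in (1, 2, 3)]
statements = hawk + dove

all_pairs = list(combinations(statements, 2))

# A pair is bilateral when its members lie on opposite sides of the issue.
bilateral = [(a, b) for a, b in all_pairs if a[0] != b[0]]
unilateral = [(a, b) for a, b in all_pairs if a[0] == b[0]]

OMITTED_UNILATERAL = 18  # reported in the text; the specific pairs are unknown
presented = len(all_pairs) - OMITTED_UNILATERAL

print(len(all_pairs), len(bilateral), len(unilateral), presented)
```

Since every bilateral pair was retained, each subject made 81 hawk-versus-dove extremity judgments, which is the ceiling for the number of hawk statements a subject could choose as more extreme.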


Results 


The results are presented in Tables 1 and 2; 
the labels of the rows and columns of these 
tables refer to the statements displayed above, 
while the cell entries in the tables indicate 
the proportion of subjects who judged the 
column statement to be more extreme than 
the row statement. Table 1 presents the re- 
sults for the dove subjects, while Table 2 
presents the results for the hawk subjects. 
TABLE 1


EXTREMITY JUDGMENTS: PAIRED COMPARISONS—DOVES 


Statement | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18
1. Dmi 71 65.78. 842 96: == 586 IA 100: S O 0E 100 100 100 
2. Dm3 51. 82 69 — 76 71 188 94 86 96 94 100 — 100 100 100 
3. Dm2 16: 18 8S BON a NOG Oe SON 0S te ISIB 100 98 100 
4. Hm1 BR 60.7 58605, 1S) be 8 04 O 94) 1).8B 91198, ES 
5. Hm2 ot FSB SS A BA ESOS Fa tO E a OF wn Ol. hoo eater: 
6. DM3 58 47 «(58 70 78 84 76 78 — 96 98 100 
7. DM1 63 66 72 — 82 90 84 98 98 100 100 
8. DM2 70 76 76 78 — 82 94 96 96 100 
9. Hm3 50 62 — 62 67 78 — 990 98 
10. HM1 53. OU Ge SO S TA 19K se OO 
11. DE1 53 SO S288) OSETE: Od 
12. HM3 ST 100 2 A = =) Fe BB EES 
13. DE2 ME ated Wes Sse 
14, HM2 60 718 sda 
15. DE3 62 60 80
16. HE3 65 80
17. HE1 63
18. HE2

Note. Entries indicate the proportion of people who chose the column statements as more extreme than the row statements.
First letter refers to whether statement is hawk or dove, second letter refers to the range (extreme, moderate, etc.), number refers
to order of statement (see display of statements in text). For description of boldface entry, see text.

The statements are ranked in these tables
according to the principle of stochastic dominance.
That is, one statement is ranked above
the other if it is judged to be more extreme
than the other by 50% or more of the subjects.
With a single exception in each table
(the boldface entries) this stochastic dominance
is transitive; each of these exceptions is
due to one subject. Thus, the ranking of the
statements in the two tables is a good representation
of how extreme the statements are
judged to be by the two groups. This ranking
is consistent with the hypothesized contrast
effect; the doves have a tendency to judge
hawk statements as being more extreme, while
the hawks have a tendency to judge dove

TABLE 2 


EXTREMITY JUDGMENTS: PAIRED COMPARISONS: -HAWKS 


Statement | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18
ng ga 64 90. 80 BA 92 86 8B 192-496 
2 Hind TS Oo a o 74. 90. 88 SO ROO oo e 
3. Dml Ota 66 16 T2 18. SAI Se eg ee 
4. Dm3 O 72 T2 TA 84 82 8G BA T R 8G 
5. Dm2 oo sgo oao 48 84 San E 
6. HM1 7) 58 54 76.72 720 82 — 90 86 88,86 
7. HM2 Z e2 62 6h Ou OZ rahe) =" B00 86) 86 
8. Hm3 72 72 72 = 8 78 86 BA 8B 
9, HM3 6472 66. e829 88. 184 190 
10. DM3 54 56,6664 62> 70) 82 ce 
11. DM2 58 62 66 72 — 86 86 
12. DM1 SL. 550068.) 82 = = 04 
13. HE3 56 54 64 60 74 
14. HE1 34 68 60 82 
15. HE2 32 50 55 
16. DE2 62 80 
17. DE1
18. DE3


Note. Entries indicate the proportion of people who chose the column statements as more extreme than the row statements.
First letter refers to whether statement is hawk or dove, second letter refers to the range (extreme, moderate, etc.), number refers
to order of statement (see display of statements in text). For description of boldface entry, see text.

[Figure 2, panel a, Dove judgments; panel b, Hawk judgments. Each panel shows the scale positions of the 18 statements, running from the dove statements to the hawk statements. The dotted line indicates the statements judged to be most neutral.]

FIG. 2. Comparative judgment representation of dove and hawk judgments.


statements as being more extreme; in fact, 
with the exception of Hm3 in Table 2, there 
is a perfect ordering when each category of 
statements is considered separately; that is, 
ranking by stochastic dominance, the hawks 
ranked all of the dove extreme statements 
as more extreme than any of the hawk ex- 
treme statements, while the doves ranked all 
the hawk extreme statements as more extreme 
than any of the dove extreme statements, and 
so on. (It should be noted in passing that the 
original categorization of the statements is 
replicated; that is, again with the exception 
of Hm3, the extreme hawk statements are 
ranked above the moderate hawk statements, 
which in turn are ranked above the mild hawk 
statements; a perfect ordering is found for 
the dove statements.)
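
The ranking by stochastic dominance and the check for transitivity can be sketched as follows. This is a minimal illustration, not the authors' procedure: the four statement labels and all proportions are invented for the example and are not taken from Tables 1 and 2, and the function names are ours. As in the tables, an entry keyed by (row, column) is the proportion of subjects who judged the column statement to be more extreme than the row statement.

```python
from itertools import permutations

# Hypothetical proportions for four statements (illustrative values only).
items = ["Dm1", "Dm3", "Hm1", "HE2"]
upper = {
    ("Dm1", "Dm3"): 0.70, ("Dm1", "Hm1"): 0.80, ("Dm1", "HE2"): 0.95,
    ("Dm3", "Hm1"): 0.75, ("Dm3", "HE2"): 0.90,
    ("Hm1", "HE2"): 0.85,
}

# Fill in the lower triangle: every judgment is a forced choice.
prop = dict(upper)
for (row, col), p in upper.items():
    prop[(col, row)] = 1.0 - p


def dominates(a, b):
    """True if a is judged more extreme than b by 50% or more of the judges."""
    return prop[(b, a)] >= 0.5


def rank_by_dominance(labels):
    """Order statements from least to most extreme by how many others they dominate."""
    wins = {a: sum(dominates(a, b) for b in labels if b != a) for a in labels}
    return sorted(labels, key=lambda a: wins[a])


def intransitive_triples(labels):
    """Triples (a, b, c) where a dominates b and b dominates c, but a does not dominate c."""
    return [
        (a, b, c)
        for a, b, c in permutations(labels, 3)
        if dominates(a, b) and dominates(b, c) and not dominates(a, c)
    ]


ranking = rank_by_dominance(items)
violations = intransitive_triples(items)
print(ranking, violations)
```

An empty list of violations corresponds to the case in which stochastic dominance is transitive, so that the ranking is a faithful summary of the paired comparisons.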

The statements were scaled according to 
the “double law of comparative judgment” 
(Coombs, Greenberg, & Zinnes, 1961). This 
procedure converts the probability with which 
one statement is judged to be more extreme 
than another into a distance between the ex- 
tremity of the two statements, which in turn 
is converted to a position on the pro—anti 
continuum; where there were missing com- 
parisons, the incomplete data procedure out- 
lined in Torgerson (1958, pp. 173-176) was 
employed.⁵
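
The general logic of converting choice proportions into scale positions can be illustrated with a simplified stand-in. The sketch below implements the ordinary Thurstone Case V solution for complete data, not the double law of comparative judgment used in the study, and it does not handle missing comparisons. The three statements and their proportions are hypothetical, and the function name and the clipping constant are our own choices. As in the study's scaling, the zero point is identified with the statement scaled as least extreme.

```python
from statistics import NormalDist, mean

# Hypothetical proportions: prop[(row, col)] is the proportion of judges
# who chose the column statement as more extreme than the row statement.
items = ["A", "B", "C"]
prop = {
    ("A", "B"): 0.70, ("A", "C"): 0.90, ("B", "C"): 0.75,
    ("B", "A"): 0.30, ("C", "A"): 0.10, ("C", "B"): 0.25,
}


def case_v_scale(labels, proportions, eps=0.01):
    """Thurstone Case V: average the normal deviates in each column."""
    normal = NormalDist()
    scale = {}
    for col in labels:
        deviates = []
        for row in labels:
            p = 0.5 if row == col else proportions[(row, col)]
            # Clip proportions of 0 or 1, whose normal deviates are infinite.
            p = min(max(p, eps), 1.0 - eps)
            deviates.append(normal.inv_cdf(p))
        scale[col] = mean(deviates)
    # Anchor the scale at the least extreme statement.
    lowest = min(scale.values())
    return {label: value - lowest for label, value in scale.items()}


scale = case_v_scale(items, prop)
for label in items:
    print(label, round(scale[label], 2))
```

Larger proportions translate into larger distances between statements, which is the sense in which the probability of one statement being judged more extreme than another is converted into a distance.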

5 It was necessary to make a number of assumptions
in order to apply the procedure. First, it was
assumed that all judgments of which of two dove
statements or which of two hawk statements was
more extreme were unilateral judgments, whereas
the judgments comparing a hawk with a dove statement
were bilateral. Second, the distribution of the
concept “neutrality” was assumed to have a variance
equal to that of the distributions of the statements,
which, in turn, were all assumed to have equal
variance and to be uncorrelated. Finally, the absolute
zero point (neutrality) was identified with the
statement scaled as least extreme.

The results of this scaling procedure are
presented in Figure 2. Figure 2a illustrates
the scale positions of the statements based
on the doves’ judgments, while Figure 2b
illustrates these positions based on the hawks’
judgments. As is compatible with the earlier
analyses, the hawk statements are more extreme
than the dove statements on the dove
scale, and vice versa.

A statistical test of the contrast effect was
devised by noting the number of times hawk
statements were judged to be more extreme
than dove statements, irrespective of which
statements were being considered. The contrast
effect predicts that this number should
be greater for doves than for hawks.

The mean number of hawk statements
chosen as more extreme by the hawks was
32.94, while the mean number of hawk statements
chosen as more extreme by the doves
was 58.64. A t test evaluating the difference
between these two means yielded a value of
-10.01, which is significant well beyond the
.001 level (df = 98).
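
The reported t test can be examined with a short sketch. The text does not say which form of the test was used; the code assumes the ordinary pooled-variance test for two independent groups, which is consistent with df = 98 for two groups of 50. The individual scores are not reported, so the first function is demonstrated on toy data, and the second merely works backward from the reported means and t value to the pooled standard deviation they imply (about 12.8 under this assumption). Both function names are ours.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(a, b):
    """Pooled-variance t statistic and degrees of freedom for two independent samples."""
    n1, n2 = len(a), len(b)
    pooled_var = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / (n1 + n2 - 2)
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean(a) - mean(b)) / standard_error, n1 + n2 - 2


def implied_pooled_sd(mean_a, mean_b, t, n1, n2):
    """Pooled standard deviation implied by two means and a pooled t value."""
    return (mean_a - mean_b) / (t * sqrt(1 / n1 + 1 / n2))


# Toy data, used only to show the function at work.
t_demo, df_demo = pooled_t([1, 2, 3], [2, 3, 4])

# Reported values: hawks' mean 32.94, doves' mean 58.64, t = -10.01, 50 per group.
sd_implied = implied_pooled_sd(32.94, 58.64, -10.01, 50, 50)

print(round(t_demo, 3), df_demo, round(sd_implied, 2))
```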
Discussion


Extremity, of course, makes sense only with 
respect to what is considered neutral. One 
possible interpretation of the results is that 
hawks and doves have a different idea of what 
represents a neutral position toward the war 
in Vietnam. Upon close inspection, this inter- 
pretation is seen not to be an alternative to 
the interpretation that hawks and doves have 
different ideas about extremity. That is, to 
say that two people judge neutrality differently 
is equivalent to saying that they judge ex- 
tremity differently; extremity means, by defi- 
nition, distance from neutrality, and what the 
present data indicate is that a given state- 
ment’s distance is judged differently by doves 
and hawks, in a direction predicted by the 
contrast effect. 

The interpretation of the finding in terms 
of neutrality is compatible with adaptation- 
level theory (Helson, 1964). Granted such 
factors as selective exposure and attention, 
it is reasonable to suppose that a pro has
more contact with pro attitudes and an anti
with anti attitudes, which would result in the
pro’s subjective neutral point being more pro
and the anti’s more anti. The results reported
in Tables 1 and 2 and Figure 2 would thus 
follow. 

Even if the contrast effect cannot be explained
purely in terms of differences in categorical
judgment, however, it may be specific
to judgments of statements’ positions on a
pro-anti continuum. If it were, it would be of
restricted interest, except to investigators
concerned with semantic judgment and labeling.
If, on the other hand, it can be shown
to represent a genuine distortion of the position
of people holding opposing attitudes,
it is of more general social-psychological
interest, and it has the implication for intergroup
communication and conflict discussed
at the beginning of this paper.


EXPERIMENT II


To evaluate whether the contrast effect is 
specific to verbal labeling, the usual procedure 
for studying it was reversed. That is, rather 
than having subjects categorize or compare 
opinion statements constructed by the experi- 
menter, the subjects were asked to generate 
the opinion statements themselves. Specifically,
subjects were asked to construct opinion
statements that they thought would be en- 
dorsed by the “typical” hawk or dove on 
campus. If the contrast effect were purely a 
labeling phenomenon, the statements so gen- 
erated would be neither systematically more 
extreme nor less extreme as a function of the 
attitude of the writer. If, however, the con- 
trast effect is due to genuine social distortion, 
subjects writing statements to be endorsed by 
people holding opposing views should write 
more extreme statements than those they 
write when they are writing statements to be 
endorsed by people with whom they agree. 
Or, finally, there is the possibility that they 
write less extreme statements; for, if a sub- 
ject is to be consistent with his judgment 
that statements with which he disagrees express
extreme attitudes, he should write very
mild statements if he wishes to write statements
that are, for him, moderate. These
three possibilities can be stated as conflicting 
hypotheses. 

1. The semantic hypothesis. The contrast 
effect is specific to categorizing or comparing 
statements on a pro-anti continuum. If this 
hypothesis is correct, statements written to be 
endorsed by the typical pro or anti individual 
should not vary systematically as a function 
of the writer’s own attitude. 


2. The person displacement hypothesis. The
contrast effect occurs because individuals
exaggerate the differences between their own 
attitudes and the attitudes of those with 
whom they disagree. It follows that an indi- 
vidual attempting to write a statement to be 
endorsed by the typical person holding an 
opposing attitude will write a more extreme 
statement than he will when he attempts to 
write a statement to be endorsed by the 
typical person with whom he agrees. 

3. The statement displacement hypothesis.
As indicated by Experiment I, people who 
disagree with a statement tend to regard it as 
more extreme than do people who agree with 
it. It follows that if people do not disagree 
about the extremity of their own and others’ 
positions, but only exaggerate the extremity 
of statements, then a person writing a state- 
ment that is meant to characterize the opinion
of someone with whom he disagrees should
write a statement that would sound rather 
moderate to others (e.g., since hawk state- 
ments seem more extreme to doves than to 
hawks, the dove wishing to produce what 
he regards as a moderate statement should 
generate one that is less extreme than one 
that would be generated by a hawk; if a 
person sees a statement as extreme, he is less 
likely to attribute it to another person than 
is someone who sees it as moderate). 

One way to evaluate “the extremity” of the 
statements an individual writes is to ask peo- 
ple whether they will endorse these state- 
ments. If either the person displacement 
hypothesis or the statement displacement 
hypothesis is correct, the individual will write 
statements that are not in fact endorsed by 
people holding opposing views. If the person 
displacement hypothesis is correct, the state- 
ments not endorsed will be rejected because 
they are too extreme, whereas if the statement 
displacement hypothesis is correct, they will 
be rejected because they are too mild. 


Method 


Again the issue studied was the war in Vietnam. 
Twenty hawks and 20 doves, responding to a news- 
paper advertisement similar to that described in 
Experiment I, were asked to write four statements 
that they thought would be endorsed by the typical 
hawk on campus and four statements that they 
thought would be endorsed by the typical dove on 
campus. The actual instructions read as follows: 


We are planning to engage in research concern- 
ing attitudes toward the Vietnamese war. We are 
going to use college students in this research, and 
we will be asking them to endorse or reject the 
sorts of statements that college students typically 
make about the war. We are asking your help in 
obtaining such statements. Please write four state-
ments that you think would be endorsed by the
typical hawk on campus. . . . Now, please write
four statements that you think would be endorsed 
by the typical dove on campus. 


Questionnaires were constructed from the state- 
ments that the subjects wrote. Two of the statements 
that the subject thought would be endorsed by the 
typical hawk and 2 he thought would be endorsed 
by the typical dove were randomly selected from 
each subject’s statements. The resulting pool con- 
sisted of 40 statements written by doves (hence- 
forth referred to as dove-written statements) for 
endorsement by the typical dove, 40 statements 
written by hawks (hawk-written statements) for
endorsement by the typical dove, 40 dove-written
statements for endorsement by the typical hawk, and
40 hawk-written statements for endorsement by the 
typical hawk. Two questionnaires were formed from 
these statements, the first consisting of the 80 state- 
ments written for endorsement by the typical hawk, 
and the second consisting of the 80 statements written 
for endorsement by the typical dove; these state- 
ments were ordered randomly on the questionnaires. 

Twenty-five hawks and 25 doves were recruited in 
the usual manner.6 The hawks were presented with 
the questionnaire made up of statements to be en- 
dorsed by the typical hawk, and the doves were 
presented with the questionnaire constructed of state- 
ments to be endorsed by the typical dove. The sub- 
jects were asked to endorse or reject each statement, 
and, if they rejected the statement, to state whether 
their reason for doing so was that the statement was 
too strongly worded or too mildly worded. The 
actual instructions read as follows: 


Listed below are 80 statements which various
people have made about the Vietnam war. Your
task is to place a check (✓) in front of the state-
ments with which you agree. If you do not agree
with the statement because you feel it is too
strongly worded, place an S in front of the state-
ment. If you do not agree with a statement be-
cause you feel that it is too mildly worded, place
an M in front of that statement. If you do not
agree with the statement for any other reason,
i.e., you feel that it is too ambiguous, you don’t
like the way it is worded, etc., place an X in
front of the statement; then please go back and
try to determine whether the general attitude ex-
pressed by the statement is too strong or too
mild; if it is, place an S or an M, respec-
tively, in parentheses after your X mark; if not,
place a ✓ in parentheses.


Results 


The number of hawk-written and dove- 
written statements rejected by each subject 
and the reason for rejection were tabulated. 
The semantic hypothesis predicts that there 
would be no difference in the number of hawk- 
written statements and dove-written statements
rejected by hawks and doves. Both the
person displacement hypothesis and the state- 
ment displacement hypothesis predict that the 
hawks should reject more dove-written state- 
ments and the doves more hawk-written state- 
ments, and, equally important, they predict 
opposite reasons for rejection. According to 
the person displacement hypothesis, the pre- 
dominant reason should be that the statements 
are too extreme; according to the statement
displacement hypothesis, the predominant
reason should be that they are too mild.


6 Great care was taken so that no subject partici-
pated in more than one phase of this experiment.


Table 3 presents the mean number of dove-
written and hawk-written statements rejected
by doves and hawks together with the reason
for rejection. The pattern of the means sup-
ports the person displacement hypothesis.
That is, doves reject more hawk-written state-
ments, and the hawks reject more dove-written
statements; further, the reason for the vast
majority of rejections is that the statements
are too strong.


TABLE 3
DOVE AND HAWK REJECTION OF HAWK-WRITTEN AND
DOVE-WRITTEN STATEMENTS AND
REASON FOR REJECTION

                         Reason
Statement          Too strong   Too mild
Dove rejection
  Dove written         7.84       3.00
  Hawk written         9.32       3.36
Hawk rejection
  Dove written        15.68       1.44
  Hawk written        11.40       2.00


In order to simplify the statistical analysis,
the raw data of this study were converted to
difference scores. That is, a 2 × 2 analysis of
variance was conducted based on the follow-
ing independent variables: endorser’s atti-
tude (hawk versus dove) and reason for
rejection (too strong versus too mild). Within
each of the resulting four cells (see Table 4),
the observations for each respondent (en-
dorser) consisted of the number of hawk-
written statements he rejected minus the num-
ber of dove-written statements he rejected.
In this analysis, the hypothesis predicting that
hawks should reject more dove-written state-
ments and doves more hawk-written state-
ments would emerge as a main effect, with the
prediction being that the difference scores for
hawks would be lower than the difference
scores for doves. The person displacement
hypothesis predicts an interaction effect as
well; the difference scores will be higher for
doves and lower for hawks when the reason
for rejection is that the statements are too
extreme and vice versa, whereas the statement
displacement hypothesis makes the reverse
prediction.


TABLE 4
NUMBER OF HAWK-WRITTEN STATEMENTS REJECTED
MINUS NUMBER OF DOVE-WRITTEN STATEMENTS
REJECTED: PATTERN OF MEANS TABLE

                Reason
Group     Too strong   Too mild
Doves         1.48        .36
Hawks        -4.28        .56


TABLE 5
ANALYSIS OF VARIANCE TABLE

Source of variation      SS      df      MS        F
Between subjects       600.41    49
  Groups (A)           193.21     1    193.21    22.78*
  Error                407.20    48      8.48
Within subjects        708.50    50
  Reasons (B)           86.49     1     86.49    10.38*
  A × B                222.01     1    222.01    26.65*
  Error                400.00    48      8.33

* p < .01.


The means of the difference scores are pre-
sented in Table 4; as follows from Table 3,
the direction of these means supports the
person displacement hypothesis. First, the
overall means are higher for doves than for
hawks (i.e., relative to the number of dove-
written statements rejected, doves rejected
more hawk-written statements than did
hawks), and the effect is much stronger when
the reason given for rejection was that the
statement was too strong than when the
reason given was that the statement was
too mild.
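
The conversion to difference scores can be checked at the level of the cell means. The following is only a sketch: it works on the published means of Table 3 rather than on the raw per-subject data (which are not available here), and the names `TABLE_3` and `difference_scores` are illustrative, not the authors'.

```python
# Cell means copied from Table 3, keyed by
# (endorser group, writer group, reason for rejection).
TABLE_3 = {
    ("dove", "dove", "too strong"): 7.84,
    ("dove", "dove", "too mild"): 3.00,
    ("dove", "hawk", "too strong"): 9.32,
    ("dove", "hawk", "too mild"): 3.36,
    ("hawk", "dove", "too strong"): 15.68,
    ("hawk", "dove", "too mild"): 1.44,
    ("hawk", "hawk", "too strong"): 11.40,
    ("hawk", "hawk", "too mild"): 2.00,
}


def difference_scores(means):
    """Hawk-written minus dove-written mean rejections for each cell."""
    out = {}
    for endorser in ("dove", "hawk"):
        for reason in ("too strong", "too mild"):
            diff = means[(endorser, "hawk", reason)] - means[(endorser, "dove", reason)]
            out[(endorser, reason)] = round(diff, 2)
    return out


TABLE_4 = difference_scores(TABLE_3)
for cell, value in TABLE_4.items():
    print(cell, value)
```

Because a mean of differences equals the difference of means, the four values obtained this way should coincide with the entries of Table 4.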
An analysis of variance is summarized in 
Table 5. The predicted difference between 
the groups and the interaction between groups 
and their reasons for rejecting statements are
both highly significant.
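
As a rough arithmetic check on Table 5, each F ratio can be recomputed from the sums of squares and degrees of freedom. The sketch below assumes that the between-subjects error term is the between-subjects sum of squares minus the Groups (A) sum of squares; the function name `f_ratio` is illustrative.

```python
def f_ratio(ss_effect, df_effect, ss_error, df_error):
    """Return F = MS_effect / MS_error for one effect."""
    return (ss_effect / df_effect) / (ss_error / df_error)


# Between-subjects stratum: error is what remains after removing Groups (A).
ss_between, df_between = 600.41, 49
ss_groups, df_groups = 193.21, 1
ss_error_between = ss_between - ss_groups
df_error_between = df_between - df_groups

# Within-subjects stratum, values as printed in Table 5.
ss_reasons, ss_interaction = 86.49, 222.01
ss_error_within, df_error_within = 400.00, 48

f_groups = f_ratio(ss_groups, df_groups, ss_error_between, df_error_between)
f_reasons = f_ratio(ss_reasons, 1, ss_error_within, df_error_within)
f_interaction = f_ratio(ss_interaction, 1, ss_error_within, df_error_within)

print(round(f_groups, 2), round(f_reasons, 2), round(f_interaction, 2))
```

The recomputed ratios should agree with the printed F values to within about .01; a discrepancy of that size is what one would expect if the printed values were obtained from mean squares already rounded to two decimals.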
Note that the reasons factor is also significant;
across both groups there was a tendency
to reject dove-written statements when the
reason was that the statements were too extreme
and hawk-written statements when the
reason was that the statements were too mild.7


7 A speculation: since the time of this study, the
prevailing mood in the country seems to have
switched from one of hawkishness to one of dovish-
ness; perhaps this interaction reflects a greater com-
mitment or emotional involvement on the part of
the doves than on the part of the hawks.


Discussion


The results of this experiment are incon- 
sistent with the semantic hypothesis and the 
statement displacement hypothesis; they are 
consistent with the person displacement hy- 
pothesis. This person displacement interpreta- 
tion is particularly striking when viewed in 
conjunction with the results of Experiment I, 
for not only do subjects write statements that 
are too extreme to be endorsed by people 
with opposing attitudes (Experiment II), but, 
in addition, subjects tend to judge such 
opposing statements to be extreme (Experi- 
ment I). (The hawk who refuses to endorse 
a dove-written statement because he judges it 
to be too extreme is refusing to endorse a 
statement that may be seen as even more 
extreme by the dove who wrote it than by 
himself.) 

Further checks could, of course, be run; 
for example, it would be possible to present 
judges with hawk-written statements meant 
to be endorsed by the typical hawk and dove- 
written statements meant to be endorsed by 
the typical hawk and ask them to judge which 
items are more extreme. Such a procedure was 
deemed unnecessary. The differences are so 
clear-cut that it was decided to explore the 
possibility of classifying people on the basis 
of their statement writing. 


EXPERIMENT III


Selltiz, Edrich, and Cook (1965) have sug- 
gested that differences in categorical ratings 
can lead to an indirect measure of the rater’s 
own attitude. Since, in general, an item is 
given a more extreme rating by a person who 
disagrees with it than by a person who agrees 
with it, the more extreme the rating given to 
an item, the higher the probability that the 
rater disagrees with it. These authors collected 
evidence that supported their suggestion, but 
the effect appears to have been too slight for 
the raters in their study to be categorized 


with much accuracy.8


8 This is our judgment derived from examining 
the data presented by the authors; they themselves 
did not treat rating as the independent variable and 
the rater’s attitude as the dependent variable, but 
adopted the more traditional approach of their 
predecessors of treating the rater’s attitude as the
independent variable and the rating as the dependent

In Experiment II, which supported a perceptual
interpretation of the contrast effect,
subjects generated more extreme statements 
when writing for endorsement by the typical 
person with whom they disagreed than when 
writing for endorsement by the typical person 
with whom they agreed. The purpose of this 
experiment was to determine how well state- 
ment writing could be used to classify atti- 
tudes of writers, more specifically, how well 
psychologists could classify the hawks and 
doves of Experiment II on the basis of the 
attitude statements they wrote. 


Method 


Six PhD psychologists from the Oregon Research 
Institute (one of whom was a visitor from the 
Netherlands) and three graduate research assistants 
in psychology served as judges. They were pre- 
sented with the eight statements generated by each 
of the writers; the four statements written for en- 
dorsement by the typical hawk were typed on the 
left side of a large index card, and those for endorse- 
ment by the typical dove were typed on the right 
side. Each judge was asked to separate the 40 cards 
into two piles of 20 cards each—one pile for the 
statements written by the hawk subjects and one for 
the statements written by the dove subjects. The 
basic principle for sorting was explained as follows. 


If, in fact, people have a tendency to exaggerate 
the differences between themselves and those with 
whom they disagree, then when they are asked to 
write items typical of those with whom they dis- 
agree, they should write relatively extreme items; 
when they are asked to write items typical of 
those with whom they agree, they should write 
less extreme items. Hence, these items could be 
used to assess their attitudes. 


Results 


All nine judges were able to classify the 
statement writers as hawks or doves with 
over 50% accuracy (p < .01, by sign test).
The average percentage of correct classifica- 
tion was 65%. 
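
The sign test reported here can be reproduced with an exact binomial calculation. This is a minimal sketch under the assumption of a 50-50 null hypothesis; the text does not say whether a one-tailed or two-tailed test was used, so both are computed, and `sign_test_p` is a hypothetical helper name.

```python
from math import comb


def sign_test_p(successes, n, two_sided=True):
    """Exact sign-test p value: chance of a split at least this lopsided."""
    k = max(successes, n - successes)
    tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * tail) if two_sided else tail


# Nine of nine judges exceeded 50% accuracy in Experiment III.
p_two_sided = sign_test_p(9, 9)
p_one_sided = sign_test_p(9, 9, two_sided=False)
print(p_two_sided, p_one_sided)
```

With all nine judges above chance, both versions fall below the .01 level cited in the text.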

The consensus judgments were 78% accu- 
rate, as indicated in Table 6, which presents 
the contingency between consensus judgment 
and actuality (χ² = 10.02, p < .01).9


one; thus, while their study constitutes a replication 
of earlier findings, it is not certain how well the 
ratings they obtained could have been used to classify 
the attitude of the rater. 

9 All statistical tests reported include a “correction 
for continuity.” 
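
For readers who want to see what the correction for continuity does, the following sketch applies the usual Yates-corrected chi-square formula for a 2 × 2 table to the Table 6 counts. Two assumptions are made: that the authors used this standard formula, and that the consensus-dove/actual-hawk cell of Table 6 is 5 (the only value consistent with 20 hawk writers). The function names are illustrative.

```python
def chi_square_yates(a, b, c, d):
    """Chi-square for the 2x2 table [[a, b], [c, d]] with continuity correction."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


def consensus_accuracy(a, b, c, d):
    """Proportion of writers on the diagonal (correctly classified)."""
    return (a + d) / (a + b + c + d)


# Rows: consensus judgment (hawk, dove); columns: actual (hawk, dove).
chi2_table6 = chi_square_yates(15, 4, 5, 16)
acc_table6 = consensus_accuracy(15, 4, 5, 16)
print(chi2_table6, acc_table6)
```

Under these assumptions the statistic lands within about .01 of the reported 10.02, and the diagonal proportion corresponds to the 78% consensus accuracy given in the text.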


TABLE 6

                      Actual classification
Consensus judgment       Hawk      Dove
Hawk                      15         4
Dove                       5        16


Discussion


The judges were accurate. Analyzing the
results across statement writers and within
judges, we find that all of the judges are more
accurate than chance expectation. Analyzing
the results within statement writers and
across judges, we find that a significant pro-
portion of the statement writers are catego-
rized accurately by consensus judgment.


EXPERIMENT IV


The purpose of Experiment IV was to repli-
cate the results of Experiment III using the
statements written by Wallace supporters and
the supporters of the write-in protest. The
procedure was almost identical to that of
Experiment III, mutatis mutandis, with the
exception that two different sets of statements
were used. Just prior to the 1968 elections, 30
supporters of the write-in protest and 15
Wallace supporters were recruited for state-
ment writing in a manner similar to the
hawk-dove procedure. Statement Set 1 consisted of
the statements written by the 15 Wallace sup-
porters and statements written by a randomly
selected group of 15 of the supporters of the
write-in protest. Set 2 consisted of the state-
ments written by the same 15 Wallace sup-
porters and those written by the remaining 15
supporters of the write-in protest. A tenth
judge, a terminal graduate student in psy-
chology, was used so that five judges sorted
each set of statements.


Results

All 10 individual judges were able to clas-
sify the writers as Wallace supporters or sup-
porters of the write-in protest with over
50% accuracy (p < .01 by sign test). The
average percent accuracy was 77%.

The consensus judgments were 87% accu-
rate, as indicated in Table 7, which presents
the contingency between the consensus judgment
and the actual category to which the
writer belonged (χ² = 19.01, p < .001).


TABLE 7

                                        Actual classification
                                      Wallace      Supporters of the
Consensus judgment                   supporters    write-in protest
Wallace supporters                       12               3
Supporters of the write-in protest        3              27
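
The difference between individual accuracy and consensus accuracy reported in Experiments III and IV can be illustrated with a small sketch. The data below are entirely hypothetical (three judges, six writers) and are not taken from the study; the sketch also ignores the constraint that judges sorted the cards into piles of fixed size. It assumes that "consensus" means a simple majority of judges.

```python
from collections import Counter


def judge_accuracy(judgments, truth):
    """Proportion of writers each judge classified correctly."""
    return [sum(g == t for g, t in zip(row, truth)) / len(truth) for row in judgments]


def consensus(judgments):
    """Majority label for each writer; an odd number of judges avoids ties."""
    return [Counter(column).most_common(1)[0][0] for column in zip(*judgments)]


# Hypothetical example: one row per judge, one column per writer.
truth = ["hawk", "hawk", "dove", "dove", "dove", "hawk"]
judgments = [
    ["hawk", "dove", "dove", "dove", "hawk", "hawk"],
    ["hawk", "hawk", "dove", "hawk", "dove", "hawk"],
    ["dove", "hawk", "dove", "dove", "dove", "dove"],
]

per_judge = judge_accuracy(judgments, truth)
pooled = consensus(judgments)
pooled_accuracy = sum(p == t for p, t in zip(pooled, truth)) / len(truth)
print(per_judge, pooled, pooled_accuracy)
```

In this made-up example no single judge is perfect, yet the majority vote is, because the judges' errors fall on different writers. The same mechanism would explain why consensus accuracy in the experiments exceeds the average accuracy of the individual judges.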


Discussion


The improvement in accuracy in Experi-
ment IV as compared to Experiment III is
probably due to the fact that the attitudinal
differences between Wallace supporters and
supporters of the write-in protest were greater
than those between hawks and doves.


EXPERIMENT V 


In Experiments III and IV, the judges were 
told about the contrast effect. The purpose 
of this experiment was to determine whether 
naive judges could categorize writers with
above-chance accuracy, and whether accuracy
of categorization is greater when judges are
informed of the contrast effect than when
they are not. The statements written by the
Wallace supporters and the supporters of the
write-in protest were used. Judges were di-
vided into two groups, one of which was in- 
formed about the contrast effect and one of 
which was not. It seemed reasonable to hy- 
pothesize that judges would do better when 
told of the effect. 


Method 


Judges in this study were drawn from two dif- 
ferent populations. Sixty-two were students at the 
University of Oregon during the summer of 1970 
and were paid for their participation. The other 
group of judges were drawn from a slightly more 
sophisticated population; it consisted of 19 employees 
of the Oregon Research Institute—including secre- 
taries, research assistants in fields other than psy- 
chology, computer programmers and operators, and 
various other employees with the exception of pro- 
fessional psychologists. 

The procedure used for both groups follows, 
although the data are treated separately. The judges 
were divided randomly into two groups; an informed 
group received roughly the same instructions as in
Experiment IV, while an uninformed group had the
explanation of the contrast effect omitted. In the
student sample, a randomly selected half of each
group was asked to sort the writers in Set 1, the
remainder in Set 2. In the Oregon Research Institute
sample, only Set 1 was used.


TABLE 8

                                        Actual classification
                                      Wallace      Supporters of the
Consensus judgment                   supporters    write-in protest
Wallace supporters                       12               6
Supporters of the write-in protest        3              24


TABLE 9

                                        Actual classification
                                      Wallace      Supporters of the
Consensus judgment                   supporters    write-in protest
Wallace supporters                       13               2
Supporters of the write-in protest        2              13


Results 


All but 5 of the 62 student judges were 
able to classify the writers as Wallace sup- 
porters or supporters of the write-in protest 
with over 50% accuracy (p < .001, by sign 
test). The average accuracy was 64%. 

The consensus judgments were 80% accu- 
rate, as indicated in Table 8, which pre- 
sents the contingency between consensus judg- 
ment and actual classification (χ² = 12.60,
p < .001).10

The performance of the judges in the in- 
formed group was virtually identical to that 
of the judges in the uninformed group. Indi- 
vidual judges in both groups averaged 64% 
accuracy; the consensus accuracy of the in- 
formed group was 78%, while that of the 
uninformed group was 76%. 

Eighteen of the 19 Oregon Research Insti- 
tute judges were able to classify the writers 
as Wallace supporters or supporters of the 
write-in protest with over 50% accuracy 
(p< .001, by sign test). The average 
accuracy was 67%. 

The consensus judgments were 87% accu- 
rate, as indicated in Table 9, which pre- 
sents a contingency between consensus judg- 
ment and actual classification (χ² = 13.33,
p < .001). 

Again, however, the performance of the two
groups was virtually identical. Individual
judges in the informed group averaged 67%
correct, as did those in the uninformed group.
Further, the consensus judgment of the in-
formed group was 78%, while that of the
uninformed group was 70%.11


10 Since 62 student judges classified each of the
Wallace supporters, it was necessary to run an addi-
tional judge in order to obtain the odd number of
judges necessary to guarantee a consensus judgment.


Discussion 


The results do not support the hypothesis 
that awareness of the contrast effect increases 
the accuracy of judgment. Many subjects re- 
ported difficulty in performing the task, and 
accuracy was somewhat lower than that of 
psychologists in Experiment IV. On the aver- 
age, the psychologists were 77% correct, 
whereas the other 83 judges on the average 
were only 65% correct. The number of writers 
correctly classified by the psychologists and 
other judges differed significantly (t = 2.65,
p < .01). (A t test in this context actually
lacks power, since it fails to control for the
fact that the same writers were being classified
by the two sets of judges; a more power-
ful test, however, could only increase the 
significance level.) 

The judges in the uninformed group in this 
experiment were informed of the contrast ef- 
fect after they had completed their sorting. 
Many of these judges reported that they had 
used their own version of the contrast effect 
in making these judgments. Thus, a paradox 
arises: If people naively understand that 
people tend to exaggerate the differences be- 
tween themselves and those with whom they 
disagree, why do they continue to do it? 

11 Since the judges themselves were equally accu-
rate in each group, there is little reason to believe
that this consensus difference would be statistically
reliable were a larger sample collected.


At any rate, the success of the psycholo-
gists in classifying the 40 hawk and dove
statement writers and the 45 writers who
supported Wallace or the write-in protest is
quite impressive. The success of the other
two groups in classifying the latter group of 
45 subjects is somewhat less impressive, but 
is nevertheless well above chance expectation. 
The success rate of all our judges compares 
very favorably, for example, with the success 
of expert clinical psychologists in attempting 
to differentiate psychotic from neurotic pa- 
tients on the basis of their MMPI profiles 
(Goldberg, 1965); these judges also were at- 
tempting to differentiate two groups in a 
population with an approximately 50-50 
base-rate split. 

The subject whose attitude is to be as- 
sessed is asked to do something very straight- 
forward: to write, to the best of his ability, 
statements that would be endorsed by the 
typical person with whom he agrees and the 
typical person with whom he disagrees. The 
statements he writes are not only indicative
of his own attitude, but in contexts such as
those of the present studies, may be used to
categorize him with a high degree of accuracy.
Statement writing might, then, be an effective
means of indirectly assessing attitude (if future
studies show it works in other contexts), as
well as one that does not involve any serious
deception, or infringement on the privacy of
the person whose attitude is being assessed.


GENERAL DISCUSSION 


The conclusion from Experiments I and II 
is that the contrast effect is not a labeling 
phenomenon; subjects in the present experi- 
ments had a strong tendency to exaggerate 
the differences in attitude between themselves 
and the people with whom they disagree. 
Experiments III, IV, and V demonstrate that 
the differences are so clear-cut that it is even 
possible to classify people on the basis of 
their statement writing with a high degree 
of accuracy. 

There are many possible explanations for 
the contrast effect. Perhaps we have more 
difficulty than we are aware of in assimilating 
social information and hence “gate out” 
(Posner, 1964) incoming information we be- 
lieve to be irrelevant—that is, information 
indicating moderation or neutrality (see 
Manis, Gleason, & Dawes, 1966). This dif-
ficulty-based explanation is consistent with
the finding that while we may process huge 
amounts of information when unconsciously 
performing habitual tasks, our capacity for 
consciously processing information in new 
tasks is severely limited (see Fitts & Posner, 
1967), and forming impressions about atti- 
tudes is generally a conscious task; the dif- 
ficulty may further be exacerbated by the 
fact that we form many impressions about 
attitude on the basis of what we read, and 
we may be much worse at understanding what 
we read than we think we are (see Bartlett, 
1932; Dawes, 1964, 1966). In contrast to
the information-processing explanation, Manis
(1961) has proposed a motivational explana-
tion, which holds that pressure to change our
own attitudes is reduced by displacing the
attitude of the person with whom we disagree.

As noted at the beginning of this paper,
the contrast effect may be self-perpetuating,
functionally autonomous in Allport’s (19--)
terminology, because it may well lead to a
lowering of esteem for those holding opposing
views, and such lowered esteem may enhance
the effect. Moreover, it is detrimental to 
cooperation and compromise. 

A closely allied source of intergroup con- 
flict that may also be self-perpetuating is one
termed autistic hostility by Newcomb (1947).
Briefly, autistic hostility is the process of
avoiding contact with opposing groups and
hence avoiding information that may serve to
reduce intergroup conflict. To quote Newcomb,
Turner, and Converse (1965):


If the members of one group share hostile attitudes 
toward another group, they are quite likely to take
steps to reduce communication with that group. Such
behavior tends to maintain the hostility, because of
reduced opportunities for the kind of unlearning
that is necessary if the group norms are to prescribe
less rather than more hostility [p. 444].


Moreover, the contrast effect and autistic
hostility may even enhance each other. The
distortion involved in the contrast effect may
increase hostility, which may in turn reduce
communication, which may in turn exacerbate
selective exposure and attention to extreme
statements, which may in turn enhance the
contrast effect.

The present research indicates that the
contrast effect involves a distortion of reality,
rather than a particular use of verbal labels.
Further, it may be autonomous, and perhaps
even result in a self-amplifying conflict be-
tween groups. As has been pointed out by
Maruyama (1963), such positive feedback
phenomena (which he terms “deviation ampli-
fying”) may be found in a wide variety of
social and clinical situations. Further, sub-
systems that may in themselves involve nega-
tive feedback (the contrast effect may lead
to equilibrium within the individual) can be
embedded in larger systems that involve posi-
tive feedback (the resulting group conflict).
The research reported here suggests that the
contrast effect may be part of such a system.12


12 Since the time that this article was written, the 
basic effect has been replicated in two additional 
contexts. Results are reported in Dawes, MacPhil- 
lamy, and Singer (1971). 
AGGRESSION, DISPLACEMENT, AND GUILT 1


DAVID S. HOLMES 2


University of Kansas 


The experiment was carried out to determine whether displaced aggression 
resulted in more guilt than directly expressed aggression and whether frustrated 
subjects displaced less aggression than they would express directly toward the 
source of the frustration. High-frustration subjects were forced to wait for a 
late subject, while low-frustration subjects were not. In a pseudolearning ex- 
periment, half of the subjects in each frustration condition shocked the late 
subject, while the other half shocked an “innocent bystander.” Number of 
shocks given were the measures of direct and displaced aggression. Subjects 
were then asked how many shocks they were willing to accept. Number of 
shocks requested was the expiation measure of guilt. High frustration resulted 
in more aggression than low frustration, but there were no differences in the 
amounts of aggression expressed directly or displaced. High frustration aggres- 
sion was associated with less guilt than low frustration aggression, but displace- 
ment did not result in more guilt than directly expressed aggression. 


A large amount of research has been car- 
ried out on aggression and related concepts 
(Berkowitz, 1962; Buss, 1961). One impor- 
tant topic on which relatively little work has 
been done, however, is the degree of guilt 
which may be felt by an aggressor after 
expressing aggression. With regard to this 
problem, the findings of Brock and Buss 
(1964) indicated that subjects reported higher 
levels of guilt when they had no justification 
for aggressing against another person, no com- 
munication with the other person, and when 
the level of aggression was high. Berkowitz 
and Holmes (1960) reported that on a post- 
aggression questionnaire, subjects who had 
been highly aggressive tended to rate the 
victim of their aggression more favorably than 
did less aggressive subjects, a finding that 
was interpreted as evidence for higher levels 
of guilt or anxiety in the more aggressive 
subjects. Similarly, the results of Pepitone and 


Reichling (1955) indicated that subjects who 

1 This research was supported in part by Public 
Health Service Research Grant 20819. 

2 The author would like to thank John Schallow
original development of this experiment, and Arnold
ments on the manuscript. Credit is due to John
Schallow, Wynn Chapman, William Schmidt, and
Ward Evans who carried out this and a pilot
experiment.

Requests for reprints should be sent to David S.
Holmes, Department of Psychology, University of
Kansas, Lawrence, Kansas 66044.

had expressed the strongest hostility toward
an absent instigator later tended to give him 
the most favorable questionnaire ratings. 

In measuring guilt, a number of other investigators
have relied on the relationship between
expiation and guilt and have used
behavioral measures rather than verbal reports
or inferences based on attitude change.

That is, it was assumed that the degree of
self-punishment was determined primarily by
the levels of guilt the subjects felt, and,
therefore, the investigators used the degree to
which the subjects punished themselves (e.g.,
volunteered to participate in more or painful
experiments; Freedman, Wallington, & Bless,
1967; Wallace & Sadalla, 1966) as a behavioral
measure of guilt. This approach seems to
have considerable face validity, and it also
involves actual guilt-related behaviors that
may be more relevant and accurate than
verbal reports of guilt. Unfortunately, this
means of measuring guilt has been used primarily
after nonaggressive transgressions. One
exception, however, is the investigation of
Carlsmith and Gross (1969), in which it was
reported that subjects who had administered
electrical shocks to another subject showed
greater compliance in volunteering to participate
in a campaign to solicit signatures for
a petition than did subjects who had not
shocked anyone, a finding which the authors
attributed to “generalized guilt.” Further
work needs to be done, however, using the
expiation approach to the measurement of
postaggression guilt.

Another question concerning the relationship
between the expression of aggression and
subsequent guilt revolves around the differences
in levels of guilt as a function of
whether the target of the aggression was appropriate
or whether the aggression was displaced
onto another target. Since there would
be no justification for aggression against an
“innocent bystander,” it might be suggested
that greater guilt would be attendant on displaced
than on appropriately placed aggression.
If this were the case, the tension reduction
accomplished by the expression of
displaced aggression might be offset by additional
feelings of guilt, an occurrence which
would reduce the value of the aggressive
behavior.

The present experiment was carried out to 
determine first whether subjects who were 
frustrated and, therefore, expressed high levels 
of aggression (Dollard, Doob, Miller, Mowrer, 
& Sears, 1939) would express more guilt as 
measured by attempts at expiation than sub- 
jects who were not frustrated and did not 
express high levels of aggression. Second, an 
attempt was made to determine whether 
aggression which was displaced would result 
in more guilt than aggression which was 
directed at the source of the frustration. In 
this experiment high and low levels of frustra- 
tion were generated by forcing subjects to 
wait different lengths of time for a second 
subject who was to be the subject’s partner 
in the experiment, but who was late. Half of 
the subjects were then given the opportunity 
to express their frustration-generated aggres- 
sion toward the source of the frustration by 
shocking the late subject in what they thought 
was a learning experiment. The other half of 
the subjects were given the opportunity to dis- 
place their aggression onto an innocent by- 
stander, that is, another student who hap- 
pened to come down the hall and who the 
experimenter asked to stand in for the late 
subject. The degree to which the subjects 
shocked the late subjects was the measure of 
direct aggression, while the degree to which 
subjects shocked the innocent bystanders was 
the measure of displaced aggression. After
shocking the other individual, the subject was
told that he would now be the recipient of 
shock, and he was asked to indicate the degree 
of shock he wanted. The self-selected levels 
of shock were the measures of guilt expiation. 

The design of the present experiment also 
permitted the test of two conflicting hypotheses
concerning the differences in the amounts of 
aggression expressed when aggression is di- 
rected at a primary target, as opposed to 
when it is displaced onto a different target. 
In his application of “stimulus-response generalization”
to the explanation of displacement,
Miller (1948) asserted that the degree of
aggression which is expressed in displacement 
will decrease as a function of the dissimilarity 
between the primary target of the aggression 
and the target used for the displaced aggres- 
sion. That is, he predicted that the elicitation 
of the aggressive response would follow the 
laws of stimulus generalization. Using this 
position in the present experiment, it would 
be predicted that the late subject would be 
the recipient of more aggression than the 
volunteer since the volunteer would be some- 
what different from the subject. On the other 
hand, McNeil (1959) and Worchel (1960) 
suggested that the victim of displacement is 
an individual who creates some small annoy- 
ance for the aggression-sensitized subject and, 
in doing so, triggers off and becomes the 
target of all of the subject’s accumulated ag- 
gression—an amount larger than the victim’s 
small act justifies. This position does not 
rely on stimulus similarity, and it predicts 
that the target of displaced aggression will 
receive the same amount of aggression as the 
primary target of aggression. 


METHOD 


Sixty male undergraduate students from an intro- 
ductory psychology class at the University of Texas 
(Austin) served as subjects and were randomly 
assigned to one of four conditions. The experiment 
was a 2 × 2 factorial design, with two levels of
frustration-generated aggression (low versus high) 
and two targets for aggression (direct versus dis- 
placed). 
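
The random assignment described above can be illustrated with a short sketch. It assumes equal cell sizes (15 subjects in each of the four cells), which the text does not state, and the function and variable names are hypothetical. With 60 subjects and four cells, the error term has 60 - 4 = 56 degrees of freedom.

```python
import random
from collections import Counter

FRUSTRATION = ("low", "high")      # 5-minute versus 30-minute wait
TARGET = ("direct", "displaced")   # late subject versus volunteer


def assign_subjects(n_subjects=60, seed=None):
    """Randomly assign subjects to the four cells of the 2 x 2 design.

    Equal cell sizes are an assumption of this sketch, not a detail
    reported in the article.
    """
    cells = [(f, t) for f in FRUSTRATION for t in TARGET]
    if n_subjects % len(cells) != 0:
        raise ValueError("n_subjects must be divisible by the number of cells")
    per_cell = n_subjects // len(cells)
    slots = cells * per_cell       # one slot per subject
    rng = random.Random(seed)
    rng.shuffle(slots)             # random order of slots across arrivals
    return slots


assignment = assign_subjects(60, seed=1)
counts = Counter(assignment)
error_df = len(assignment) - len(counts)   # N minus the number of cells
print(counts)
print("error df:", error_df)
```

Shuffling a fixed list of slots, rather than drawing a cell at random for each subject, is what keeps the cells equal in size.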

When a subject arrived for the experiment, he
was told that the experiment involved two persons
and that his partner had not yet arrived. The subject
was then asked to wait in the corridor outside
the laboratory until the other subject arrived. During
his wait it was necessary for the subject to stand
since there were no chairs in the corridor. In the
low-frustration condition, the subject had to wait
in this situation for only 5 minutes, while, in the
high-frustration condition, the subject had to wait for
30 minutes.

At the end of the waiting period, a confederate
arrived who introduced himself to the subject as
either the late subject or as a student who was
looking for the office of a teaching assistant. The
introduction determined whether the subject was in
the direct or displaced aggression condition. If the
confederate used the late-subject introduction, the
two subjects (real and confederate) entered the laboratory
and the pseudoexperiment was begun. On the
other hand, if the inquiring-student introduction was
used, the student knocked on the door to ask the
experimenter if he knew the whereabouts of a teaching
assistant. The experimenter indicated that he
did not know the assistant in question, but before
the inquiring student could leave, the experimenter
explained that an experiment was being run in which
two subjects were needed and that the second subject
had not shown up. He then asked the inquiring
student whether he would mind helping out for a
few minutes by taking part in the experiment. The
inquiring student consented, he and the subject were
then taken into the laboratory, and the pseudoexperiment
was begun.³

The pseudoexperiment was the same in all conditions.
It was first explained to the subject and the
confederate that the experiment involved the effect
of “uncertain and certain stress on human conceptual
learning.” It was pointed out that the stress to be
used would be in the form of electric shocks and
that one person would be the “learner” and would
receive shocks while performing a task, while the
other person would be the “teacher” and would
deliver shocks. The experimenter then allowed the
subject to pick one of two cards to determine which
role he would play. Since, unknown to the subject,
both cards were labeled “teacher,” the subject was
always the one who administered shocks. Electrodes
were then strapped onto the confederate, who was
led into an adjoining room that was arranged so
that the subject could hear the experimenter explain
to the confederate what he presumably would be
doing in his learning task. The experimenter also
explained to the confederate that he was a subject
in the “uncertain stress condition” and that he
would, therefore, not know when he was going to
be shocked or how many shocks he would receive.
He was told, however, that the shocks would be
“mildly painful” but in no way dangerous. The
experimenter then left the room, closed the door,
and returned to the room where the subject was
waiting.

The subject was then told that his task in the
experiment was to shock the confederate, and the
operation of the shock apparatus was explained to
him. After he had become thoroughly familiar
with the apparatus, the subject was told that he
was working in the uncertain stress condition and
that he could shock the confederate as often as he
chose but to keep the shocks unpredictable. The
5-minute experimental period was then begun, during
which the subject was allowed to administer at will
what he believed were shocks to the confederate.
This gave the subject an opportunity to express
aggression against either the source of the frustration
or an innocent bystander. During this period the
experimenter sat quietly behind the subject where
he could not be seen.

At the conclusion of the 5-minute shock period,
the confederate was disconnected from the apparatus
and dismissed. The subject was then told that he
would now serve as the learner in the “certain stress
condition” of the experiment; that is, he was told
that he would be tested on the learning task while
receiving a specific number of shocks. It was pointed
out that to assure his certainty about the shocks,
he would be allowed to determine the number of
shocks which he was going to receive. He was then
asked to indicate how many shocks he was willing
to receive. This ended the experiment, and the
subject was completely debriefed.

The shock apparatus used in the present study
was a variation on the Buss aggression machine
(Buss, 1961). It consisted of a response panel containing
a red signal light and a switch labeled “shock
switch.” Heavy electrical cords ran from the back
of the panel through the wall and into the adjoining
room. The subject was led to believe that by throwing
the switch he could administer shocks to the
confederate. When the switch was thrown, the
apparatus emitted a harsh buzzing noise and the
red signal light came on, but unknown to the subject,
rather than shocking the confederate, the device
merely recorded the subject’s response on a counter.

The number of shocks which the subject administered
to the late subject and the volunteer were
used as behavioral measures of direct and displaced
aggression, respectively. The number of shocks that
the subject accepted for himself was used as a
behavioral measure of guilt.


3 It should be noted that the subjects were aware
of the fact that they were obligated to wait 55
minutes for any experiment. Because the volunteer
[illegible] the amount of time that the
subjects were committed to the experiment, but
obviously could reduce the time the subjects would
have to spend, it seems highly unlikely that the
volunteer would have been perceived as a [illegible]
and therefore been a target for direct
aggression.


RESULTS AND DISCUSSION


The deceptions involved in the present experiment
were apparently quite credible since
only four subjects reported that they had
been suspicious of what they had been
told. These subjects were eliminated from the
study and replaced by other subjects.


Aggression 

A 2 × 2 analysis of variance with two
levels of frustration (5-minute wait and 30-minute
wait) and two targets for aggression
(late subject and volunteer) was carried out
on the number of shocks given by the subjects.
This analysis revealed a significant effect
due to level of frustration (F = 6.08,
df = 1/56, p = .02; more shocks given with
high than low frustration), while the target
of aggression (F = .00, df = 1/56) and interaction
(F = .50, df = 1/56) effects were not
significant. From this it is clear that subjects
who were exposed to a high level of frustration
expressed more aggression in terms of
the number of shocks they gave than those
exposed to a low level of frustration, a finding
consistent with the early findings of Dollard
et al. (1939). Further, it is apparent that
the subjects showed no selectivity in terms
of whom they used as the target of their aggression,
a point that is dealt with in greater
detail below.


Guilt

A 2 × 2 analysis of variance carried out on
the number of shocks that the subjects agreed
to accept revealed a significant effect due to
level of frustration (F = 6.18, df = 1/56,
p = .02; more shocks requested under low
than high frustration), while the target of
aggression (F = .10, df = 1/56) and interaction
(F = .96, df = 1/56) effects were not
significant. Since significantly more self-punishment
was requested in the low- than in
the high-frustration condition, it can be concluded
that subjects who shocked other persons
in the presence of little, if any, feeling
of frustration felt more guilty than did subjects
who were frustrated when they shocked
other persons. The fact that there was not
a significant effect for targets of aggression
(late subject versus volunteer) suggests that
displaced aggression did not result in more
guilt. Any conclusions concerning guilt rest,
of course, on the validity of the measure of
guilt used in this experiment. In view of the
fact that this measure was found to be effective
in previous research (e.g., Wallace &
Sadalla, 1966) and the fact that it revealed
the expected results with regard to guilt after
aggression as a result of high or low frustration,
the measure used in this experiment
does seem to be reasonably valid. 
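
As a rough check on the probability levels reported for the two analyses, the sketch below converts an F ratio with 1 and 56 degrees of freedom into a p value. It relies on the identity between F(1, df) and the square of Student's t, and integrates the t density numerically with Simpson's rule. The function names are hypothetical, and the step count is an arbitrary choice of this sketch rather than anything taken from the article.

```python
import math


def t_pdf(t, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (
        math.sqrt(df * math.pi) * math.gamma(df / 2)
    )
    return coef * (1 + t * t / df) ** (-(df + 1) / 2)


def p_from_f(f_value, error_df, steps=2000):
    """Upper-tail p for an F ratio with 1 numerator degree of freedom.

    Uses F(1, df) = t(df)**2, so p equals the two-tailed t probability.
    The area under the t density on [0, sqrt(F)] is found by Simpson's
    rule; `steps` must be even.
    """
    t = math.sqrt(f_value)
    if t == 0:
        return 1.0
    h = t / steps
    total = t_pdf(0.0, error_df) + t_pdf(t, error_df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, error_df)
    central_area = 2 * (h / 3) * total   # P(-t < T < t)
    return 1 - central_area


# F ratios reported for frustration, target, and interaction effects
for label, f_value in [("aggression: frustration", 6.08),
                       ("aggression: interaction", 0.50),
                       ("guilt: frustration", 6.18),
                       ("guilt: interaction", 0.96)]:
    print(label, round(p_from_f(f_value, 56), 3))
```

Under these assumptions both frustration effects fall between .01 and .02, in line with the reported p = .02, while the target and interaction effects stay well above conventional significance levels.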

At first, it might be suggested that less 
guilt was expressed by frustrated than non- 
frustrated subjects because the frustrated sub- 
jects felt justified in expressing the high level 
of aggression (Brock & Buss, 1964) since 
they had been kept waiting a long time. On 
the other hand, the highly frustrated subjects 
were not objectively justified in expressing 
a high level of aggression against the volun- 
teer since this was not the individual who 
caused their wait. With regard to the dif- 
ferences in justification associated with the 
two targets, it should be noted that with the 
guilt measure there was no interaction be- 
tween levels of frustration and targets for 
aggression, a finding which suggests that justi- 
fication did not play a role in determining the 
guilt that a subject felt after the aggression. 

In summary, concerning guilt, it seems clear 
that when subjects were frustrated, they gave 
vent to aggressive feelings without subsequent 
feelings of guilt concerning this expression. 
This was the case regardless of against whom 
the aggression was expressed and whether or 
not the target deserved the aggression. 


Stimulus Generalization and Displacement 


The finding that there were no differences 
between the levels of aggression expressed 
toward the source of the frustration (the 
late subject) and toward the innocent by- 
stander (the volunteer) supports the ob- 
servations of McNeil (1959) and Worchel 
(1960), that the amount of aggression 
expressed will depend on the amount of 
accumulated frustration, rather than on 
the similarity between the source of the 
frustration and the target of the aggression. 
To reconcile the finding that there was 
no difference in the amounts of aggression 
expressed toward the late subject and the 
volunteer with the position and prediction of 
Miller (1948), it would be necessary to sug- 
gest that the late subject and the volunteer 
did not differ in stimulus similarity. It would 
seem, however, that in fact the two individ- 
uals were quite dissimilar conceptually. For 
example, one individual was an “experimental
subject,” while the other was a “student seeking
help”; one individual was obviously negligent
and was causing the subject considerable
and prolonged inconvenience, while the other 
was helpful and giving of his time to elimi- 
nate the subject’s inconvenience. As Miller 
(1951, 1959) has pointed out, these types of 
qualitative dimensions may be most important 
in terms of comparing people. Therefore, from 
the present investigation it seems that simi- 
larity, at least conceptual similarity, is not 
crucial in determining the amount of aggres- 
sion that will be displaced onto another 
target. Conclusions concerning physical simi- 
larity cannot be drawn from this experiment 
because the subjects did not have an oppor- 
tunity to compare the physical characteristics 
of the targets. 


CONCLUSIONS AND IMPLICATIONS 


The above results suggest a number of 
conclusions concerning frustration, aggression, 
displacement, and guilt. First, in this experi- 
ment, frustration did result in aggression. 
Second, the high-frustration subjects who ex- 
pressed the high level of aggression expressed 
less guilt than the low-frustration subjects 
who expressed little aggression. Third, in the 
situation in which direct expression of aggres- 
sion was not possible, the frustrated subjects 
did displace their aggression. Fourth, the level 
of aggression displaced was as high as the 
aggression that was expressed directly, despite 
the fact that the target for displaced aggres- 
sion was apparently quite dissimilar from the 
target for direct aggression. Last, the dis- 
placement of aggression onto an inappropriate 
target did not result in higher levels of guilt. 

The findings of this experiment seem to 
have a number of social implications, although 
the problems involved in generalizing from 
the laboratory to natural settings must be 
borne in mind. The fact that targets for high 
levels of aggression need not be related to the 
source of frustration that caused the aggres- 
sion is consistent with the observation that 
frustrated persons are likely to aggress in- 
discriminately in releasing their frustrations. 
Probably more important, however, is the 
finding that after the aggression has been 
expressed, the once frustrated individual is
not likely to experience guilt regardless of the
appropriateness of the target against which 
he expressed his aggression. This finding is 
especially interesting in view of the fact that 
it is based on the behavior of intellectually 
competent middle-class college students who 
were apparently well socialized; that is, it 
seems that this behavior is not unique to 
particular groups or “unsocialized delin- 
quents” (Hewitt & Jenkins, 1946) who have 
traditionally been thought to behave in this 
“immoral” fashion. It is important to note 
that in the absence of the feelings of guilt 
the individual will have fewer internal restraints
to inhibit his future performance of
prohibited acts, therefore relying more heavily
on external controls such as fear of punishment,
controls which are all too fragile (e.g.,
Ritter & Holmes, 1969). Also, since the individual
has no guilt, punishment for the act
may not be seen by him as appropriate or 
may be ineffective. 

While the present results were clear in
identifying a number of unexpected relationships
(e.g., frustrated subjects did not feel
guilty after aggression even if the aggression
was displaced) and in pointing out a potentially
serious problem of [illegible] social
control, the results, unfortunately, did not
offer a solution for the problem. It is clear
that more experimental attention should be
devoted to the relationship between aggression
and guilt for both theoretical and
practical reasons.


THE STARE AS A STIMULUS TO FLIGHT IN HUMAN SUBJECTS:
A SERIES OF FIELD EXPERIMENTS¹


PHOEBE C. ELLSWORTH,² J. MERRILL CARLSMITH, AND ALEXANDER HENSON


Stanford University 


A series of field experiments were performed to test the hypothesis that avoidance
behavior is elicited in human subjects by staring. In each experiment, the
experimenter stared (experimental group) or did not stare (control group) at
people stopped at a traffic light, and measured their speed across the intersection
when the light changed. Experimenters rode a motor scooter (Experiment I)
or stood on the street corner. Subjects were pedestrians (Experiment IV) or
automobile drivers. Presence versus absence of staring, sex of experimenter,
and sex of subject were varied. In all experiments, crossing time was significantly
shorter in the stare conditions. Main effects and interactions for sex
disappeared when four male and four female experimenters were used (Experiments
III and IV). Crossing time was significantly shorter when subjects
were stared at than when they were confronted with an incongruous situation
which did not involve staring (Experiment V). There was no correlation
between staring time and crossing time.


One of the most frequently reported com- 
ponents of agonistic or threat displays in 
primates is a steady, direct gaze at the object 
of aggression. This type of gaze is character- 
istic of the aggressive behavior of chimpanzees 
(Van Lawick-Goodall, 1968), gorillas (Schal- 
ler, 1963), and a wide variety of monkeys 
(Van Hooff, 1967). Typically, it occurs as a 
prelude to attack or as a substitute for it, 
depending on the reaction of the other ani- 
mal. This reaction is usually flight, a sub- 
missive display, a return gaze, or a combina- 
tion of these elements. Aggressive staring is 
probably more common in the dominant ani- 
mal in an encounter, although the subordi- 
nate may sometimes stare back temporarily. 
Gaze aversion is more characteristic of the
subordinate animal, and seems to be an expression
of submission (Altmann, 1967) or
an indication of peaceful intent. There are
also some nonaggressive situations in which
one animal may stare at another, although,
in adults, direct staring at the face seems to
occur primarily in aggressive encounters.
Tentatively, the evidence seems to indicate
that staring is necessary, but probably not
sufficient for a display to be communicative
of aggression in nonhuman primates.

Several students of primate behavior have
suggested that the fixed stare is also a form
of threat in man (e.g., Schaller, 1963; Van
Lawick-Goodall, 1968), and Van Hooff
(1967) noted that staring is often a prelude
to attack in man as well as in other primates.
There is some evidence that a gaze fixated on
the object is a component of the expression
of anger in humans (Ekman, 1972); however,
the fact that staring may precede attack does
not necessarily imply that staring serves as a
signal of aggressive intent in man. Research
on eye contact has shown that in a situation
in which they are being negatively evaluated
subjects avoid looking at a hostile interviewer
(Exline & Winters, 1965a), thus showing behavior
similar to primate submission, and also
rate the situation and the interviewer more
negatively when the interviewer makes frequent
eye contacts than when eye contact
is infrequent (Ellsworth & Carlsmith, 1968).
In neither of these studies, however, was
prolonged direct staring used as a variable.
In most experimental studies of visual interaction
in humans (e.g., Argyle & Dean, 1965;
Exline, Gray, & Schuette, 1965; Exline &
Winters, 1965a, 1965b), the visual behavior
of the subjects has been the dependent variable.
In experiments in which eye contact
has served as the independent variable (Argyle,
Lalljee, & Cook, 1968; Ellsworth & Carlsmith,
1968; Exline & Eldridge, 1967), staring
has not been one of the variables of interest.

1 Elizabeth Hastorf served as the female experimenter
in Experiment I. In Experiments III and IV,
the experimenters were Barbara Alperin, Jim Austin,
John Mower, Judy Payne, Steve Savery, John
Weeks, Kathy Weinberger, and Judy Whitney. Their
assistance is gratefully acknowledged.

2 Requests for reprints should be sent to Phoebe
C. Ellsworth, who is now at the Department of
Psychology, Yale University, New Haven, Connecticut
06510.

The work of Exline and others indicates
that eye contact is associated with a variety
of sentiments, such as affiliation and interest,
which are not related to aggression, and that,
in fact, “a higher percentage of eye contact
between communicators is typically associated
with more positive attitudes between the
communicators [Mehrabian, 1969, p. 364].”
It is possible, however, that staring has different
consequences from mere looking. Typically,
in a two-person interaction, the visual
behavior of the participants is extremely interdependent,
with the rate of change of gaze
direction very highly correlated (Kendon,
1967). For the purposes of the current study,
a stare was roughly defined as a gaze or look
which persists regardless of the behavior of
the other person. The impact of a stare may
be mitigated if the starer responds to the
other person by smiling or talking; maximal
impact will be achieved if the starer simply
continues to stare without making any facial
or verbal response to changes in the other
person’s behavior. The starer’s visual and
facial behavior, not being responsive to the
other person’s behavior, creates a type of
interaction which is different from those
studied in the above-mentioned experiments.³
A stare, in these terms, is more analogous to
the “spatial invasions” studied by Felipe and
Sommer (1966) than to “eye contact” as
conceptualized by those who have studied
visual interaction.

3 This distinction is clearer in principle than in fact.
In several of the experiments cited here, the experimenter
maintained a steady gaze toward the subject,
so as to be able to speak justifiably of “eye
contact.” The representativeness of this procedure
has been questioned on the grounds of its similarity
to staring as defined here (Duncan, 1969; Ellsworth
& Carlsmith, 1968).

The current study, then, was designed to
examine the effects of one particular form of
human visual behavior: the stare. It was hypothesized
that if a stare has negative or
threatening cue properties for people, then it 
should tend to elicit attempts to withdraw 
or escape from the situation. If escape is 
temporarily impossible, tension may build up, 
so that when the opportunity to escape does 
arise, the response may be exaggerated and 
thus more readily observable. A series of 
experiments were carried out to study this 
hypothesis. In four of the five experiments, 
the subjects were drivers stopped at an inter- 
section for a red light; in the fifth, they were 
pedestrians in the same situation. While the 
subject was waiting for the light to change, 
the experimenter stared at him. It was hy- 
pothesized that when the light changed to 
green, the subjects who had been stared at 
would cross the intersection (“escape”) faster 
than a control group of subjects who were not 
stared at. 


PILOT STUDY AND GENERAL OVERVIEW 


A pilot study was carried out to determine 
the general efficacy of the procedure. The 
experimenter pulled up next to the target 
car on a motor scooter, stared at the driver 
of the car until the traffic light changed, and 
measured the time it took the car to cross 
the intersection. A control group was simply
timed. The results of the pilot study were
favorable to the hypothesis: drivers who were
stared at crossed the intersection significantly
faster than those who were not (t = 2.17,
p < .025). In order to test the generality
of the phenomenon, sex of experimenter and 
sex of subject were systematically varied in 
the four subsequent experiments. The first 
experiment was a simple replication of the 
pilot study. The use of the motor scooter in 
this experiment raised the possibility that 
any aggressive overtones in the situation 
might have been due to a certain motor- 
vehicle-oriented type of competition. Thus, 
a second experiment was run in which the 
experimenter simply stood on a street corner. 
In Experiment III, four male experimenters 
and four female experimenters were used in 
order to conduct a valid test of the effects of 
staring across dyads including all combina- 
tions of sex of experimenter and sex of sub- 
ject. In Experiment IV, an attempt was made 
to extend the generality of the findings by
using pedestrians as subjects. And finally,
in Experiment V, a control group was added 
in which the experimenter engaged in an 
“incongruous,” “inappropriate” behavior that 
did not involve eye contact with the subject. 


Experiment I 
Method 


Subjects. Seventy-seven drivers, ranging in age 
from about 16 to 70, served as subjects. Of these, 
36 were male, and all were Caucasian. Drivers 
making turns were not used as subjects, nor were 
drivers who initiated conversations with the experi- 
menter. There were six subjects in this latter cate- 
gory, all of whom were in the stare condition. 
Finally, two other potential subjects were eliminated 
because it was obvious that they did not notice 
the experimenter because their attention was occupied 
with a distracting store window display. 

Setting. The experiment was carried out at a busy 
intersection on a four-lane street in the commercial 
center of a suburban town (population 56,000) near 
Stanford University. The results of the pilot study 
indicated that time of day was not a significant 
variable, so the subjects were run during both the 
morning and the afternoon at all hours except rush 
hours. Both the male and the female experimenters 
were college undergraduates. 

Procedure. The situation was timed so that the 
experimenter, astride a rather dilapidated motor 
scooter, was always the first one to arrive at the 
red signal. The experimenter stopped at the signal 
in the inside lane, so as to be able to stare into the 
target car on the driver’s side. Each experimenter
had a list of the sequence of conditions in which 
the subjects were to be run, drawn up previously 
with the order of conditions determined by a table 
of random numbers. Every Caucasian subject who 
pulled up next to the experimenter at the light, and 
who proceeded straight across the intersection was 
included as a subject, precluding experimenter bias 
in the selection of subjects and their assignment
to conditions. If the driver made a turn at the
intersection, the next subject was run in the same 
condition. 

In the stare condition, when the subject pulled 
up next to the scooter, the experimenter turned 
his/her head and stared directly at the subject’s face. 
The eye-to-eye distance between the experimenter 
and the subject was between 4 and 5 feet. The
experimenter never wore dark glasses and always
delivered the stare manipulation with the naked eye.

4 Because of the possibility that the elimination
of these subjects might have biased the results in
favor of the predicted outcome (e.g., if the conversa-
tionalists were genial, slow-moving people), the data
were reanalyzed, with the six slowest subjects in the
stare condition eliminated. This procedure did not
alter the significance level of the results (t = 3.64,
p < .001).
Subjects were run whether or not they were wearing
dark glasses. The experimenter maintained the same
impassive, neutral expression throughout the staring
period. The stoplight was visible in the
experimenter’s field of vision. As soon as it changed
to green, the staring period ended and the crossing
period began. The crossing period ended when the
rear wheels of the subject’s car crossed the nearer
of the white lines on the far side of the intersection.
The experimenter did not start driving until after
the subject’s crossing time had been measured. A
two-handed stopwatch was used for measuring the
staring time and the crossing time. (The stopwatch
was kept in the experimenter’s pocket throughout
the timing, and was thus not visible to the subject.)
As the experimenter began to stare, both hands
were set in motion. One hand was stopped when
the light changed (giving a measure of the staring
time), and the other when the subject reached the
other side of the intersection (giving a measure of
total time). The crossing time was obtained by
subtracting the staring time from the total time, and
thus included the amount of time it took the driver
to start as well as the actual time spent in crossing.
The experimenter then turned the corner, pulled over
to the side of the street, and wrote down the staring
time, total time, and the subject’s sex and estimated
age (by decade).
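
The subtraction described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic: the function name `crossing_time` and the sample stopwatch readings are hypothetical and are not taken from the study.

```python
def crossing_time(staring_time: float, total_time: float) -> float:
    """Return crossing time in seconds: total time minus staring time.

    staring_time: reading of the hand stopped when the light turned green.
    total_time:   reading of the hand stopped when the car reached the
                  far side of the intersection.
    """
    if staring_time < 0 or total_time < staring_time:
        raise ValueError("expected 0 <= staring_time <= total_time")
    return total_time - staring_time


# Hypothetical stopwatch readings (seconds), for illustration only.
example = crossing_time(staring_time=12.4, total_time=17.7)
print(f"crossing time: {example:.1f} s")
```

Because both hands start together, the crossing time defined this way includes the driver's start-up latency, as the text notes.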

The procedure in the no-stare control group was
identical to that in the stare condition, except that
the experimenter merely glanced at the subject to
get the information on sex and age, and then looked
away. When the light changed, the crossing time
was recorded. The design thus was a 2 × 2 × 2
factorial, varying presence versus absence of staring,
sex of experimenter, and sex of subject. The results
of Experiment I are discussed below along with the
results of Experiment II.


Experiment II


Method 


In order to rule out possible “challenge” or “drag 
racing” effects which might potentially be expected 
to produce the predicted effect in Experiment I, 
where both the experimenter and the subject were 
equipped with motors and wheels, Experiment II was 
run without the scooter. The setting was switched 
to a one-way street parallel to the street in Experi- 
ment I and a block away from it. The experimenter 
stood on the corner of the sidewalk and, in the
stare condition, began staring as soon as a car pulled
up in the nearer lane and stopped for a red light.
Eighty-eight Caucasian subjects were run, covering 
approximately the same age range as in Experiment I. 
Forty-seven of the subjects were male. Three addi- 
tional drivers were not used as subjects because they 




5 At the suggestion of Michael Argyle, a follow-up
study was run to determine whether staring had
differential effects on subjects with and without dark
glasses. There were no differences in crossing time
between the two groups.




did not notice the experimenter. The male
experimenter was the same as in Experiment I, and the
female was a graduate student. The eye-to-eye
distance was between 4 and 6 feet. The assignment of
subjects to conditions and the recording of staring
times and crossing times were carried out in exactly
the same manner as in Experiment I, and the other
precautions used in the experiment were also
maintained. The street corner was not a popular
hitchhiking location, and, in fact, neither of the
experimenters ever found any other person waiting there.
In addition, hitchhikers in this town almost always
signal their intent with either a thumb or a painted
sign or both, whereas the experimenters simply stood
there. It is conceivable that some drivers believed
the experimenters to be hitchhikers, though none
gave any indication that this was the case.

The design was a 2 × 2 × 2 factorial with the same
variables as in Experiment I.


Results of Experiments I and II


The basic hypothesis was strongly sup- 
ported in both experiments. Subjects who were 
stared at crossed the intersection significantly 
faster than subjects in the no-stare condition.
In Experiment I, the mean crossing time was 
5.3 seconds in the stare condition and 6.5 
seconds in the no-stare condition; in Experi- 
ment II, the means were 5.5 seconds and 6.7 
seconds, respectively. In both experiments, 
the variance was greater in the no-stare than 
in the stare condition, but according to 
Hartley’s test (Winer, 1962, p. 93), the 
assumption of homogeneity of variance was 
not violated. Tables 1 and 2 show the eight 
means for each experiment. Tables 3 and 4 
show the analyses of variance, calculated with 
an unweighted-means analysis (Winer, 1962, 
p. 222). In both experiments, the main effect 
for staring is highly significant (p < .001).
The fact that the effect occurred in both 
experiments indicates that the increased 
speed was primarily an avoidance reaction 
rather than a response to a perceived racing 
challenge. 


TABLE 1
MEAN CROSSING TIMES IN EXPERIMENT I (SCOOTER)

Condition   Experimenter   Male subject   Female subject   Both sexes
Stare       Male           5.2 (8)        5.9 (13)         5.6 (21)
Stare       Female         4.8 (10)       5.3 (9)          5.0 (19)
Stare       Both                                           5.3 (40)
No stare    Male           8.0 (6)        6.5 (13)         6.9 (19)
No stare    Female         6.1 (12)       5.9 (6)          6.1 (18)
No stare    Both                                           6.5 (37)

Note.—N = 77; numbers in parentheses indicate cell ns. 
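
As a rough consistency check on Table 1, the marginal means can be recomputed from the cell means and cell ns. The sketch below assumes the marginal entries are n-weighted (pooled) means of the cells; the article does not say how they were computed, and the names `CELLS`, `REPORTED`, and `pooled_mean` are illustrative. Because the cell means are themselves rounded to one decimal, agreement is only expected to within about 0.1 second.

```python
# Cell means (seconds) and cell ns from Table 1, keyed by
# (condition, sex of experimenter). Each list holds the
# (mean, n) pairs for male and female subjects, in that order.
CELLS = {
    ("stare", "male"): [(5.2, 8), (5.9, 13)],
    ("stare", "female"): [(4.8, 10), (5.3, 9)],
    ("no stare", "male"): [(8.0, 6), (6.5, 13)],
    ("no stare", "female"): [(6.1, 12), (5.9, 6)],
}

# Marginal means printed in Table 1 for the same four groups.
REPORTED = {
    ("stare", "male"): 5.6,
    ("stare", "female"): 5.0,
    ("no stare", "male"): 6.9,
    ("no stare", "female"): 6.1,
}


def pooled_mean(cells):
    """Return (n-weighted mean, total n) for a list of (mean, n) pairs."""
    total_n = sum(n for _, n in cells)
    weighted_sum = sum(mean * n for mean, n in cells)
    return weighted_sum / total_n, total_n


for key, cells in CELLS.items():
    mean, n = pooled_mean(cells)
    print(key, f"pooled={mean:.2f}", f"n={n}", f"reported={REPORTED[key]}")
```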
TABLE 2
MEAN CROSSING TIMES IN EXPERIMENT II (STREET CORNER)

Condition   Experimenter   Male subject   Female subject   Both sexes
Stare       Male           5.9 (?)        6.3 (?)          6.1 (21)
Stare       Female         4.9 (12)       4.8 (10)         4.8 (22)
Stare       Both                                           5.5 (43)
No stare    Male           6.7 (?)        7.5 (?)          7.0 (22)
No stare    Female         6.1 (?)        6.8 (?)          6.4 (23)
No stare    Both                                           6.7 (45)

Note.—N = 88; numbers in parentheses indicate cell ns; (?) marks cell ns that are illegible in the source.
TABLE 3
ANALYSIS OF VARIANCE FOR EXPERIMENT I (SCOOTER)

Source                     df    MS       F
Stare (A)                   1    31.93    24.75**
Sex of experimenter (B)     1    12.79     9.91*
Sex of subject (C)          1      .26      .20
A × B                       1     2.38     1.84
A × C                       1     9.79     7.59*
B × C                       1     1.06      .82
A × B × C                   1     2.29     1.78
Within error               69     1.29

* p < .01.
** p < .001.
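
Each F in Table 3 is the effect mean square divided by the within-cell error mean square. A minimal sketch of that arithmetic follows, using the values printed in the table; the dictionary and function names are illustrative only, and small discrepancies in the last digit reflect rounding in the published mean squares.

```python
# Mean squares and F ratios as printed in Table 3 (Experiment I).
MS = {
    "Stare (A)": 31.93,
    "Sex of experimenter (B)": 12.79,
    "Sex of subject (C)": 0.26,
    "A x B": 2.38,
    "A x C": 9.79,
    "B x C": 1.06,
    "A x B x C": 2.29,
}
REPORTED_F = {
    "Stare (A)": 24.75,
    "Sex of experimenter (B)": 9.91,
    "Sex of subject (C)": 0.20,
    "A x B": 1.84,
    "A x C": 7.59,
    "B x C": 0.82,
    "A x B x C": 1.78,
}
MS_WITHIN = 1.29  # within-cell error mean square, 69 df


def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F ratio for a fixed-effects test: effect MS over error MS."""
    return ms_effect / ms_error


for source, ms in MS.items():
    f = f_ratio(ms, MS_WITHIN)
    print(f"{source:26s} F = {f:6.2f} (reported {REPORTED_F[source]:.2f})")
```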


There was also a significant main effect for
sex of experimenter in both experiments, with
subjects crossing the intersection faster in the
presence of a female experimenter (Experiment I:
p < .01; Experiment II: p < .001).
In Experiment I, there was also a significant
interaction between staring and sex of subject,
such that males crossed the intersection
faster than females in the stare condition and
slower in the no-stare condition (p < .01).
This interaction was not significant in
Experiment II, where there was a slight but
significant main effect for sex of subject, with
males crossing faster than females in all
conditions (p < .05).
TABLE 4
ANALYSIS OF VARIANCE FOR EXPERIMENT II (CORNER)

Source                     df    MS       F
Stare (A)                   1    36.31    44.28**
Sex of experimenter (B)     1    20.00    24.39**
Sex of subject (C)          1     4.16     5.20*
A × B                       1     1.96     2.39
A × C                       1     1.41     1.71
B × C                       1      .33      .40
A × B × C                   1      .23      .28
Within error               80      .82

* p < .05.
** p < .001.


Since the duration of the stare could not
be held constant (as it depended on how soon
the target car arrived at the intersection
before the light changed), the actual staring
time varied from 3.4 seconds to 27.5 seconds
in Experiment I and from 3.0 seconds to 19.5
seconds in Experiment II (the traffic light in
Experiment II had a shorter period than in
Experiment I). In neither experiment, however,
was the staring time correlated significantly
with the crossing time (Experiment I:
r = .09; Experiment II: r = .06). Thus, it
would seem that once a person recognizes that
he is being stared at, the tendency to escape
is immediately elicited at, or nearly at, full
strength, rather than becoming stronger as
the disturbing stimulus continues. Inspection
of scatter plots of the data suggested no
evidence of nonrectilinear correlations.
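
To see how far these correlations fall from conventional significance, r can be converted to a t statistic with n - 2 degrees of freedom. The sketch below makes one assumption that the article does not state: that each correlation was computed over the stare-condition subjects only (40 in Experiment I and 43 in Experiment II, from Tables 1 and 2), since staring time exists only for them. The function name `t_from_r` is illustrative.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """Convert a Pearson correlation to a t statistic with n - 2 df."""
    if n < 3 or not -1.0 < r < 1.0:
        raise ValueError("need n >= 3 and -1 < r < 1")
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# Reported correlations; the sample sizes are assumed (stare condition only).
for label, r, n in [("Experiment I", 0.09, 40), ("Experiment II", 0.06, 43)]:
    print(f"{label}: r = {r:.2f}, n = {n}, t({n - 2}) = {t_from_r(r, n):.2f}")
```

Under that assumption both t values are well below 1, far short of the value of roughly 2 needed for significance at the .05 level with about 40 degrees of freedom.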

The estimated age of the subjects was
slightly correlated with crossing time in
Experiment I (r = .21, p < .05). This correlation,
however, was almost entirely due to the
behavior of the subjects in the no-stare
condition; age and crossing time were not
significantly correlated in the stare condition. Thus,
it seems highly unlikely that being stared at
accentuates a competitive urge in peers.
In Experiment II, there was no significant
correlation between age and crossing time.

In neither experiment was there any
difference in crossing time related to the
presence of passengers in the car. Cars with
passengers were not very common, and were
fairly evenly distributed among the eight cells
in both experiments.


Experiment III 
Method 


The main effects and interaction for the sex
variables in Experiments I and II were unexpected and
difficult to interpret. In addition, it seemed unwise
to draw any conclusions on the basis of the sex
of experimenter variable, since there was only one
experimenter of each sex in each experiment, and the
results could easily have been due to other
characteristics of the individual experimenters. In order to
elucidate the problems raised by the significant sex
effects, Experiment III was run with four male
experimenters and four female experimenters.
Because the street used in Experiment II had been
switched to two-way, the setting was moved to
another one-way street in the same town, about
10 blocks away. The method was the same as in
Experiment II, except that an observer, unaware of
the condition for each subject, recorded the crossing
times from a position on the other side of the street
from the experimenter. This procedure precluded the
accurate recording of staring times, but given the
near-zero correlations between staring time and
crossing time in Experiments I and II, it seemed more
valuable to eliminate the possibility of measurement
bias arising from the experimenter’s knowledge of the
subject’s condition than to gather additional data
TABLE 5
MEAN CROSSING TIMES IN EXPERIMENT III (EIGHT EXPERIMENTERS, DRIVER SUBJECTS)

Condition   Experimenter   Male subject   Female subject   Both sexes
Stare       Male           5.6 (20)       5.8 (12)         5.7 (32)
Stare       Female         5.5 (21)       5.6 (11)         5.5 (32)
Stare       Both                                           5.6 (64)
No stare    Male           6.3 (22)       6.5 (10)         6.3 (32)
No stare    Female         6.0 (17)       7.0 (15)         6.5 (32)
No stare    Both                                           6.4 (64)

Note.—N = 128; numbers in parentheses indicate cell ns.


on the staring time-crossing time relationship. The 
observer could not tell whether the experimenter 
was staring or not. A total of 128 subjects were 
run, of whom 80 were male. Each experimenter ran 
16 subjects. None of the experimenters had run sub- 
jects in Experiment I or Experiment II, and all were 
college freshmen. The eye-to-eye distance between 
the experimenter and the subject was the same as
in Experiment II, and the selection of subjects and
their assignment to conditions were carried out in
the same manner as in the other two experiments, 
except that non-Caucasian subjects were also run. 

The design of the experiment was thus a 2 × 2 × 2
factorial, varying stare versus no stare, sex of subject, 
and sex of experimenter. In addition, we can analyze 
the effect of the eight experimenters. 


Results 


The results are presented in Table 5, and
an unweighted-means analysis of variance is
shown in Table 6. The results are analyzed
as a 2 × 2 × 8 factorial, with experimenters
contributing the eight levels in order to show
the appropriate error terms for the F tests.
Sex of experimenter, with one degree of
freedom, is shown as a part of each mean square
involving experimenters. It is apparent once
again that staring reduces the amount of time
taken to cross the intersection. When subjects
are stared at, they take an average of 5.6
seconds to cross the intersection; when not
stared at, they average 6.4 seconds. Viewing
experimenters as a random factor in the
analysis, the appropriate F test for this effect uses
the Stare × Experimenter interaction as an
error term (F = 30.42, df = 1/7, p < .005).
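
The choice of error term matters for the size of F. The sketch below recomputes the reported ratio from the mean squares in Table 6 and, purely for comparison, shows the F that would result if the within-cell error were used instead. The second figure is not reported in the article; it is included only to illustrate the design choice, and the constant names are illustrative.

```python
# Mean squares read from Table 6 (Experiment III).
MS_STARE = 5.78                 # Stare (A), 1 df
MS_STARE_X_EXPERIMENTER = 0.19  # A x C interaction, 7 df
MS_WITHIN = 0.32                # within-cell error, 96 df

# Experimenters treated as a random factor: the Stare x Experimenter
# interaction is the error term, so the test has 1 and 7 df.
f_random = MS_STARE / MS_STARE_X_EXPERIMENTER

# Hypothetical alternative (not reported): experimenters treated as
# fixed, so the within-cell error is the denominator (1 and 96 df).
f_fixed = MS_STARE / MS_WITHIN

print(f"random-factor F(1, 7)  = {f_random:.2f}")
print(f"fixed-factor  F(1, 96) = {f_fixed:.2f}")
```

Using the interaction as the error term gives fewer denominator degrees of freedom, but it supports generalizing the staring effect beyond the eight particular experimenters who were sampled.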

Table 6 also indicates that the troublesome 
effects attributable to sex which appeared in 


the first two experiments have disappeared. 
There are no main effects of either sex of 
experimenter or sex of subject which approach 
significance, nor are there any interactions 
which appear to approach significance. Evi- 
dently, the effects observed in the first two 
experiments were due to idiosyncratic charac- 
teristics of the particular experimenters used. 
It is of some interest to note that in this 
experiment there was no effect due to the 
experimenters, nor did that variable interact 
with anything else. This gives us some confi- 
dence in the extent to which we can general- 
ize about this phenomenon across various 
individuals who may be staring. 


Experiment IV 
Method 


In the three experiments reported so far, the sub- 
jects were always automobile drivers. Although the 


TABLE 6
ANALYSIS OF VARIANCE FOR EXPERIMENT III (EIGHT EXPERIMENTERS, DRIVER SUBJECTS)

Source                     df    MS      F
Stare (A)                   1    5.78    30.42*
Sex of subjects (B)         1     .97
Experimenter (C)            7     .63
Sex of experimenter (D)     1     .11
A × B                       1     .66
A × C                       7     .19
A × D                       1     .19
B × C                       7     .38
B × D                       1     .16
A × B × C                   7     .21
A × B × D                   1     .39
Within error               96     .32

* p < .005.
“drag-race challenge” hypothesis was ruled out by
depriving the experimenters of their wheels, the pos- 
sibility remains that drivers may be motivated to 
demonstrate the power of their cars, the speed of 
their reactions, or other less obvious qualities when 
supplied with an attentive spectator. In order to rule 
out car-specific hypotheses altogether, Experiment IV 
was conducted with pedestrians as subjects, on the 
assumption that fast walking is not considered indica- 
tive of prowess or bravado. The street corner was 
the same as in Experiment I. The experimenter stood 
on the corner of the sidewalk, and in the stare condi- 
tion began staring as soon as a pedestrian came up 
to the crosswalk and faced the red light. Each ex- 
perimenter had a supply of 16 jelly beans in his/her 
pocket, 8 each of two different colors. One color 
signified the stare condition, the other, the no-stare 
condition. When a potential subject arrived at the 
crosswalk, the experimenter removed 1 of the jelly 
beans, noted its color, and ate it. In this manner, 
a quasi-random assignment of subjects to conditions 
was achieved without the experimenter attracting the 
attention of the subject and other bystanders by 
consulting a list. As in Experiment III, the timing
was done by an observer on the other side of the 
street who was unable to tell whether the experi-
menter was staring or not. Since the first person to
reach the corner was always the subject, the observer 
had no difficulty determining which pedestrian to 
time. The crossing time was defined as the interval 
between the onset of the green light and the moment 
when the subject stepped up onto the curb after 
crossing the street. The eight experimenters were the 
same as in Experiment III, and the eye-to-eye dis-
tance between the experimenter and the subject
varied from 34 to 44 feet. The staring manipulation 
may have been somewhat more salient to the sub- 
jects in this experiment than in the car studies, since 
occasionally a third person would come up and block 
the experimenter’s line of regard, forcing him to shift 
his position in order to reestablish a clear view. 
Each experimenter ran 16 subjects, for a total of 
128, of whom 73 were male. 
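
The jelly-bean procedure amounts to drawing conditions without replacement from a balanced pool. A minimal sketch is given below; the function name `draw_assignments`, the seed, and the use of a shuffled list to stand in for blind draws from a pocket are all illustrative assumptions, and the sketch idealizes the procedure as giving exactly eight subjects per condition for each experimenter.

```python
import random


def draw_assignments(n_per_condition=8, seed=None):
    """Return the order of conditions for one experimenter's subjects.

    Models a pocket holding n_per_condition beans of each color, drawn
    one at a time without replacement. Shuffling the pool is equivalent
    to drawing blindly until the pocket is empty.
    """
    rng = random.Random(seed)
    pocket = ["stare"] * n_per_condition + ["no stare"] * n_per_condition
    rng.shuffle(pocket)
    return pocket


order = draw_assignments(seed=0)
print(order)
print(order.count("stare"), order.count("no stare"))
```

Unlike an independent coin flip for each subject, drawing without replacement fixes the number of subjects per condition in advance, which is presumably why the authors call the assignment quasi-random.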

The design of this experiment is essentially the
same as Experiment III. A 2 × 2 × 2 factorial design,
varying stare versus no stare, sex of subject, and
sex of experimenter comprised the basic design. Again,
the analysis treats the design as a 2 × 2 × 8 design,
with one degree of freedom corresponding to sex
of experimenters pulled out from the experimenter
and experimenter interaction sum of squares in
each case.


Results


Again, the results suggest a strong effect
due to the staring of the experimenter.
Table 7 shows the mean crossing time for all
groups. The means are not comparable to
earlier experiments, since the subjects in this
experiment had to walk across the
intersection. Subjects who were stared at crossed the
intersection in an average of 11.1 seconds;
control subjects required 12.2 seconds. This
difference is reflected in the analysis of
variance by an F ratio of 29.91 (df = 1/7,
p < .005). Once again, there is no strong
evidence of any main effect or interaction
involving sex of experimenter or sex of
subject. Again, staring leads to flight, as defined
by speed of departure from the presence of
the starer when the situation permits. This
phenomenon is independent of any
characteristics observable in our sample of eight
starers.


TABLE 7
MEAN CROSSING TIME FOR EXPERIMENT IV (EIGHT EXPERIMENTERS, PEDESTRIAN SUBJECTS)

Condition   Experimenter   Male subject   Female subject   Both sexes
Stare       Male           11.1 (23)      10.9 (10)        11.0 (33)
Stare       Female         11.6 (16)      10.9 (17)        11.2 (33)
Stare       Both                                           11.1 (66)
No stare    Male           12.1 (17)      11.0 (14)        11.6 (31)
No stare    Female         12.8 (17)      12.9 (14)        12.9 (31)
No stare    Both                                           12.2 (62)

Note.—N = 128; numbers in parentheses indicate cell ns.


Experiment V


Although the results of the first four
experiments supported the notion that a direct
fixed stare elicits avoidance in a variety of
situations, the interpretation of the situation
in terms of the interpersonal, communicative
aspects of the stare was not the only
possible approach to an explanation of the effect.
The behavior of the starer was unusual and 
incongruous, and it was impossible to rule out 
an interpretation based on the hypothesis that 
people are motivated to withdraw from all 
situations in which other people are behaving 
incongruously or inappropriately. According 
to this hypothesis, staring would simply be 
one example of incongruous behavior, and the 
fact that it happened to be a form of behav- 
ior included within the framework of non- 
verbal communication research would be ir- 
relevant in terms of its motivating properties 
in the experimental situation reported here. 


Method 


In order to test this hypothesis, it was first
necessary to devise an adequate operational definition of
incongruity. The authors wished to avoid using an
incongruous behavior that would “load” the
experiment one way or the other, either by using a
situation with threatening overtones (e.g., a male
experimenter standing on the corner cracking a whip), or
by using a situation with sexual overtones (e.g., a
female sunbathing on the sidewalk). In order to
provide a fair test, it was important to select a
behavior in which incongruity was the only salient
and reliable stimulus property. In order to find such
a behavior, an informal questionnaire containing a
list of 20 possible “incongruous” behaviors that could
be applied in the experimental paradigm was
presented to 30 undergraduates taking the introductory
psychology course. The students were asked to rate
the situations on a number of scales and to provide
their own adjectival label for each situation. On
the basis of the results of this pretest and other
informal questioning, an operational definition of
perceived incongruity was devised.


Procedure 


The experimenter sat on the sidewalk with a 
hammer, on a street corner a block away from that 
used in Experiment III, and began to tap and pick 
at the sidewalk in front of her whenever a car pulled 
up in the nearer lane and stopped for a red light. 
She did not look at the driver of the car or acknowl- 
edge his presence in any way, but simply went on 
with her activity, calmly and quietly. Two additional 
conditions were run: a replication of the stare condi- 
tion used in Experiments II-III and a replication of 
the no-stare condition. During these two conditions, 
the experimenter hid the hammer behind a telephone 
pole so that it was not visible to the subject. The 
experimenter was the same female graduate student 
as in Experiment II, and the drivers were timed
across the intersection by a high school student who 
was hired for the occasion and who was unaware of 
the hypothesis. The timer stood on the sidewalk 
across the street from the experimenter and a little 
TABLE 8
ANALYSIS OF VARIANCE FOR EXPERIMENT IV (EIGHT EXPERIMENTERS, PEDESTRIAN SUBJECTS)

Source                     df    MS       F
Stare (A)                   1    10.17    29.91*
Sex of subject (B)          1     2.52
Experimenter (C)            7     2.91
Sex of experimenter (D)     1     3.79
A × B                       1      .01
A × C                       7      .34
A × D                       1     1.36
B × C                       7     1.19
B × D                       1      .37
A × B × C                   7      .62
A × B × D                   1      .65
Within error               96     1.00

* p < .005.


to the rear of the car, as in Experiment III. Thirty
subjects were run, 10 in each condition. The distance
between the experimenter and the subject was the
same as in Experiments II and III, as was the
method by which the subjects were selected and
assigned to conditions.


Results 


The hypothesis that incongruity is the rele- 
vant stimulus to flight in this situation was 
not confirmed. The mean crossing times were 
4.0 seconds in the stare condition, 5.7 in the 
no-stare condition, and 5.5 in the incongruity 
condition. A one-way analysis of variance was 
performed on the data, yielding a significant 
effect for treatments (F = 5.97, p < .01). A
comparison between the stare condition and 
the average of the other two (Winer, 1962, 
p. 67) indicated that the stare condition was 
clearly different from the other conditions and 
that there was very little difference between 
the incongruity and the no-stare condition 
(F = 11.68, p < .005). Thus, the construct of 
incongruity, in itself, seems insufficient to 
explain the results of the series of experiments 
reported here. 
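
The two reported F ratios can be cross-checked from the three condition means alone. The sketch below assumes equal cell sizes (n = 10 per condition, as stated in the Procedure) and back-calculates the error mean square implied by each reported F. The variable names are illustrative, and the implied error term is an inference from the published figures, not a value given in the article.

```python
# Condition means (seconds) and cell size reported for Experiment V.
means = {"stare": 4.0, "no stare": 5.7, "incongruity": 5.5}
n = 10
k = len(means)

# Between-groups sum of squares and mean square for equal n.
grand_mean = sum(means.values()) / k
ss_between = n * sum((m - grand_mean) ** 2 for m in means.values())
ms_between = ss_between / (k - 1)

# Planned contrast: stare versus the average of the other two conditions.
weights = {"stare": 1.0, "no stare": -0.5, "incongruity": -0.5}
contrast = sum(weights[c] * means[c] for c in means)
ss_contrast = contrast ** 2 / (sum(w ** 2 for w in weights.values()) / n)

# Error mean square implied by each reported F (inferred, not reported).
ms_error_from_omnibus = ms_between / 5.97
ms_error_from_contrast = ss_contrast / 11.68

print(f"contrast value          = {contrast:.2f} s")
print(f"SS between / SS contrast = {ss_between:.2f} / {ss_contrast:.2f}")
print(f"implied MS error (omnibus F)  = {ms_error_from_omnibus:.2f}")
print(f"implied MS error (contrast F) = {ms_error_from_contrast:.2f}")
```

Under these assumptions the two implied error terms agree closely, and the stare contrast accounts for nearly all of the between-groups sum of squares, which is consistent with the authors' reading that the incongruity and no-stare conditions barely differ.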

The variance was smallest in the stare 
condition, intermediate in the no-stare condi- 
tion, and largest in the incongruity condition, 
but these variances did not differ among 
themselves sufficiently to violate the assump- 
tion of homogeneity. In this and the other 
experiments reported here, the consistently 
low variances in the stare conditions are prob- 
ably due to a floor effect. The differences 
among the variances, however, provide some 
slight additional evidence that the effect of 
staring is not the same as the effect of any
incongruous situation which does not involve 
staring. In the incongruous situation, the 
variability was slightly increased over that in 
the no-stare control group, while staring seems 
to have a more unifying effect on subjects’ 
behavior. 


Discussion 
Staring Time and Crossing Time 


The fact that the staring time was not cor- 
related with the subjects’ speed across the 
intersection in Experiments I and II is quite 
important, since it indicates that the full 
avoidance response is elicited by the percep- 
tion that one is being stared at. The duration 
of the gaze necessary for it to be perceived 
as a stare and therefore to produce the avoid- 
ance response is very short; in these experi- 
ments, subjects who were stared at for less 
than 5 seconds drove off just as fast as those 
who were stared at for 18 seconds or more. 
Psychologically, the actual amount of time 
spent gazing is probably less relevant to the 
perceiver’s reaction than certain behavioral 
criteria, the most important of which is the 
starer’s failure to react to the other person’s 
glance. The expected reaction on the part of 
the starer would seem to be either gaze aver- 
sion (analogous to a primate appeasement 
signal), or a mitigating change in facial ex- 
pression, such as a smile (also a signal in the 
absence of hostile intent), or some verbal 
clarification of the situation; if none of these 
occurs immediately, the gaze is capable of 
eliciting the avoidance response, even in the 
absence of other hostile signals (such as a 
frown). This temporal relationship was clearly 
apparent in the behavior of the subjects in 
the experiments reported here. As soon as 
they had stopped at the light, most of them 
would begin to look around and would almost 
immediately notice the starer. Within a sec- 
ond or two, they would avert their own gaze 
and begin to indulge in a variety of appar- 
ently nervous behaviors, such as fumbling 
with their clothing or radio, revving up the 
engines of their cars, glancing frequently at 
the traffic light, or initiating animated con-
versation with their passengers.6 If there was


6 Incidentally, the variety of these behaviors 
coupled with the low variance in the stare condition 
a long time interval before the light changed,
the subjects tended to glance furtively back 
at the experimenter, averting their gaze as 
soon as their eyes met his. It should be re- 
membered that the six drivers who attempted 
to converse with the experimenter in Experi- 
ment I (and thus were not used as subjects) 
were all in the stare condition. In the no-stare 
condition, the subject’s eyes occasionally met 
the experimenter’s while the experimenter was 
glancing over to make a note of the subject’s 
age and sex. In these cases the experimenter 
immediately looked away, the subjects sat 
quietly waiting for the light to change, then 
crossed the intersection at speeds no different 
from those of the subjects who did not see 
the experimenter glance at them. 


The Staring Effect 


The studies reported here demonstrate that 
staring at humans can elicit the same sort of 
responses that are common in primates; that 
is, staring can act like a primate threat dis- 
play. Furthermore, this effect was obtained 
by 11 different experimenters in three dif- 
ferent situations. However, the fact that 
avoidance is a predictable response in these 
situations does not necessarily imply that 
staring functions as an unequivocal signal of 
aggressive intent in man as it does in many 
other primates. Nor does it necessarily imply 
that staring automatically triggers a response 
(attack or flight or their surrogates) appro- 
priate to an agonistic encounter. In most pri- 
mate groups, both the role relationships and 
the individual’s “rules” of behavior for ex- 
pressing aggressive, nonaggressive, or submis- 
sive intent are relatively clear-cut, so that the 
question of what a behavior means or what 
response is appropriate in a given situation 
comes up much less frequently than it does in 
human interactions. In the experiments re- 
ported here, it is quite possible that the sub- 
jects became tense for lack of an appropriate 
response for the situation. Thus, there are two 
possible explanations for the results, the de- 


provides some evidence against the possible alternative
explanation that subjects in the stare condition
were more attentive to the traffic light, and thus
more likely to start moving as soon as it changed.
No comparative data were collected on this point,
since it was impossible to observe the subjects’
direction in the no-stare condition.
gree of overlap between them depending on
the generality of the perception of staring as 
a form of threat. The first is that staring is 
generally perceived as a signal of hostile in- 
tent, as in primates, and elicits avoidance (or, 
in some circumstances, counterattack). The 
second is that gazing at a person’s face is an 
extremely salient stimulus with interpersonal 
implications which cannot be ignored. The 
stare, in effect, is a demand for a response, 
and in a situation where there is no ap- 
propriate response, tension will be evoked, 
and the subject will be motivated to escape 
the situation. 


A situation in which there is no apparent
response often involves incongruous behavior,
or, in Sommer’s terms (Felipe & Sommer,
1966), norm violation. The incongruous or
norm-violating properties of the staring
situation, however, are insufficient to account for
the avoidance response; in Experiment V, it
was demonstrated that avoidance is not a
necessary response to incongruous behavior. 
Although there was no apparent appropriate 
response in the incongruous condition, neither 
was there any demand for a response from 
the subject; he could remain uninvolved. If it 
is assumed that staring is a salient stimulus 
which forcibly involves the subject in an 
interpersonal encounter and demands a re- 
sponse, the fact that there is no appropriate 
response becomes very important to the sub- 
ject, arousing tension and eliciting avoidance 
at the earliest possible moment. The invasions 
of personal space studied by Felipe and 
Sommer (1966) have properties similar to 
those postulated here for staring; in addition 
to violating a norm, the experimenter’s behav- 
ior was of a kind that precluded subjective 
noninvolvement.

This interpretation of the stare in human 
interactions, as well as the alternative hy- 
pothesis that staring is typically perceived as 
a threat, should be tested in the laboratory. 
The questions which this line of research next
suggests are: (a) Do subjects perceive a
stare as an indication of hostile intent in the 
absence of other aggressive cues? (b) When 
appropriate responses are available, does the 
stare evoke these responses, or does it still 
evoke withdrawal? (c) Under what circum-
stances does staring elicit avoidance behavior,
and under what circumstances is it perceived
as a threat?

It was hypothesized that responses on a measure of attraction-rejection would
be more positive toward a dissimilar stranger whose attitudes were expressed 
in open-minded, rather than dogmatic, terms. In two experiments, subjects 
were exposed to dogmatic or open-minded statements, which were handwritten 
in Experiment I (N = 42) and expressed orally by confederates in Experiment 
II (N = 57). In both experiments, analysis of variance indicated confirmation
of the hypothesis (p < .02). Alternative interpretations and theoretical impli-
cations of the data are discussed.


The enormous variability among human
cultures, groups, and individuals is an obvious 
descriptive fact. As the earth’s unoccupied 
“buffer” space decreases at an accelerating 
rate and as human populations become more 
and more interdependent, contacts between 
very dissimilar people become increasingly 
unavoidable. Given these circumstances, toler- 
ance of dissimilarity among people would seem 
to be an essential prerequisite for peaceful 
and harmonious existence. One of the most 
reliable phenomena to be found in social psy- 
chological literature, however, is the tendency 
to reject those who are dissimilar. The gen- 
erality of this phenomenon across cultures and 
subcultures and across age and occupational 
groups has been clearly demonstrated, for 
example, in experiments on interpersonal at- 
traction (Byrne, 1969). Not only are dis- 
similar others disliked, but they are also 
evaluated negatively with respect to such 
characteristics as intelligence, adjustment,
knowledge, and morality (Byrne, 1961a,
1971). Negative evaluation, in turn, has been
found to elicit reciprocal rejection even more 
negative than that occasioned by dissimilarity 
(e.g., Aronson & Worchel, 1966; Byrne & 
Ervin, 1969; Byrne & Griffitt, 1966; Byrne 
& Rhamey, 1965). Thus, intolerance initiates 
a cycle that may become literally vicious. 
In addition, dislike of one’s fellows has 
been found to have deleterious effects on 
learning (Lott, 1969; Lott & Lott, 1966) and 
on productivity (Hoyt, 1967). 

Assuming, then, that intolerance of dis- 
similarity is both undesirable and omnipres- 
ent, it may be concluded that there is need 
for research to determine what conditions 
would be most conducive to greater tolerance. 

The term intolerance is used here to refer 
to rejection of a stranger on the basis of 
information indicating that he holds attitudes 
that are dissimilar to those of the subject. 
This usage is consistent with Rokeach’s 
(1960) proposition that “the more closed our 
belief systems, the more we will reject others 
who disagree with us, and the more we will 
accept others because they agree with us 
[p. 80].” 

Most of the research approaches to dif-
ferential tolerance for dissimilarity have con-
centrated on the response and the responder.
For example, response to dissimilarity has 
been investigated as a function of dogmatism 
(Baskett, 1966; Gormly & Clore, 1969), in- 
security (Goldstein & Rosenfeld, 1969), au- 
thoritarianism (Byrne, 1965; Sheffield & 
Byrne, 1967), and need for affiliation (Byrne,
1961b, 1962). These attempts to identify the
individuals who are most and least tolerant
of dissimilarity have been only sporadically
successful. 

A complementary approach is to examine 
those aspects of the stimulus that evoke nega- 
tive reactions and to attempt to manipulate 
them. It has been proposed that dissimilar 
attitudes lead to dislike because they elicit 
negative affect, and that they elicit negative 
affect because dissimilarity is threatening 
(Byrne & Clore, 1970; Clore & Gormly, 
1969). Palmer’s (1969) vindication hypothe-
sis implies that the threat involves primarily
the fear of being incorrect and hence being 
made to appear a fool. It might be speculated 
that much of this reaction rests on repeated 
childhood experiences in which mistakes in 
language, in reasoning, or in understanding 
are met with ridicule. 

It is suggested that one reason for the re- 
pulsive effect of dissimilar attitudes is that 
they implicitly challenge the competence of 
the individual. According to Rokeach (1960),
opinionated language, which is an overt,
verbal aspect of dogmatism,

gives us two kinds of information about the speaker.
First, it tells us whether the speaker accepts or rejects
a particular idea or belief. Second, it tells us
whether the speaker accepts or rejects people depend-
ing on whether they agree or disagree with this idea
[p. 46].


It is a common practice to wrap one’s own 
political views, for example, in simple-minded 
rhetoric which implies that “my side” is cor- 
rect, and “the other side” is the last refuge 
for scoundrels and fools. Even if an attitude
statement does not directly accuse a person 
of being foolish or ignorant, an unmodified 
disagreement may be perceived as requiring 
that one person is “right” and the other is 
“wrong.” Thus, either directly or indirectly, 
dogmatic wording should increase the chal- 
lenging or threatening effect of disagreeing 
attitudes. Conversely, open-minded wording 
might reduce threat by allowing the subject 
to “save face.” 

In a sociometric study of informal
relationships among dormitory residents, 
Rosenfeld and Nauman (1969) found that 
peer-group motivation to engage in social 
relationships with the subject was negatively 
related to the dogmatism score of the subject.
In the present experiments, it was hypothe-
sized that attraction is more positive toward
an open-minded dissimilar stranger than 
toward a dogmatic dissimilar stranger. 


EXPERIMENT I 
Method 


Dogmatic and open-minded attitude statements 
were composed by deducing implications for verbal 
behavior from dictionary definitions and from 
Rokeach’s (1960) classic description of open and 
closed minds. Definitions of dogmatism suggest that 
the construct may be multidimensional, at least in 
terms of behaviors attributed to dogmatic persons. 
Of course, a large part of the construct is conceptu- 
alized as a type of internal cognitive structure. On 
the other hand, the aspect of dogmatism that seems 
to have the most significance for interpersonal rela-
tionships is intolerance for opposing views and for
people who hold such views. Composition of the
statements for the present experiment focused mainly 
on the latter aspect of dogmatism, so that the dog- 
matic and open-minded statements differed greatly in 
amount of tolerance for opposing views. Statements 
were also composed to represent both positive and 
negative positions with respect to various topics, 
which covered a wide range of issues. 

Subjects. The subjects were 42 male undergraduates 
at the University of Texas who had been pretested 
on a 24-item attitude scale at the beginning of the 
semester. In this experiment, the attitude statements 
were presented in the form of handwritten answers 
on a 16-item open-ended questionnaire. Each subject 
received the filled-out questionnaire of what was 
purportedly another student. His task was to evaluate 
the stranger on the basis of information gained from 
reading the attitude statements. The primary depen-
dent variable was interpersonal attraction, measured
by the Interpersonal Judgment Scale (Byrne, 1969). 
This instrument consists of six 7-point rating scales, 
the last two of which pertain to the subject’s prob- 
able liking or disliking for the stranger and his 
feelings about working with him. Scores on these 
two scales are summed to yield a measure of 
attraction-rejection with a possible range from 2 to
14 points and a split-half reliability of .85
(Byrne & Nelson, 1965). 
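The scoring rule just described can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' scoring procedure: the function name `attraction_score` and the example ratings are hypothetical, and the six ratings are assumed to be listed in scale order, so that the last two are the liking and working-together items.

```python
def attraction_score(ratings):
    """Sum the last two of six 7-point ratings (liking, working together).

    `ratings` is assumed to hold the six Interpersonal Judgment Scale
    items in order; the function name and input layout are hypothetical.
    """
    if len(ratings) != 6:
        raise ValueError("expected six ratings")
    if any(not (1 <= r <= 7) for r in ratings):
        raise ValueError("each rating must lie between 1 and 7")
    # Only the fifth and sixth items enter the attraction measure.
    return ratings[4] + ratings[5]


example = [4, 5, 3, 6, 2, 5]  # hypothetical ratings, not data from the study
print(attraction_score(example))  # prints 7
```

With every item at its minimum the score is 2, and with every item at its maximum it is 14, which matches the possible range stated above.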

A 2 × 2 factorial design was used, the two factors
being attitude similarity and dogmatism—open- 
mindedness of the wording of the attitudes. Each 
subject received a single completed questionnaire 
which contained either all dogmatic or all open- 
minded statements. In the similar condition, 12 of 
the 16 statements (proportion of similar attitudes
= .75) agreed with the subject’s views, and 4 state-
ments disagreed. In the dissimilar condition, only 4
statements (.25 similarity) agreed with the subject’s atti-
tudes, while 12 disagreed. The topics on which the
stranger expressed similar or dissimilar attitudes were 
randomly selected.


Results


The mean attraction responses for the 
four experimental conditions are presented in 
Table 1. Analysis of variance indicated sig-
nificant main effects for both attitude simi-
larity (F = 12.77, df = 1/38, p < .001) and
dogmatism–open-mindedness (F = 8.39, df =
1/38, p < .01). The interaction F was less
than 1. Attraction was more positive toward 
a similar stranger than toward a dissimilar 
stranger and more positive toward a stranger 
expressing his views open-mindedly than 
toward a stranger who expressed his views 
dogmatically. 

The hypothesis dealt with the effects of 
verbal dogmatism on tolerance for dissimi-
larity. As hypothesized, rejection of the dog-
matic dissimilar stranger was greater than re-
jection of the open-minded dissimilar stranger
(F = 7.44, df = 1/23, p < .02). A compara-
ble analysis of the response to the similar
stranger yielded no significant effects for 
verbal dogmatism (F = 1.64, df = 1/15, ns). 


Discussion 


The results clearly support the hypothesis 
that intolerance of persons with dissimilar 
attitudes is greater when the attitudes are 
stated dogmatically than when they are stated 
open-mindedly. 

Generalization of findings from one situa- 
tion to another requires that the crucial vari- 
ables responsible for the obtained effect be 
present in each situation and that those vari- 
ables which are different in the two situations 
be irrelevant to the effect. Determination of 
generalizability can be made only by repeat- 
ing experiments in additional situations in 
which those variables deemed crucial remain 
the same while other aspects of the situation 
are altered. In the present instance, the dog- 
matism hypothesis was tested in a second 
experiment, which differed from the first in 
mode of presentation of the attitude state- 
ments (oral rather than written), in attitudi- 
nal contrast (two strangers who disagreed with 
one another rather than one stranger), in 
mode of presentation of the stranger (hypo- 
thetical stranger represented by paper-and- 
pencil responses versus live confederates), and 
in various minor aspects of the stimulus con-
text. If the results of the two experiments are
consistent, the generality of the effect of
verbal dogmatism will obviously be much
greater than it would have been if only
one isolated experiment or two identical
experiments had been conducted.


TABLE 1

MEAN ATTRACTION RESPONSES TOWARD OPEN-MINDED
OR DOGMATIC STRANGER ON BASIS OF EXPOSURE
TO SIMILAR OR DISSIMILAR WRITTEN STATEMENTS

                         Proportion of similar attitudes
Stranger's statement          .25          .75
Open-minded                  9.09        11.20
Dogmatic                     6.64         9.71


EXPERIMENT II
Method 


Several weeks before the experimental sessions, 57 
subjects (30 males, 27 females)³ were pretested on
an attitude survey in their introductory psychology 
classes at the University of Texas. 

Subjects were run individually. Each subject and 
two confederates reported separately to the desig- 
nated experimental room. The experimenter met them 
and described the experiment as one involving tele- 
vision interviews. It was explained that the focus of 
interest was on the effect of being interviewed in 
the presence of peers. In one condition, two students 
would be interviewed together and would be ob- 
served on television by the third student. In the 
other condition, one student would be interviewed 
alone without an audience of peers. A drawing was
held, and it was arranged that the subject would 
be assigned to serve as the audience while the two 
confederates were interviewed together; later the 
subject was to be interviewed alone. All three stu- 
dents were given mimeographed copies of the 12 
questions that would be asked. The confederates 
were taken to the interview room and the subject 
to a separate room where he could watch the process 
on closed-circuit television. In the interim, the con- 
federates received scripts indicating their responses to 
each question. 

During the interview the confederates were seated
at a table facing the camera. Each question was asked
once, and both confederates answered before the
next question was asked. In each interview, 10 of
the 12 answers given by one confederate were con-
gruent with the views of the subject (similarity
= .83), while the other confederate gave only two
agreeing responses (similarity = .17). Thus, attitude
similarity was a within-subjects variable. Verbal dog-
matism was a between-subjects variable in that both
confederates gave dogmatic or else both gave open-
minded statements. In a given interview each con-
federate was told which responses to give but was
unaware of whether he was assigned to the similar
or dissimilar role. Eight different confederates were
used. The role assignment and order of respond-
ing were counterbalanced. Each confederate served
equally in dogmatic and open-minded conditions. The
sex of the confederates, the camera technician, and
the interviewer was matched with that of the sub-
ject. Effort was made to secure confederates who
were good actors, relatively “neutral” in appearance,
and matched in pairs as to physical attractiveness,
height, hair color, voice quality, etc.

³ Sixty subjects were originally run in Experi-
ment II, but data were eliminated for the first 3
female subjects because the confederates found it
necessary to discontinue their participation in the
experiment, and it was therefore not possible to
counterbalance roles for those confederates.

Following the interview, the subject was given 
two copies of the Interpersonal Judgment Scale, 
asked to write the name of one interviewee on the 
top of each, and to evaluate the two individuals on 
the basis of what had been learned about them in 
watching the interview. 
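The similarity proportions reported for the two experiments follow directly from the counts of agreeing statements. The arithmetic can be checked with a short sketch; the function name `similarity_proportion` and the condition labels are hypothetical conveniences, and only the counts (12 or 4 of 16 statements in Experiment I, 10 or 2 of 12 answers in Experiment II) come from the text.

```python
def similarity_proportion(agreeing, total):
    """Proportion of a stranger's statements that agree with the subject."""
    if total <= 0 or not (0 <= agreeing <= total):
        raise ValueError("need 0 <= agreeing <= total and total > 0")
    return agreeing / total


# Counts taken from the Method sections; labels are illustrative only.
conditions = {
    "Experiment I, similar": (12, 16),
    "Experiment I, dissimilar": (4, 16),
    "Experiment II, similar": (10, 12),
    "Experiment II, dissimilar": (2, 12),
}

for label, (agreeing, total) in conditions.items():
    print(f"{label}: {similarity_proportion(agreeing, total):.2f}")
```

Rounded to two places, these give .75 and .25 for Experiment I and .83 and .17 for Experiment II, the values cited in the text and in the column headings of the tables.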


Results 


As may be seen in Table 2, attraction was 
found to be influenced by attitude similarity 
(F = 82.80, df = 1/55, p < .001). There was 
no main effect for verbal dogmatism (F < 1), 
but the similarity-dogmatism interaction was 
significant (F = 9.10, df = 1/55, p < .005). 
Further analysis indicated that the responses 
closely paralleled those in the first experiment. 
Toward the dissimilar stranger, as hypothe- 
sized, there was a significant effect for verbal 
dogmatism (F = 6.81, df = 1/55, p < .02),
while toward the similar stranger, the effect
was not significant (F = 1.43, df = 1/55, ns).


Discussion 


Both experiments provide support for the 
hypothesis that open-minded disagreement is 
more tolerable than dogmatic disagreement. 
Furthermore, this effect has been found again 
in a replication of Experiment I, which was 
an ancillary component of a later experiment 
(Hodges & Byrne, 1971). It is clear that the 
open-minded versus dogmatic manipulation 
used in these experiments produced strong ef- 
fects on attraction toward dissimilar strangers. 
The lack of an effect in the similar attitude 
condition indicates that the dogmatic versus
open-minded statements did not represent
simply unpleasant versus pleasant qualities.
Rather, the effects were specific to response
to a disagreeing stranger.


TABLE 2

MEAN ATTRACTION RESPONSES TOWARD OPEN-MINDED
OR DOGMATIC STRANGERS ON BASIS OF EXPOSURE
TO SIMILAR AND DISSIMILAR ORAL STATEMENTS

                         Proportion of similar attitudes
Strangers' statement          .17          .83
Open-minded                  8.16        10.23
Dogmatic                     6.85        10.88

Although the operational relationships be- 
tween the independent and dependent vari- 
ables are clear, the results leave room for 
more than one interpretation and, of course, 
for further research. For example, the verbal 
dogmatism variable can be analyzed further 
into more molecular constructs, such as atti- 
tude strength and insult to disagreers. Such 
components may each contribute to the ob- 
served effects, or perhaps a single effective 
component could be isolated. In any case, it is 
noteworthy that dictionary definitions and 
authoritative descriptions (Rokeach, 1960) of 
the natural molar behaviors which are labeled 
dogmatic or open-minded apparently include 
essentially the same components as do the 
statements used in these experiments. Thus, 
while further analysis will contribute to the 
understanding of the mechanisms which pro- 
duce the effects, the two constellations of 
statements seem to be appropriately labeled 
dogmatic and open-minded. 

Intuitive analysis of the apparent differ- 
ences between the dogmatic and open-minded 
statements suggests that they differ in attitude 
strength. This possibility raises two questions: 
(a) Does the stranger’s attitude strength have 
a significant effect on the relationship between 
similarity and attraction? and (b) Do sub- 
jects perceive differences in attitude strength 
between the two types of statements used in 
the present experiments? 

The first question was originally answered 
by Nelson (1965), who experimentally ma- 
nipulated the extremeness of the attitudes 
expressed by the stranger and found that 
attitude strength did not significantly affect
the similarity-attraction relationship. However,
when Nelson analyzed his data in terms
of the magnitude of the discrepancy between 
the stranger’s attitudes and those of the sub- 
ject, discrepancy was found to have a slight 
but significant (p < .05) influence on attrac- 
tion in addition to the effects of similarity— 
dissimilarity. This effect of the two partially 
independent stimulus elements was later con- 
firmed by Byrne, Clore, and Griffitt (1967). 
Hence, the more negative response to dog- 
matic, dissimilar strangers could have been 
at least partially a function of greater response 
discrepancy perceived in that condition. 

To determine what levels of belief strength 
are perceived by subjects in the dogmatic and 
open-minded statements, videotapes of actual 
interviews from Experiment II were shown to 
two undergraduate personality classes at 
Purdue University. One class observed an 
open-minded interview, while the other ob- 
served a dogmatic interview with the same 
confederates. After each response was made 
on the tape, the subjects were asked to indi-
cate on attitude scales the way they believed
that individual would have responded to the
item. It was found with respect to each con- 
federate that the mean belief strength at- 
tributed to the dogmatic statements was sig- 
nificantly higher than the mean belief strength 
attributed to the open-minded statements
(t = 6.52 and 7.51, df = 49, p < .001).

In Nelson’s (1965) experiment neither the 
attitude strength of the stranger nor the atti- 
tude strength of the subject was found to in- 
fluence attraction. Rather, discrepancy of atti- 
tude strength between subjects and strangers 
was the variable that influenced attraction, 
and even discrepancy had a much smaller in- 
fluence in that experiment than did similarity 
(with attitudes dichotomized into agree and 
disagree categories). In fact, one may note 
that the effect of discrepancy in that experi- 
ment, which used more subjects than either of 
the present experiments, was less statistically 
significant (p < .05) than the effects of the
dogmatism variable in these experiments. 

Nevertheless, in an attempt to isolate pos- 
sible effects of discrepancy in the present 
experiments, the original attitude surveys and 
experimental records from Experiment II 
were reexamined, and discrepancy scores were
calculated by comparing each subject’s atti-
tude level with the mean belief levels attrib- 
uted to the confederates. The correlation be- 
tween discrepancy and attraction was not 
significantly different from zero (r = .03 for 
similar stranger and —.19 for dissimilar 
stranger). 

It appears as though neither attitude
strength nor attitude discrepancy was re-
sponsible for the significant effects of the
open-minded versus dogmatic statements. 
However, the available evidence does not com- 
pletely rule out the possibility that discrep- 
ancy may have had some influence on attrac- 
tion in these experiments, since the subjects 
and the situation used for the discrepancy 
test were different from those in the original 
experiment. On the other hand, even if 
discrepancy did influence attraction to some 
degree, the relatively small effects found in 
the previous experiments in which atti- 
tude strength was systematically manipulated 
(Byrne et al., 1967; Nelson, 1965), together 
with the negative results of the test for dis- 
crepancy effects with the open-minded and 
dogmatic statements, suggest that discrepancy 
is unlikely to have produced the observed 
effects by itself.

Considering the theoretical basis on which 
the statements were composed, as well as the 
fact that the dogmatism variable was effective 
only under conditions of dissimilar attitudes, 
it seems most plausible that the critical vari-
able was related to tolerance for opposing
views or implied respect for those who dis-
agree. Therefore, the results of these experi-
ments are tentatively interpreted as sup-
porting the proposal that intolerance of per-
sons with dissimilar attitudes is at least in
part a defensive reaction against threat to a 
person’s feelings of competence. 

Regardless of the mechanisms involved, it 
seems clear that verbal open-mindedness is
more conducive to tolerance of dissimilar atti- 
tudes than is verbal dogmatism. This rela- 
tionship has been verified in quite different 
situational contexts. Whether open-minded- 
ness is a mitigating factor, dogmatism is an 
exacerbating factor, or both, is still an empiri-
cal question. Presumably, though, dogmatism-
open-mindedness represents a continuous di-
mension, and, at this point, we speculate,
parsimoniously, that the effect of this variable 
would also be a continuous function. 

One question that arises is whether it is 
possible to teach ourselves, to teach others, 
and/or to teach children to express their views 
in an open-minded way. Can people learn to 
think and speak so as to express the idea: 
“There are arguments for X and arguments 
for Y. There is no way to determine conclu- 
sively whether X or Y is correct. Intelligent 
individuals of good will may come to support 
X or to support Y. I am strongly in favor 
of X for what I believe to be excellent rea- 
sons, but I may be wrong, and I respect your 
right to support Y.”? Offhand, it appears 
much easier for people to learn to think and 
speak dogmatically and to reject their op- 
ponents with insulting epithets, rotten eggs, 
rocks, and bullets.

An experimental investigation of determinants of whether people in need will
seek help from an available source was based on an attributional analysis of
when seeking help is experienced as threatening to self-esteem. Subjects per-
ceived that they were performing poorly on a social judgment task and that
available guidelines could help them. Help was sought significantly more
(a) when it was reasonable to attribute responsibility for failure externally
rather than to the self; (b) when the attributes linked to help seeking were
peripheral rather than central to the subjects’ self-conception (only among sub-
jects with high self-esteem); (c) by subjects with low rather than high self-
esteem (only when help seeking reflected on central attributes); and (d) by
subjects with low rather than high achievement motivation.


Recent research has shed much light on 
help giving, but the other side of the helping 
paradigm, the conditions that affect whether 
help that is available will be sought or ac- 
cepted, remains relatively unillumined. As the 
example of poor families who forego their 
legitimate right to public assistance suggests, 
people are often reluctant to solicit or accept 
the help which they need. The current study 
was designed to identify some of the factors, 
both situational and individual, that influ-
ence whether people in need will seek help.


Situational Variables 


In keeping with much theorizing in social 
psychology, it was assumed that readiness to 
seek help, like readiness to undertake many 
other actions, is a function of perceived threat 
to self-esteem. Attribution theory, as devel-
oped by Jones and Davis (1965) and Kelley
(1967), suggests that the potential for threat
to self-esteem inherent in an act of seeking 
help is directly related to whether the person 
attributes responsibility for his failure and 
hence for his need for help to himself. Attribu- 
tion of personal responsibility for failure to 
the self is most likely to occur in the absence 
of adequate external justifications for failure. 

The perceived adequacy of external justifi-
cations for failure may derive from informa-
tion the person possesses both about the prop-
erties of the performance situation itself and
about the responses of others in the same sit-
uation. Regarding task properties, Feather
(1967) has shown, for example, that the more 
objectively difficult a task was perceived to 
be, the less disturbed subjects were by failure. 
Regarding rates of others’ performance, 
Weiner and Kukla (1970) found that judged 
external as against self-attribution for failure 
on a task was a positive function of the pro- 
portion of group members who were perceived 
to have failed at the task. 

Examples of how both bases for external 
attribution presumably have been used to 
cancel the demeaning quality of seeking or 
accepting help readily come to mind. Leaders 
of underprivileged groups have justified de- 
mands for help by attributing their constitu- 
ents’ need to a discriminatory social structure. 
Reluctant oldsters have been urged to accept 
social security benefits with the argument that
since everyone over 65 receives them, there is
no reflection upon their integrity. 

Locus of attribution for failure affects the 
probability that seeking help will threaten 
self-esteem. The intensity of the threat experi- 
enced, however, depends on the nature of the 
inadequacy acknowledged when seeking help 
to overcome the failure. Seeking the services 
of a psychiatrist, for example, is likely to be 
more threatening than calling upon a lawyer 
to draw up a contract. This is because, for 
most of us, social adjustment and mental 
health are attributes more central to our self- 
conceptions than is knowledge of the fine 
points of legal language and practice. Thus, 
one determinant of the intensity of threat to 
self-esteem inherent in an act is the ego cen- 
trality of the attributes on which the failure 
necessitating that act reflects. The centrality 
of an attribute to a person’s self-conception 
can be expressed in terms of the impact on 
his overall level of self-esteem of evaluations 
on that attribute. Central attributes are func- 
tionally linked with many aspects of one’s 
self-concept, while peripheral attributes are 
loosely tied to other aspects of the self (Mil- 
ler, 1961). 

To summarize, the decision to seek help can 
be formulated as a two-step process. First, 
the person who is failing allocates responsibil- 
ity for his failure to himself and/or to possi- 
ble external causes. To the extent that failure 
is self-attributed, it is experienced as reflect- 
ing personal inadequacies. The more central 
the characteristics deemed inadequate are to 
the person’s self-conception, the more threat- 
ening to self-esteem is acknowledgment of 
failure. Second, the decision to seek help is 
reached based on the earlier attributions. If 
there has been self-attribution for failure, and 
especially if the failure reflects on central 
attributes, the person is likely to experience 
much threat to self-esteem in anticipation of 
seeking help. The act of seeking help under 
these conditions has the symbolic significance 
of affirming that one is indeed possessed with 
the inadequacies implied by one’s poor per- 
formance. An attribution of failure to external 
causes or a downgrading of the centrality of 
the implicated attributes would reduce the 
symbolic threat inherent in seeking help. We
therefore hypothesized that (a) help is more
likely to be sought when the locus of attribu-
tion for failure is external rather than to the
self, and that (b) help is more likely to be
sought when failure reflects negatively on 
attributes which are peripheral rather than 
more central to the self-conception of the 
person in need. 

The above discussion emphasizes the nega- 
tive implications of seeking help for self- 
esteem. If other people are likely to observe 
the act, embarrassment may be a further 
source of inhibition. In order to isolate the 
effects of threat to self-esteem unconfounded 
by the reactions of others, an attempt was 
made in this study to minimize embarrassment 
as a source of variance. 


Individual Difference Variables 


Threat to self-esteem is the key intervening 
variable in the above analysis of situational 
impacts on help seeking. Thus, it is reasonable 
to ask whether a person’s persistent (chronic)
level of self-esteem also influences the readi- 
ness to seek help. Two theoretical formula- 
tions speak to the question of how people 
with different levels of self-esteem respond 
when they are confronted with experiences 
threatening to their feelings of self-worth 
(Bramel, 1968). When applied to help seek- 
ing, these formulations yield opposing pre-
dictions.

According to a cognitive consistency 
formulation, information about the self is dis- 
turbing to a person in proportion to the 
amount of inconsistency between the informa-
tion and his general level of self-esteem. For
a person high in self-esteem, the cognition 
that his status on important attributes is 
called into question by seeking help is 
strongly dissonant with his positive self- 
image. For a person of low self-esteem, the 
same cognition is less dissonant because of 
his generally low self-regard. 

The cognitive consistency formulation leads 
to the prediction that (a) persons with low 
self-esteem are more likely to seek help than 
are those with high self-esteem. An interac-
tion prediction may be more appropriate if
Aronson (1968) is correct in arguing that dis-
sonance is created only when behavior is in-
consistent with important self-conceptions or
strong expectancies. In that event, those low 
in self-esteem will be more likely to seek 
help than those high in self-esteem only in 
the high centrality condition. 

According to the second formulation, level 
of self-esteem influences how secure or inse- 
cure people are in the face of information that 
threatens their self-conception. This might 
be termed a “vulnerability” formulation. The 
person with low self-esteem is one who pos- 
sesses relatively few positive cognitions about 
himself. Hence, his need to defend against 
information that would undermine even these 
cognitions is great. The person with high self- 
esteem, by contrast, has a wealth of favorable 
self-cognitions, so that he can acknowledge 
negative information with less concern (see 
Deutsch, Krauss, & Rosenau, 1962; Dittes, 
1959; for similar views). Regarding help seek- 
ing, the vulnerability formulation leads to the 
prediction that (b) persons with high self- 
esteem are more likely to seek help than are 
those with low self-esteem. 

The need to seek help usually arises when 
a person fails to attain a desired goal or to 
achieve a standard of excellence. Thus, a sec- 
ond individual difference that may influence 
readiness to seek help is the person’s achieve- 
ment motivation (need for achievement minus 
fear of failure). In their attributional analysis 
of achievement motivation, Weiner and Kukla 
(1970) proposed that when people fail, those 
high in achievement motivation are more 
likely to attribute their failure to a lack of 
effort and less likely to attribute it to deficient 
ability than are those low in this motivational 
tendency. When failure is attributed to lack 
of effort rather than ability, seeking help is 
less likely because further effort alone may be 
expected to produce success. This analysis 
leads to the hypothesis that (c) achievement 
motivation is negatively related to seeking 
help. 

The same hypothesis derives directly from 
the interpretation of achievement motivation 
as an orientation to obtaining gratification 
from performing up to a standard of excel- 
lence (Atkinson, 1964). Gratification arises 
not from overt signs of success, but from the 
knowledge that one has excelled through his 
own efforts. Those high in achievement moti- 
vation are therefore expected to avoid seeking 
help because such behavior precludes obtain- 
ing the gratifications attached to achieving on 
one’s own. 


METHOD 
Subjects 


Female undergraduates at the University of Wis- 
consin, drawn from a pool of students who had indi- 
cated interest in participating in paid research pro- 
grams, were contacted first by mail and then by 
telephone and offered $2 to take part in a medical 
sociology research study. A total of 48 girls, 94% of 
those contacted, participated. 


Design 


The experimental conditions consisted of two lev- 
els of centrality (low, high) and two levels of locus 
of attribution (external, self). Subjects were allo- 
cated to experimental conditions so that an equal 
number of those scoring above and below the median 
on the Rosenberg self-esteem scale (administered 
months earlier) were placed in each. Six high-self- 
esteem and six low-self-esteem subjects were run in 
each experimental condition, yielding a 2 × 2 × 2 
factorial design. The experimenter was aware of 
the centrality and locus of attribution treatments 
administered, but was blind to the subject’s level of 
self-esteem. 
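
The cell structure described above can be sketched as a short enumeration. This is only an illustration of the counts stated in the text (two levels per factor, six subjects per cell); the variable names are assumptions introduced here, not terms from the study.

```python
from itertools import product

# Factor levels as named in the Design section.
centrality = ["low", "high"]
locus = ["external", "self"]
self_esteem = ["low", "high"]  # median split on the Rosenberg scale

SUBJECTS_PER_CELL = 6  # stated in the text

# Every combination of the three two-level factors is one cell.
cells = {cell: SUBJECTS_PER_CELL
         for cell in product(centrality, locus, self_esteem)}

total_subjects = sum(cells.values())

# Subjects in one experimental condition (centrality x locus),
# collapsing over self-esteem.
per_condition = sum(n for (c, l, s), n in cells.items()
                    if c == "high" and l == "self")

# Subjects behind one centrality x self-esteem mean,
# collapsing over locus of attribution.
per_centrality_by_esteem = sum(n for (c, l, s), n in cells.items()
                               if c == "high" and s == "low")
```

Under these counts the eight cells account for all 48 subjects, and each Centrality × Self-Esteem mean rests on 12 subjects.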


Procedure 


A male experimenter ushered the subject into the 
experimental room and offered the following cover 
story: 


The research staff is developing techniques for 
training people to teach themselves to identify 
instances of neurotic interaction interspersed with- 
in otherwise healthy relationships. To ascertain 
how well these techniques are likely to work, the 
help of college students is being solicited. 



He then told the subject that she would hear 12 
dialogues excerpted from the spontaneous interac- 
tions of two graduate students. Her task would be 
to rate how neurotic each of these dialogues was on 
a 9-point scale, ranging from “not neurotic at all” to 
“very neurotic.” Her performance would be evaluated 
by comparing her ratings with those of a team of 
clinical psychologists who had previously rated the 
dialogues. 

The experimenter indicated that the task would 
last about 25 minutes, and that he would be in 
another room so as not to distract the subject. Each 
dialogue would be followed by a -minute interval, 
during which the subject would announce her rating 
over an intercom system, and the experimenter would 
then report the rating given that dialogue by the 
team of clinical psychologists. One ground rule was 
announced: during the task, the subject and the ex- 
perimenter would be restricted to exchanging ratings 
and were to engage in no other conversation over the 
intercom. 

Before the task began, the experimenter drew the 
subject's attention to the possibility of seeking help. 
He explained that if she felt that she needed some 
help, she should feel free to consult a set of guide- 
lines on how to identify neurotic interactions which 
had been specially prepared by the team of psy- 
chologists. 


Independent Variables 


Centrality was manipulated by varying whether 
or not the act of seeking help from the guidelines 
might shed doubt on two attributes presumed to be 
of central importance to most people, their intelli- 
gence and their mental health. In the high cen- 
trality condition, it was suggested that using the 
guidelines implied an admission of weakness on 
these attributes. In the low centrality condition, by 
contrast, the implications of consulting the guide- 
lines were disassociated from these attributes. 


[Low] Research has shown that the ability to 
identify neurotic interactions has nothing to do 
with a person’s own mental health and native 
intelligence. Still, it is unreasonable to expect that 
everyone should be able to perform at the same 
level of proficiency on this task. Consequently, 
Chicago has provided written guidelines to help 
certain people perform adequately. Those people 
who feel that they are relatively unskilled at 
identifying neurotic interactions will most likely 
need to rely on the guidelines. 


[High] Research has shown that the ability to 
identify neurotic interactions depends largely upon 
a person’s own mental health and native intelli- 
gence. Therefore, it is unreasonable to expect that 
everyone should be able to perform at the same 
level of proficiency on this task. Consequently, 
Chicago has provided written guidelines to help 
certain people perform adequately. Those people 
who feel that they are handicapped, either be- 
cause they are unable to have healthy relationships 
themselves, or because they are not as smart as 
the average person, will most likely need to rely 
on the guidelines. 


In both conditions, the experimenter then presented 
the guidelines as follows: 


The guidelines are in the manila envelope. They 
are one page long and take a couple of minutes to 
read. Wait until you have rated at least two 
dialogues by yourself before you consider using 
the guidelines. That is, give yourself a chance to 
see how accurate your judgments are without 
them. Then, if you feel you need to rely on them, 
stop the tape before you go over to the table to 
use the guidelines. 


Locus of attribution was manipulated by varying 
two elements in the instructions intended to affect 
whether poor performance on the task would be 
perceived as externally determined or as reflecting 
upon attributes of the self. It was postulated that 
when subjects believed the use of the guidelines to 
be a common response, and when they perceived the 
training procedure to be poorly developed, they would 
attribute their failure to characteristics of the ex- 
ternal situation (the task). Attribution to the self, 
by contrast, was expected when the training proce- 
dure was viewed as refined and use of the guidelines 
as unusual. Locus of attribution was therefore manip- 
ulated as follows: 


[External] About 65% of the people who have 
worked on this particular task have found that 
they needed to consult the guidelines. One reason 
for this may be that the dialogues don’t provide 
enough information. The clinical psychologists who 
rated the conversations had access to additional 
information on the pairs of people which you will 
not be given. It is also possible that the con- 
struction of the tape does not afford enough time 
in which to evaluate the evidence and rate the 
dialogues adequately. One of the things we may find 
is that some revision in this training procedure is 
necessary for it to be effective. If we find that 
people are having difficulty with the ratings, we 
may have to return the tape to Chicago for major 
additions in dialogues and expansion in time. 


[Self] About 10% of the people who have worked 
on this particular task have found that they needed 
to consult the guidelines. At first, we thought that 
this might be because the dialogues didn’t provide 
enough information. But Chicago has assured us 
that they deliberately selected portions of the 
dialogues which contain informative cues for the 
average person to make accurate judgments. More- 
over, these excerpts were specially chosen in order 
to allow people to make judgments when they 
have only a brief period of time for deliberation. 
The cues in the dialogues were selected on the 
basis of their adequacy in allowing people to exer- 
cise considered judgment even in the brief period 
available between dialogues. 


After receiving these instructions, the subject was 
told that an estimate of her initial level of skill in 
identifying neurotic interactions was needed to help 
evaluate the training procedure. She was asked to 
estimate her skill as compared with the average col- 
lege student using a 9-point scale, and then she 
turned on the tape recorder to begin the task. 

Following the subject’s rating of each dialogue, 
the experimenter provided bogus ratings by the 
clinical psychologists. The experimenter determined 
the bogus ratings by adding or subtracting a pre- 
scheduled number from the subject’s rating according 
to the following pattern: the first rating was off by 
plus or minus 3; the next three, by plus or minus 2; 
the next, by plus or minus 3; and the remainder 
fluctuated between 1 and 3 off. This pattern was 
designed to make it difficult for the subject to deny 
her incompetence at the task, and by implication, 
her need to seek help. 
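
The feedback schedule just described can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the text gives only the size of each discrepancy, so the rule for choosing plus or minus (any direction that keeps the result on the 9-point scale) and the random draw for the later dialogues are assumptions, and `bogus_ratings` is a hypothetical name.

```python
import random

SCALE_MIN, SCALE_MAX = 1, 9          # 9-point neuroticism scale
FIXED_OFFSETS = [3, 2, 2, 2, 3]      # discrepancies for dialogues 1-5 (from the text)


def bogus_ratings(subject_ratings, rng=None):
    """Return fabricated 'clinical psychologist' ratings for each subject rating."""
    rng = rng or random.Random()
    feedback = []
    for i, rating in enumerate(subject_ratings):
        if i < len(FIXED_OFFSETS):
            offset = FIXED_OFFSETS[i]
        else:
            # Assumption: later discrepancies are drawn at random from 1 to 3.
            offset = rng.randint(1, 3)
        # Assumption: use whichever direction stays on the scale;
        # if both directions are possible, pick one at random.
        candidates = [r for r in (rating + offset, rating - offset)
                      if SCALE_MIN <= r <= SCALE_MAX]
        feedback.append(rng.choice(candidates))
    return feedback
```

Because no discrepancy exceeds 3 on a scale running from 1 to 9, at least one direction is always available, so every subject rating can be contradicted.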

In order to obtain the guidelines, it was necessary 
for the subject to walk over to a table on the far 
side of the experimental room. This behavior was 
monitored over a concealed closed-circuit television. 
If and when the subject consulted the guidelines, the 
experimenter recorded the number of dialogues rated 
up to that point. 

Moments after the subject finished examining the 
guidelines and resumed listening to the dialogues, the 
experimenter interrupted over the intercom. He an- 
nounced that the receiving part of his intercom had 
broken down. An “exasperated” experimenter then 
rejoined the subject and indicated that it would prob- 
ably be impossible to repair the equipment in time 
to finish. He asked her to complete a reaction form 
based on her experience thus far, while he checked 
further. Subjects who did not consult the guidelines 
completed the reaction form after rating all 12 
dialogues. The experimenter then probed for sus- 
picion (finding none) and gradually provided a full 
debriefing. Finally, the subject completed a 10-item 
version of the Achievement Risk Preference Scale 
(Atkinson & O’Connor, 1963). This scale purports to 
measure need for achievement minus fear of failure 
by presenting respondents with forced choices be- 
tween pairs of self-descriptive statements, one pre- 
sumed to characterize people whose need to achieve 
success is greater than their need to avoid failure 
and the other characteristic of people for whom the 
priority of these two needs is reversed. 


RESULTS 


Perceptions of the Setting and Experimental 
Manipulations 


To provide a valid test of hypotheses re- 
garding help seeking, it was necessary for the 
subjects to feel that they needed help to suc- 
ceed and could receive it by consulting the 
guidelines. Responses to two items on the re- 
action form indicate that most subjects felt 
that they were failing and in need of help. 
Eighty-nine percent of the subjects rated the 
accuracy of their judgments of the dialogues 
at or below the midpoint on a 7-point scale, 
and 85% described their feelings while rating 
the dialogues at or below the midpoint on a 
scale of competence. Responses to these items 
did not vary by experimental conditions. 

Responses to the question “Before you 
started to rate the dialogues, how helpful did 
you think the guidelines would be in making 
accurate judgments?” indicated that most sub- 
jects perceived the guidelines as a genuine 
source of help. Fully 88% felt that the guide- 
lines were at or above the midpoint on a scale 
of helpfulness, and there was no variation in 
helpfulness ratings by experimental condition. 
Note that even the subjects who consulted the 
guidelines had no opportunity to learn 
whether they were indeed helpful for improv- 
ing judgment because they were interrupted 
before they could rate another dialogue. 

TABLE 1 

LOCUS OF RESPONSIBILITY AND PERCEPTIONS 
OF THE TRAINING PROCEDURE 

                                      Locus of attribution condition 
Perceived qualities of                Self        External 
training procedure                    (n = 24)    (n = 24)     t 

Adequacy of time                      5.54        4.58         2.38* 
Adequacy of information               3.08        2.52         1.49 
Validity as index of ability to 
  identify neurotic interactions      3.50        2.92         1.39 
Frequency of consulting guidelines    3.04        4.13         2.29* 

ᵃ n = 23. 
*p < .05. 


The success of the locus of attribution 
manipulation was checked by four questions 
regarding the subjects’ perceptions of the 
quality of the training procedure and of the 
frequency with which the guidelines were 
typically used. Table 1 shows that all four 
differences between external and self-attribu- 
tion conditions were in the expected direction, 
but only two were significant.³ Subjects in 
the external locus of attribution condition 
were more likely than those in the self-attribu- 
tion condition to perceive the amount of time 
available to rate the dialogues as inadequate, 
and to perceive use of the guidelines as a com- 
mon response. These responses suggest that 
both bases for external versus self-attribution 
of failure (task properties and rates of per- 
formance) may have been operative. In inter- 
preting the impact of locus of attribution on 
seeking help, it will therefore not be feasible 
to trace effects to one of these bases rather 
than to the other. 

The centrality manipulation had been in- 
tended to vary the connection between the 
ability to identify neurotic interactions—and 
hence to succeed on the task—and the person’s 
own mental health and native intelligence. To 
determine the success of this manipulation, 
subjects were told and asked: “The ability to 
identify neurotic interactions may depend 
upon certain characteristics of people. How 
strongly do you think that each of the follow- 
ing might be related to that skill?” A list of 
characteristics followed, in which the two of 
interest were embedded. Table 2 shows that on 
only one of the two key items did a signifi- 
cant difference emerge between the high and 
low centrality conditions. Subjects in the high 
centrality condition considered “own capacity 
to have healthy relationships” to be more 
closely related to the ability to identify neu- 
rotic interactions than did subjects in the low 
centrality condition. The perception of a re- 
lationship between “native intelligence” and 
ability to identify neurotic interactions, while 
in the expected direction, was not statistically 
significant (p < .15). 

³ All significance tests reported in this study are 
two-tailed. 


Determinants of Seeking Help 


Indexing help seeking by the number of 
dialogues rated before consulting the guide- 
lines produced two problems: (a) Within-cell 
variances were heterogeneous and correlated 
with cell means; and (b) the number assigned 
to subjects who did not seek help was arbi- 
trary. Both of these problems were alleviated 
by trichotomizing the distribution into mean- 
ingfully different categories ordered in terms 
of readiness to seek help. Those subjects who 
chose not to seek help at all (35%) were 
assigned a score of 0. Those who rated at 
least two dialogues more than the minimum 
required by the instructions, but eventually 
sought help from the guidelines (27%), were 
assigned a score of 1. Finally, those who 
turned to the guidelines as soon as possible or 
after rating only one dialogue more than the 
minimum (38%) received a score of 2.⁴ 
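
The three-category index described above can be written as a small scoring rule. This is a sketch under stated assumptions: the function name and the use of `None` for subjects who never consulted the guidelines are illustrative choices, and the minimum of two dialogues comes from the instructions given to subjects.

```python
MINIMUM_DIALOGUES = 2  # subjects were told to rate at least two dialogues first


def help_seeking_score(dialogues_rated_before_help):
    """Trichotomized readiness to seek help (0 = never, 2 = soonest).

    `dialogues_rated_before_help` is None for a subject who never
    consulted the guidelines (an encoding assumed for this sketch).
    """
    if dialogues_rated_before_help is None:
        return 0  # chose not to seek help at all
    if dialogues_rated_before_help >= MINIMUM_DIALOGUES + 2:
        return 1  # rated at least two dialogues beyond the minimum, then sought help
    return 2      # sought help as soon as possible or one dialogue later
```

Under this rule a subject who turned to the guidelines after only one dialogue also receives a score of 2, which matches the treatment of the one such subject noted in the footnote.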

The analysis of variance for seeking help, 
presented in Table 3, indicates a significant 
main effect for locus of attribution. As pre- 
dicted, subjects sought help sooner and more 
frequently in the external attribution condi- 
tion (X = .71). Eighty-eight percent of those 
in the external attribution condition sought 
help at some point during the task, while only 
44% of those in the self-attribution condition 
turned to the guidelines. Computation of 
omega-square indicates that locus of attribu- 
tion accounted for approximately 11% of the 
variance in seeking help. 

⁴ One subject who sought help after listening to 
the first dialogue, despite the instruction to rate at 
least two dialogues before using the guidelines, was 
included in this category. 


TABLE 2 

CENTRALITY AND PERCEIVED CORRELATES OF ABILITY 
TO IDENTIFY NEUROTIC INTERACTIONS 

                                      Centrality condition 
                                      Low         High 
Perceived correlate                   (n = 23)    (n = 24)     t 

Sex                                   2.52        1.88         1.08 
Formal education                      2.96        3.42         .94 
Native intelligence                   3.61        4.21         1.51 
Religiosity                           .96         E25          yp 
Own capacity to have healthy 
  relationships                       4.70        5.54         2.19* 

*p < .05. 


As indicated in Table 3, the centrality of 
the attributes reflected on by failure did not 
have a significant main effect on help seeking. 
The main effect for self-esteem also fell short 
of significance (p < .15), supporting neither 
the cognitive consistency nor the vulnerability 
formulations. A clearer picture of the oper- 
ation both of centrality and self-esteem can 
be obtained from examination of the signifi- 
cant interaction between these two variables, 
portrayed graphically in Figure 1. 

Post hoc comparisons among the four means 
involved in this interaction (by Duncan’s 
multiple-range test) showed a tendency for 
centrality to have the predicted effect on seek- 
ing help among those high in self-esteem (p < 
.06), but subjects low in self-esteem were 
equally likely to seek help under high or low 
centrality conditions. The form of the inter- 
action suggests that there was little inhibition 
of help seeking when failure did not reflect 
on central attributes or when the potential 
seeker of help already held a low evaluation 
of herself. Inhibition was apparent, however, 
for those with a high level of self-esteem when 
seeking help might symbolize admission of 
inadequacy on central attributes. 


TABLE 3 

ANALYSIS OF VARIANCE FOR HELP SEEKING 

Source                         df     MS      F 

Locus of responsibility (A)     1    4.69    7.65** 
Centrality (B)                  1     .52    <1 
Self-esteem (C)                 1    1.69    2.76 
A × B                           1     .02    <1 
A × C                           1     .02    <1 
B × C                           1    2.52    4.12* 
A × B × C                       1    1.02    1.67 
Within error                   40     .61 

*p < .05. 
**p < .01. 


[Figure 1: separate lines for high-self-esteem and low-self-esteem subjects, plotted against low and high centrality.] 

FIG. 1. Mean scores for help seeking as a function 
of centrality and self-esteem. (Each mean is based 
on n = 12.) 


Within the high centrality condition, self- 
esteem appears to have had the effect pre- 
dicted by the cognitive consistency formula- 
tion. Those low in self-esteem were more 
likely to seek help than those high in self- 
esteem (p < .05). The absence of an esteem 
effect under low centrality is supportive of 
Aronson’s (1968) view that dissonance is cre- 
ated only when behavior is inconsistent with 
important self-conceptions. Computation of 
omega-square indicates that the Centrality × 
Self-Esteem interaction accounted for approxi- 
mately 5% of the variance in seeking help. 
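
The two omega-square figures reported in this section can be approximately recovered from the mean squares in Table 3. The sketch below assumes the usual fixed-effects estimator, (SS_effect − df_effect × MS_error) / (SS_total + MS_error); the text does not state which formula was used, and the dictionary keys are labels introduced here for illustration.

```python
# (df, MS) for each source, as read from Table 3.
effects = {
    "locus (A)": (1, 4.69),
    "centrality (B)": (1, 0.52),
    "self-esteem (C)": (1, 1.69),
    "A x B": (1, 0.02),
    "A x C": (1, 0.02),
    "B x C": (1, 2.52),
    "A x B x C": (1, 1.02),
}
DF_ERROR, MS_ERROR = 40, 0.61

# Total sum of squares = all effect sums of squares + error sum of squares.
ss_total = sum(df * ms for df, ms in effects.values()) + DF_ERROR * MS_ERROR


def omega_squared(source):
    """Estimated proportion of variance accounted for by one source."""
    df, ms = effects[source]
    return (df * ms - df * MS_ERROR) / (ss_total + MS_ERROR)


omega_locus = omega_squared("locus (A)")
omega_interaction = omega_squared("B x C")
```

With the rounded table entries this gives roughly .11 for locus of attribution and roughly .05 for the Centrality × Self-Esteem interaction, in line with the percentages in the text.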

The combined strength of the variables 
considered thus far is evidenced in the be- 
havior of subjects in the conditions where 
seeking help was postulated to be most and 
least threatening to self-esteem. Not one of 
the subjects with high self-esteem who was 
exposed to the self-attribution and high cen- 
trality manipulations sought help. By con- 
trast, where threat to self-esteem was postu- 
lated to be weakest—for those low in self- 
esteem who were exposed to the external 
attribution and low centrality manipulations 
—every one of the subjects turned to the 
guidelines for help. 


As measured by the Achievement Risk 
Preference Scale, achievement motivation 
showed a strong negative relationship to seek- 
ing help (r = −.46, p < .01). As predicted, 
subjects whose need for achievement exceeded 
their fear of failure were less likely to con- 
sult the guidelines than those whose fear of 
failure exceeded their need for achievement. 
Since achievement motivation was measured 
postexperimentally, the possibility arises that 
failure of randomization on this variable might 
have influenced the experimental effects. Upon 
examination, it was found that achievement 
motivation was totally unrelated to assign- 
ment to either the locus of attribution or 
centrality conditions. Achievement motivation 
was, however, positively related to level of 
self-esteem (r = .40, p < .01). Despite this 
substantial correlation, achievement motiva- 
tion contributed a significant increment to the 
amount of variance in help seeking accounted 
for when added to the multiple regression 
equation which already included the experi- 
mental variables and their interactions (F = 
10.65, df = 1/39, p < .01). In fact, of all the 
variables considered, achievement motivation 
accounted for the largest proportion of vari- 
ance in seeking help (21%). 
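
Two pieces of arithmetic sit behind this paragraph. The 21% figure is consistent with the squared zero-order correlation, and the increment test has the standard form for adding one predictor to a regression equation. The symbols R²_full and R²_reduced are notation assumed here for the equations with and without achievement motivation; their values are not reported in the text.

```latex
\[
r^{2} = (-.46)^{2} \approx .21
\]
\[
F_{1,\,39} \;=\; \frac{R^{2}_{\text{full}} - R^{2}_{\text{reduced}}}
                      {\bigl(1 - R^{2}_{\text{full}}\bigr)/39}
\]
```

The single numerator degree of freedom reflects the one added predictor, and the 39 denominator degrees of freedom are the 40 within-cell error degrees of freedom less one for that predictor.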

The relationships to help seeking of various 
socioeconomic, kinship, religious, and political 
attitude variables were also examined. Only 
father’s income was significantly related to the 
use of the guidelines (r = .36, p < .02). Since 
none of the other socioeconomic status vari- 
ables was related to seeking help, it is perhaps 
best not to attribute importance to this single 
finding. 


DISCUSSION 


It was postulated that the locus of attribu- 
tion for failure would affect the seeking of 
help by influencing whether use of the guide- 
lines was experienced as threatening to self- 
esteem. Some evidence to support this inter- 
pretation of the observed locus of attribution 
effect was provided by the subjects’ estimates 
of their initial level of skill in identifying 
neurotic interactions. Prior to beginning the 
task, the subjects in the self-attribution condi- 
tion gave significantly lower estimates of their 
own skill as “compared with the average col- 
lege student” than did subjects in the external 
attribution condition (p < .01). The mean 
skill estimate by subjects in the self-attribu- 
tion condition may indicate that some of them 
recognized the potential threat to their esteem 
of failure and sought to defend against it by 
asserting in advance that failure would not 
be incompatible with their self-conception. 
Note that the skill estimates were not related 
to seeking help later (r = −.02) so that they 
cannot account for the locus of attribution 
effect. 


By varying the perceived frequency of 
guidelines used by others, the locus of attribu- 
tion manipulation may also have influenced 
the perceived “demand” characteristics in the 
situation. Although all of the subjects were 
told to feel free to consult the guidelines after 
rating the first two dialogues, those in the 
self-attribution condition (10% use) may 
nonetheless have perceived the use of the 
guidelines as illegitimate or harmful to the 
research. As a check on this interpretation of 
the locus of attribution results, the subjects 
were asked: “After rating Dialogue 2, did you 
feel that it would be harmful in any way to 
this research if you were to consult the guide- 
lines?” While there was little overall concern 
for harmfulness, there was a trend for those 
in the self-attribution condition to view the 
use of the guidelines as more harmful (p < 
.10). To control for this trend, an analysis of 
covariance was performed, with perceived 
harmfulness as the covariate. In this analysis, 
the impact of locus of attribution on help 
seeking remained significant (F = 5.19, p < 
.05), and the strength of the significant Cen- 
trality × Self-Esteem interaction was un- 
affected (F = 4.61, p < .05). 

One interpretation for the absence of a 
main effect for centrality is that the locus of 
attribution manipulation affected the impact 
of the centrality manipulation. Specifically, 
subjects in the external locus of attribution 
condition, where performance could be inter- 
preted as not reflecting on one’s own abilities 
at all, might be unaffected by any manipula- 
tion of the centrality of traits presumably 
reflected on. Contrary to the implications of 
this interpretation, however, (a) subjects in 
the two locus of attribution conditions did not 
differ in the perception of how well the task 
measured ability to identify neurotic inter- 
actions (means were close to scale midpoint); 
and (b) the significant effect of the centrality 
manipulation on the perceived connection be- 
tween that ability and one’s own capacity to 
have healthy relationships was just as strong 
under external as under internal locus of 
attribution. 

Examination of the significant Centrality × 
Self-Esteem interaction portrayed in Figure 1 
reveals that the help seeking of only one 
group failed to conform to predictions based 
on the idea that it is more threatening to 
acknowledge failure which reflects on central 
attributes. These were the low-self-esteem sub- 
jects in the high centrality condition. A 
plausible interpretation of the behavior of 
these subjects is that any threat to self- 
esteem that they experienced was offset by 
a desire to study the guidelines. The Rosen- 
berg (1965) scale of self-esteem probably re- 
flects in large part the respondent’s social 
adjustment. Faced with failure that implied 
their own inability “to have healthy relation- 
ships” in the high centrality condition, low- 
self-esteem subjects would be more likely to 
accept this implication than high-self-esteem 
subjects, and more likely to look upon the 
guidelines as providing an opportunity to ob- 
tain help with a significant personal problem. 

The above interpretation could account for 
the absence of a centrality main effect. It 
also suggests an alternative explanation for 
the self-esteem results that have been viewed 
as supportive of the version of the cognitive 
consistency formulation based on Aronson 
(1968). Differences between high- and low- 
self-esteem subjects in the use of the guide- 
lines in the high centrality condition may re- 
flect differential motivation to obtain help for 
one’s own problems, rather than differences in 
the amount of consistency between level of 
self-esteem and the implications of help- 
seeking behavior. Future research is needed 
to sort out these two explanations. 

According to the attributional analysis of 
achievement motivation, the reason seeking 
help was less common among those high in 
achievement motivation is because these per- 
sons tend to attribute their failure to lack of 
effort rather than to lack of ability. This 
interpretation received support from the signi- 
ficant tendency of those high in achievement 
motivation to report a greater sense of compe- 
tence while rating the dialogues than those 
low in achievement motivation (r = .30, p < 
.05). Achievement motivation was related 
neither to subjective estimates of skill prior 
to embarking on the task nor to subjective 
estimates of accuracy in judging the dialogues. 
Thus, those high in achievement motivation 
were as likely to expect failure and to 
acknowledge it, but they were less likely to 
attribute it to their incompetence or lack of 
ability. 

An alternative interpretation of the causal 
direction of the achievement motivation find- 
ings is that the use of the guidelines influ- 
enced arousal of achievement motivation. This 
interpretation requires that substantial dif- 
ferences in arousal persisted throughout the 
4-hour period during which subjects com- 
pleted the reaction form and were debriefed, 
prior to administration of the Achievement 
Risk Preference Scale. Evaluation of this 
alternative awaits research in which achieve- 
ment motivation is measured in advance of the 
opportunity to seek help. 


Implications 


This research has examined situational and 
individual difference variables that influence 
whether help is sought when people find 
themselves in need and when a source of help 
is accessible. As such, the research has impli- 
cations for efforts to maximize the utilization 
of helping services in society. The experi- 
mental findings suggest that the threat to 
self-esteem inherent in seeking help may often 
serve as an inhibitor. Persuasive messages in- 
tended to overcome the reluctance of people 
to seek help are more likely to succeed if they 
offer the target person(s) external justifica- 
tions for avoiding self-attribution of failure. 

The findings indicate that when possible, it 
would be well in choosing an influence strat- 
egy to take into account at least two charac- 
teristics of the target persons. If their level 
of self-esteem is high, it is particularly im- 
portant to define the situation so that at- 
tributes central to their self-conceptions are 
not likely to be undermined by the act of 
seeking help. For those strongly motivated to 
achieve on their own, efforts to persuade them 
that seeking help does not constitute an ad- 
mission of failure may be in order. 

Attempts to upgrade the productivity and 
perseverance of underachieving individuals by 
training in achievement motivation (e.g., Mc- 
Clelland & Winter, 1969) may stumble on an 
interesting paradox. The strong negative as- 
sociation between achievement motivation and 
seeking help suggests that as efforts to raise 
achievement motivation succeed, the trainee 
may become less willing to accept further 
help even if some needs continue. Increasing 
achievement motivation may improve success, 
according to the attributional analysis, when 
failure is due to lack of effort rather than lack 
of ability. When inability is the cause of 
failure, however, achievement training, by in- 
creasing the attribution of failure to lack of 
effort, may be counterproductive. 

In this experiment the effort was made to 
minimize embarrassment as an obstacle to 
seeking help, in order to focus on determi- 
nants of threat to self-esteem. This was done 
by providing an inanimate source of help, by 
removing the experimenter physically from the 
setting, and by limiting his behavior during 
the task to the mechanical recitation of num- 
bers. In natural situations, people in need 
often must contend with active help givers 
and sometimes with the presence of onlookers. 
Research on factors that influence threats to 
public esteem in help-seeking situations would 
complement the present study. 


CONCEPTUAL SYSTEM FUNCTIONING AND SELECTIVE
RECALL OF INFORMATION 1


GEORGE GREAVES 2 


Georgia State University 


Using Harvey, Hunt, and Schroder’s schema of conceptual development, 20 
System 1 subjects, 10 System 1-3 subjects, and 12 System 3 subjects were 
exposed to factual information concerning the United States and Sweden, some 
of which favored the United States, some of which favored Sweden. Half of 


received the information from the same source who this time biased all infor- 
mation in favor of Sweden. System 1 subjects recalled information concerning 
the country in whose favor the source was biased but did not recall informa- 
tion for the other country. System 3 subjects could recall pro-United-States 
information only under both conditions. System 1-3 performance was more 
similar to System 3 performance than to System 1. The author discusses the
significance and possible interpretation of the differential performance of these
groups and stresses the need for controlling for cognitive differences in
similar designs.


One of the recurring debates in social psy-
chology concerns the selective learning hy-
pothesis. On this view, subjects exposed to
information will tend to select out informa-
tion that favors their position or views while
rejecting information that they find threaten-
ing or inconsistent with their views. The de-
pendent measure has nearly always been rate
of recall, and the results have been, at best,
unambiguously inconclusive.

That selective learning does, or at least
“should” take place, has a long history in
psychological lore. James (1890) emphasized
that persons selectively attend to information
they find appealing, while Sullivan (1954,
1956) emphasized selective inattention as a
defense against threatening material. The
Freudian (Freud, 1959) analogue of this
phenomenon is, of course, repression.

In any case, by the time Levine and
Murphy (1943) presented the “classical” em-
pirical study of selective learning and forget-
ting, it appeared that this was just another
case of what anthropologist Ashley Montagu
once called “psychology’s struggle to prove
the obvious.”


1 This study was done in connection with the
author’s doctoral dissertation in partial fulfillment of
the PhD degree.

2 Requests for reprints should be sent to George
Greaves, who is now at the Outpatient Psychiatry
Department, Kitchener-Waterloo Hospital, Kitche-
ner, Ontario, Canada.

In this classic study, Levine and Murphy 
presented procommunist and anticommunist 
subjects with procommunist and anticom- 
munist readings for a period of 5 weeks. For 
the following 5 weeks, the subjects were 
simply asked to reproduce passages from the 
readings. As expected, the procommunist sub- 
jects could reproduce procommunist passages 
better than anticommunist ones, and vice 
versa. 

There were flaws in the study, of course, 
such as no controls for prior familiarity, and 
these flaws opened the door to new studies. 
Supportive data were subsequently produced 
by Alper and Korchin (1952) and Jones and 
Kohler (1958). The latter study is the most 
“elite” of the group and served as a model 
for several subsequent studies. 

However, the subsequent studies cast seri- 
ous doubt on the selective learning hypothesis, 
at least as stated. Taft (1954) reversed the 
Alper and Korchin (1952) findings. Fitzger- 
ald and Ausubel (1963) reversed the Jones 
and Kohler (1958) study. Waly and Cook 
(1966) also failed to confirm the hypothesis.

The most rigorous empirical study of se- 
lective learning-forgetting was undertaken by 
Greenwald and Sakumura (1967) with sur-
prising results. The experimenters were com-
mitted to clearly demonstrating the hypo-
thetical phenomenon on a “once and for all”
basis when, to their chagrin, their carefully 
designed study failed to produce the antici- 
pated results. So, they redesigned the experi- 
ment. Still no results. A third experiment. 
Still nothing. They were forced to conclude 
that there were still unidentified variables at 
work which tended to undermine the expected 
effect. Brigham and Cook (1969) produced 
still more negative results. Then, to further 
complicate the issue, Feather (1969), depart- 
ing radically from traditional experimental 
designs for testing this hypothesis, produced 
rather dramatic results for what he calls 
“selective recall.” 

In examining these and related studies, it 
becomes obvious that cognitive variables, 
such as the subjects’ level of conceptual sys- 
tem functioning (Harvey, Hunt, & Schroder, 
1961), may play an important role in per- 
formance in studies involving information 
input-output designs (see Greaves, 1971a). 
The purpose of the present study was to retest 
the selective learning-forgetting hypothesis as
a possible function of cognitively mediated 
behavior. 


METHOD 


The present study involved a traditional input- 
output design, while attending to subgroup differ- 
ences in conceptual complexity. Of interest also was 
the possible effect of source bias on the part of a 
high-status/high-expertise source. 


Subjects 


Harvey’s “This I Believe” test (Greaves, 1971c), 
an instrument for assessing level of conceptual sys- 
tem functioning, was administered to 258 students, 
enrolled in eight sections of introductory psychology, 
at the beginning of the course. Of this group, 42 
subjects were designated without their knowledge, 
and prior to the experimental treatment, as experi- 
mental subjects, while 20 served as controls. The 
distribution of subjects by class assignment is shown 
in Table 1. Due to the comparative absence of Sys- 
tem 2 and System 4 students enrolled in these classes, 
these subjects were not available for experimental 
purposes. 

Subjects were told by their class instructors at 
the beginning of the quarter that they would have 
visiting speakers from time to time. About 6 weeks 
into the course, it was announced that in a few 
days “Mr. Greaves, a doctoral student in social 
psychology,” would be speaking to them about so- 
cial psychology and his interests. 


TABLE 1


DISTRIBUTION OF SUBJECTS BY CLASS ASSIGNMENT


Class | System 1 | System 1-3 | System 3 | Total N
1 | 3 | 2 | 1 | 6
2 | 3 | 1 | 2 | 6
3 | 4 | 2 | 3 | 9
4 | 3 | 3 | 2 | 8
5 | 5 | 1 | 2 | 8
6 | 2 | 1 | 2 | 5
7 | 5 | 4 | 4 | 13
8 | 3 | 2 | 2 | 7


At the appointed time the author went to the 
class and explained his interest in culturally induced 
effects on behavior. To demonstrate this, he con-
tinued, he had brought along some slides he had
acquired from the “Swedish Educational Service” (no 
such organization) which demonstrate how differ- 
ences in cultural ideals had resulted in marked 
behavioral differences between United States and 
Swedish citizens. 

At this point the experimenter showed a group
of 120 slides which depicted a wide variety of 
scenes of the United States and Sweden, as well as 
graphs, charts, and drawings of rates of crime, 
divorce, birth, death, health, sexual activity, and the 
like. Since all the writing on the slides was in 
Swedish, the experimenter translated them one by 
one, strongly endorsing them for their accuracy (and 
they were accurate), and frequently giving the 
source of the data (United States Statistical Ab- 
stract, Department of Commerce, etc.). 

Classes 1, 2, and 3 (see Table 1) received a nar- 
ration by the experimenter which was biased ex- 
tremely in favor of the United States and against 
Sweden (pro-United-States/anti-Sweden version), 
regardless of what the facts warranted. For instance, 
while the health record of Sweden is far superior 
to that of the United States, the experimenter 
derided this bit of information by pointing to 
Swedish “socialism” and to how the Swedish gov- 
ernment had effectively wrested free enterprise from 
the hands of Swedish physicians. Classes 4, 5, and 
6 received precisely the same slides and statistical 
information, word for word, but this time the in- 
terpretation of all facts was strongly biased in 
favor of Sweden (pro-Sweden/anti-United-States 
version).

Following the 25-minute presentation, the “guest 
speaker” (experimenter) informed the class that the 
“Swedish Educational Service,” which had supplied 
the slides and information, was interested in getting 
American students’ reactions to the information and 
seeing how effective the slides were in getting it 
across. Questionnaires containing 22 questions con-
cerning death rates, health figures, murder rates,
etc., which had been given in the audiovisual pres-
entation, were then handed out with an accompany-
ing answer sheet. Subjects were given 20 minutes to
fill out the answer sheet (which was in a 16-part
multiple-choice format for each of the 22 questions).
Both time and format had been previously worked
out in a pilot study in another city.


3 This subject matter was chosen since “the
American way of life” has proven to be a highly
ego-involving subject in conceptual systems research.

4 The United States death rate being almost twice
as high.

Classes 7 and 8 were simply given the question-
naire cold under the guise that the “Swedish Educa-
tional Service” wanted to find out how much Ameri-
can students actually knew about Sweden. This pro- 
vided a basis for assessing the stimulus properties of 
the questionnaire and answer sheet and response pat- 
terns, independent of the experimental treatment. All 
groups were administered the questionnaire and 
answer sheet again at an interval of 10 days to see 
if there were any differences in long-term memory 
decay as a function of any of the factors considered. 
All subjects were debriefed during the last week of 
the quarter. 

In all classes, the answer sheets of the predesig- 
nated experimental subjects (who were naive re- 
garding their status) were kept and scored, while 
the other sheets were discarded. The purpose of run- 
ning subjects this way was, of course, to minimize 
experimental demand characteristics by neither in- 
forming any class member that he was a part of any 
experiment nor that the guest speaker was anything
other than exactly that. 

Responses were scored on the basis of the devi- 
ance from the correct answer. That is to say, the 
dependent measure was “how much” the subject 
misconstrued the correct information provided him 
in the experimental treatment (the “deviance score”).
Such scoring, it was felt, would be much more sensi-
tive to systematic response bias than the somewhat
crude “either/or” scoring of many earlier studies. The
final scores, by subject, were given in “percentage 
accurate” terms, computed on the basis of the fol- 
lowing formula: 


Percentage accurate score =

\[ 100\% - \left[ \left( \frac{\sum \text{deviance scores}}{\text{maximum possible deviance}} \right) \times 100 \right]. \]
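
The scoring rule can be illustrated with a short sketch. The article does not say how the deviance of a single answer was measured, so the sketch assumes (hypothetically) that the 16 response options of an item are ordered and indexed 0 to 15, that an item's deviance is the number of steps between the chosen and the correct option, and that the maximum possible deviance of an item is the distance from the correct option to the farther end of the scale. The function names are illustrative only.

```python
def item_deviance(chosen, correct):
    """Assumed deviance for one item: steps between chosen and correct option."""
    return abs(chosen - correct)


def max_item_deviance(correct, n_options=16):
    """Assumed largest deviance attainable on one item (farther end of the scale)."""
    return max(correct, n_options - 1 - correct)


def percentage_accurate(chosen, correct, n_options=16):
    """100 minus summed deviance as a percentage of the maximum possible deviance."""
    if len(chosen) != len(correct):
        raise ValueError("chosen and correct must have the same length")
    total = sum(item_deviance(c, k) for c, k in zip(chosen, correct))
    total_max = sum(max_item_deviance(k, n_options) for k in correct)
    if total_max == 0:
        raise ValueError("maximum possible deviance must be positive")
    return 100.0 - (total / total_max) * 100.0


# Three hypothetical items: deviances 0, 2, 7 out of maxima 12, 8, 15.
score = percentage_accurate([3, 10, 7], [3, 8, 0])
print(round(score, 2))
```

Under these assumptions a subject who answers every item correctly scores 100, and a subject who always picks the option farthest from the correct one scores 0.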


TABLE 2


MEANS FOR UNITED-STATES- AND SWEDEN-
FAVORING RESPONSES (IN ARCSINES)


Item | System 1 | System 3 | Combined
Pro-United-States bias
United-States-favoring | 2.3333 | 2.3045 | 2.3189
Sweden-favoring | 2.0673 | 2.0041 | 2.0357
Pro-Sweden bias
United-States-favoring | 1.9628 | 2.4310 | 2.1969
Sweden-favoring | 2.2185 | 2.1458 | 2.1822


TABLE 3


CELL LOADINGS (ns) FOR ANALYSIS OF VARIANCE


Group | Condition | Repeated measure c1 | Repeated measure c2
a1 (System 1) | b1 (pro United States/anti Sweden) | 10 | 10
 | b2 (pro Sweden/anti United States) | 10 | 10
a2 (System 1-3) | b1 (pro United States/anti Sweden) | 5 | 5
 | b2 (pro Sweden/anti United States) | 5 | 5
a3 (System 3) | b1 (pro United States/anti Sweden) | 6 | 6
 | b2 (pro Sweden/anti United States) | 6 | 6

There were two scores for each subject: (a) his per- 
centage accurate score on information whose factual 
content judges held favoring the United States and 
(b) his percentage accurate score on information 
judges held favoring Sweden. The percentage accurate 
scores were subsequently converted to arcsines, and 
analyses of variance and appropriate t tests were
run (see Table 2). 
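
The conversion of percentage scores to arcsines can be sketched as follows. The article does not state which form of the transformation was used; the sketch assumes the common variance-stabilizing form 2·arcsin(√p), expressed in radians, whose range of 0 to π is at least compatible with the magnitudes reported in Table 2. The function names are illustrative.

```python
import math


def arcsine_transform(percent):
    """Assumed transform: 2 * arcsin(sqrt(p)) in radians, p a proportion."""
    p = percent / 100.0
    if not 0.0 <= p <= 1.0:
        raise ValueError("percent must lie between 0 and 100")
    return 2.0 * math.asin(math.sqrt(p))


def inverse_arcsine_transform(phi):
    """Map a transformed value back to a percentage."""
    return 100.0 * math.sin(phi / 2.0) ** 2


for pct in (50.0, 75.0, 100.0):
    phi = arcsine_transform(pct)
    print(pct, round(phi, 4), round(inverse_arcsine_transform(phi), 2))
```

The transform stretches the scale near 0% and 100%, where raw percentages have compressed variance, which is the usual reason for applying it before an analysis of variance.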


RESULTS 


The cell loadings for the analyses of vari-
ance are shown in Table 3, and the analyses
themselves are in Tables 4 and 5. For the
United-States-favoring information there was
a significant System Type × Presentation
Bias interaction, while for the Sweden-favor-
ing information there was a main effect by
system type and for presentation bias. There
was, of course, an anticipated main effect for
memory loss over time, but there were no
significant system type or presentation inter-
actions with the time factor, indicating the
decay in recall to be uniform.


TABLE 4


ANALYSIS OF VARIANCE IN ACCURACY SCORES FOR
SWEDEN-FAVORING QUESTIONS


Source of variance | SS | df | MS | F
Between subjects | | 41 | |
System type (A) | .2462 | 2 | .1231 | 5.699*
Source bias (B) | .2032 | 1 | .2032 | 9.407**
A × B | .0244 | 2 | .0122 |
Error between subjects | .7979 | 36 | .0216 |
Within subjects | | | |
Time of presentation (C) | .8730 | 1 | .8730 | 16.503**
A × C | .0360 | 2 | .0180 |
B × C | .0611 | 1 | .0611 | 1.155
A × B × C | .0379 | 2 | .0190 |
Error within subjects | 1.9055 | 36 | .0529 |
Note. Unweighted-means solution (Winer, 1962).
*p < .05.
**p < .01.


TABLE 5


ANALYSIS OF VARIANCE IN ACCURACY SCORES FOR
UNITED-STATES-FAVORING QUESTIONS


Source of variance | SS | df | MS | F
Between subjects | | 41 | |
System type (A) | .1067 | 2 | .0534 | 1.679
Source bias (B) | .0051 | 1 | .0051 |
A × B | .4622 | 2 | .2311 | 7.267**
Error between subjects | 1.1440 | 36 | .0318 |
Within subjects | | | |
Time of presentation (C) | 4.0315 | 1 | 4.0315 | 36.287**
A × C | .2199 | 2 | .1100 |
B × C | .3214 | 1 | .3214 | 2.893*
A × B × C | .1041 | 2 | .0521 |
Error within subjects | | | |
Note. Unweighted-means solution (Winer, 1962).
*p < .10.
**p < .01.
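
The F ratios in the analysis-of-variance tables can be checked from the printed mean squares. The sketch below assumes the usual rule for a mixed design with one repeated factor: between-subjects effects are tested against the between-subjects error term and within-subjects effects against the within-subjects error term. The values are the mean squares printed for the Sweden-favoring questions; the function name is illustrative.

```python
def f_ratio(ms_effect, ms_error):
    """F statistic as the ratio of an effect mean square to its error mean square."""
    if ms_error <= 0:
        raise ValueError("error mean square must be positive")
    return ms_effect / ms_error


MS_ERROR_BETWEEN = 0.0216  # error between subjects, Sweden-favoring questions
MS_ERROR_WITHIN = 0.0529   # error within subjects, Sweden-favoring questions

print(round(f_ratio(0.1231, MS_ERROR_BETWEEN), 3))  # system type (A)
print(round(f_ratio(0.2032, MS_ERROR_BETWEEN), 3))  # source bias (B)
print(round(f_ratio(0.8730, MS_ERROR_WITHIN), 3))   # time of presentation (C)
```

These ratios agree with the F values reported for system type, source bias, and time of presentation in that table.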

Specific within-subjects differences in recall
found were the following: (a) When faced
with the pro-United-States bias by the source,
System 1 subjects recalled the United-States-
favoring information far better than that
favoring Sweden (t = 5.77, p < .001, two-
tailed test). (b) Conversely, when faced with
the pro-Sweden bias by the source, System 1
subjects recalled the Sweden-favoring infor-
mation better than that favoring the United
States (t = 2.66, p < .02, two-tailed test).
(c) System 3 subjects answered the United-
States-favoring information more accurately
under both conditions. (Under the pro-United-
States bias, t = 2.941, p < .02, two-tailed
test; pro-Sweden bias, t = 2.545, p < .05,
two-tailed test.) (d) System 1-3 subjects
answered United-States-favoring questions
better than Sweden-favoring ones under the
pro-United-States bias (t = 2.475, p < .05,
two-tailed test), but given the pro-Sweden
bias, these subjects answered both groups of
questions equally well (t = 1.189, ns).

Between subjects, the following was found:
(a) System 1 subjects recalled Sweden-favor-
ing information better given a pro-Sweden
bias on the part of the source (t = 8.73, p <
.001, two-tailed test). (b) Conversely, System
1 subjects recalled United-States-favoring in-
formation more poorly when faced with the
pro-Sweden bias (t = 22.47, p < .001, two-
tailed test). (c) As a group, System 3 and
System 1-3 subjects did not differ as a func-
tion of the source’s bias in their ability to
recall information (t < 1 in both cases). It
should be remembered from above, however,
that these subjects were less able to recall
Sweden-favoring information in any case, par-
ticularly the System 3 subjects.

The control subjects did not differ from one
another either by system-type performance or
by ability to respond accurately to the United-
States-favoring versus the Sweden-favoring
items, and performed uniformly poorer than
subjects who had received the experimental
treatment.

Debriefing took place in two stages, one
involving a chance to interact with the
experimenter after he had presented the
rationale, results, and need for the study and
deception, another involving a chance to
interact with their class instructor.
All indications are that the deception itself
was highly successful, with only 2 persons
from over 200 who received the presentation
indicating (on questionnaires or in direct inter-
action) that they were even “suspicious.” As
for hurt feelings, feelings of being taken ad-
vantage of or duped, and other such unfor-
tunate side effects of deception experiments,
the aftermath was, instead, focused on a
fascination with the experiment, and with
feelings ranging from mild annoyance to pride
at having been a part of the study.


DISCUSSION


The present study indicates that the selec-
tive learning-forgetting hypothesis can be
demonstrated in an experimental setting at a 
high degree of confidence, using a traditional 
input-output experimental design. It also 
bears out the author’s contention (Greaves, 
1971a) that cognitive variables must be at- 
tended to in such experiments in order to 
avoid throwing the baby out with the bath 
water by ignoring important systematic sub- 
group variance. 


This point is easily illustrated by making 
reference to Table 2. Under the condition 
of the pro-Sweden source bias, where the 
most dramatic differences between and within 
groups occurred, there are no significant dif- 
ferences occurring whatsoever if the scores 
from the two “subgroups” are combined (see 
column 3). Since both System 1 and System 3
subjects may be expected to appear regularly 
in any given population (Greaves, 1971a), one 
would expect that any similar study done in 
which such population subgroups were not 
identified might have attributed this subgroup 
variance to “error” variance, masking the true 
experimental effect.

As to the effects of source bias in this 
experiment, the difference in performance be- 
tween System 1 and System 3 subjects is 
somewhat striking. The System 1 subjects ap- 
parently may be induced to misperceive and 
distort input, as well as ignore their normal 
reference-group values, simply on the basis 
of authority bias or directive (e.g., Harvey & 
Beverly, 1961). Such behavior seems to be 
prototypic of that associated with the “au- 
thoritarian personality” (Adorno, Frenkel- 
Brunswik, Levinson, & Sanford, 1950) and 
bears out other contentions that System 1 
subjects and authoritarian subjects represent 
one and the same group (Harvey, 1964, 1966; 
Harvey & Ware, 1968). Since authoritarian 
persons are presumably ingratiating to high- 
status persons, the behavior of these subjects 
is consistent with Jones’ (1964) findings that 
subjects’ awareness of source bias may in-
crease attitude change by showing them the
way to ingratiate the source. 

System 3 persons obviously represent a 
different breed. The literature describes these 
persons as extremely peer-group dependent 
(Greaves, 1971a; Harvey, 1964, 1966; Harvey 
et al., 1961), and one wonders if conformity 
to peer-group or reference-group norms is not 
so great as to preclude their willingness (or 
ability) to attend to discrepant input, regard- 
less of source status. The Asch (1956) studies 
certainly demonstrate that there are such 
individuals, and a recent study by the author 
(Greaves, 1971b) seems to underscore the 
reference-group orientation of System 3 indi- 
viduals. One begins to wonder at some point 
if these are not indeed the subjects that 
much of the peer-group conformity literature
is really written about. 


SUMMARY AND CONCLUSION 


If one attends to cognitive variables, selec- 
tive recall of information can be demonstrated 
at a high degree of confidence. Using Harvey 
et al.’s (1961) schema of conceptual develop- 
ment, the present study dealt mainly with 
persons at two different levels of conceptual 
development whose chief behavioral charac- 
teristics seem to include a strong authority 
orientation for one group and a strong peer- 
group orientation for the other. Failure to 
have identified these subgroups would have 
resulted in a failure to demonstrate the selec- 
tive recall phenomenon due to the “canceling” 
effects of the differential subgroup perfor- 
mances. More studies are needed to determine 
the ramifications of being able to readily 
identify these subgroups in experimental 
settings, since these subjects may often be 
expected to act differentially toward the same 
stimuli. 


STIMULUS CHARACTERISTICS AND SUBJECT PREJUDICE
AS DETERMINANTS OF STEREOTYPE ATTRIBUTION 1


JACK M. FELDMAN 2 


University of Illinois 


The study of the trait-attribution process 
has been known by many names, and various 
methods have been used to study it. Some of 
the earliest studies of trait attribution were 
investigations of racial and ethnic stereotypes 
(e.g., Katz & Braly, 1958), in which subjects 
were asked to check the personality traits that 
they thought were typical of various racial, 
religious, and ethnic groups. This technique 
has continued into the 1960s (Erlich, 1962; 
Karlins, Coffman, & Walters, 1969) and 
has been extended to social class (David- 
son, Reissman, & Myers, 1962), occupation 
(Brown, 1962; Ulrich, Hecklich, & Roeber, 
1966) and “social role” stereotypes (Sarbin, 
1964). These studies have demonstrated that 
complexes of traits are assigned to stimulus 
persons on the basis of information about 
their race, social class, etc., and that these 
trait complexes are shared to a certain extent 
by all members of the (subject) population. 

Studies of “trait centrality” and “implicit
personality theory” have shown that informa-
tion about personality characteristics of the
stimulus also leads to the attribution of entire
complexes of traits. Asch (1946) and Wishner
(1960) have demonstrated this using descrip-
tions of a person as “warm” for one group
and “cold” for another (see also Kelley,
1950). Koltuv (1962) has found that indi-
viduals’ ratings of others show patterns of
non-zero-trait intercorrelations which remain
significant even when corrected for halo ef-
fect and that these intercorrelations are larger
when the persons rated are less familiar to
the subject.


1 This study was conducted in the spring of 1967
in partial fulfillment of the requirements for the
MA degree in psychology at the University of Illi-
nois. The author is indebted to H. C. Triandis for
his advice and critical comments at all stages of
this research.

2 Requests for reprints should be sent to Jack M.
Feldman, Department of Psychology, University of
Illinois, Champaign, Illinois 61820.

A connection between stereotyping and im- 
plicit personality theory studies was made by 
Secord and Berscheid (1963), who found that 
traits contained in the black stereotype tended 
to elicit other stereotyped traits to a greater 
extent than they elicited nonstereotyped 
traits, regardless of the stimulus person’s race. 
Tajfel, Sheikh, and Gardner (1964) showed 
that members of a stereotyped group are 
judged more similar on traits which are part 
of the stereotype than on traits which are not. 

Secord, Bevan, and Katz (1956) and Secord 
(1959) have further demonstrated that stereo- 
typing varies directly with the subject’s cate- 
gorization of the stimulus person into a stereo- 
typed group. Secord (1958) attributed this 
phenomenon to the subjects’ desire to comply 
with the experimenter’s request to describe 
the stimulus person’s personality, which forces 
the use of the only information source avail- 
able—that of the stimulus person’s race. The 
same effect was demonstrated by Thurlow 
(1962), whose subjects judged photographed
stimulus persons’ personality traits to be more
like those of a “murderer” or a “senator,”
depending on the probability with which they
thought the photographs would be those of
murderers or senators. (See also Brigham,
1971, for a more complete exposition of this
point.)

Another important source of information 
about stimulus persons is the behavior of that 
person—his type of speech or dress, his choice 
of leisure activities, his choice of profession, 
performance or nonperformance of role re- 
quirements, and so on. Jones and Davis 
(1965) have formulated a theory of trait at- 
tribution based on the interpretation of such 
behavior. While too detailed to consider here, 
the basic premise is that persons draw infer- 
ences from others’ behavior to dispositions in 
their personalities and that these inferences 
are based, in part, on the perceived value of 
the outcome of the observed behavior to the 
stimulus person. Thus, for example, the person 
who works diligently to put himself through 
college would be seen as valuing education 
more than the person who does not, other 
things being equal. 

The process by which information of vari- 
ous types and from various sources is com- 
bined into a unified impression of the stimulus 
person is suggested by several types of studies. 

Fishbein (1965) has demonstrated that the 
attitude of a subject toward an object may 
be predicted from knowledge of the subject’s 
beliefs about that object (the character- 
istics attributed to it) and the subject’s 
evaluation of those characteristics. Triandis 
and his associates (Triandis, 1964; Triandis 
& Davis, 1965; Triandis, Loh, & Levin, 
1966; Triandis, Tanaka, & Shanmugam, 
1966; Triandis & Triandis, 1965), using 
a measure of behavioral intentions as well 
as the evaluation scale of the semantic 
differential, have demonstrated that com- 
plex stimulus persons, varying along such 
dimensions as race, religion, nationality, 
occupation, etc., are evaluated differently
by subjects in a consistent and reliable
fashion. It is reasonable to conclude that
the complex stimuli differ in evaluation
as a function of the beliefs that subjects hold
about them, and that the factors that con-
tribute heavily to evaluation also contribute
to belief.

That individual differences in the attribu- 
tion process exist is also suggested by the 
above studies. High scorers on the F scale, 
or those showing a high degree of anti-Negro 
bias, typically place more emphasis on race as 
a determinant of behavioral intentions. 
Though this effect is, in many cases, deter- 
mined by norms of behavior, it is reliable 
enough to allow one to generalize that highly 
prejudiced individuals attend to race more 
than do nonprejudiced persons. Thus, it may 
be inferred that prejudiced persons probably 
use race as a source of information more than 
the nonprejudiced. 

Cognitive rigidity, which often covaries 
with prejudice (Bettelheim & Janowitz, 1964; 
Frenkel-Brunswik, 1949; Triandis & Triandis, 
1965), is also important in trait attribution. 
The rigid or authoritarian individual has been 
shown to be less able to form integrated per-
sonality impressions and to rely more heavily
on single sources of information than multiple 
sources when those sources present incompati- 
ble information (Gollin & Rosenberg, 1956; 
Steiner, 1954; Steiner & Johnson, 1963). 

A trait implication study (Bruner, Shapiro, 
& Tagiuri, 1958) suggests a means to inte- 
grate the preceding information. It was found 
that when subjects rated the probability of 
various traits’ association with other traits, 
both singly and in combinations of two and 
three “cue traits,” two combinatorial rules 
allowed correct prediction of multiple-trait 
implications from single-trait implications. 

While the specific rules of integration are 
not relevant here, the idea of multiple-cue 
utilization in trait attribution is. If cue traits 
are considered as information sources about 
hypothetical stimulus persons, then the rea- 
soning of this study can be extended to fac- 
torial designs such as those of Triandis and 
his associates. 

It is possible to define racial and occupa- 
tional stereotypes, using words judged to have 
high probability of association with Negroes 
and low probability of association with whites, 
and different words of high probability of 
association with professionals and low proba-
bility of association with working-class per-
sons. The attribution of such stereotypes as a
whole to stimulus persons varying on a num- 
ber of different dimensions can then be studied 
to determine the extent to which additional 
and/or conflicting information influences at- 
tribution. The effects of subject prejudice on 
stereotyping may also be assessed. 

Five predictions can be made: (a) The 
race of the stimulus person will control a 
major proportion of variance in the attribu- 
tion of a racial stereotype, due to the fact 
that race implies the “racial stereotype” traits 
most strongly. (b) Likewise, occupation will 
control a major proportion of variance in the 
attribution of an occupational stereotype. 
Other predictions are: (c) Race and occupa- 
tion, and occupational and social background, 
will interact significantly (Bayton, McAllis- 
ter, & Hamer, 1965). (d) The highly preju- 
diced will stereotype on a racial basis more 
than moderate or low-prejudiced subjects. 
(e) Independent variables implying long-term 
behavior of the stimulus person (e.g., occupa- 
tion or upward and downward mobility) will 
control more variance than will factors over 
which the stimulus person has no control, such
as race.


EXPERIMENT I: STEREOTYPE 
DEFINITION STUDY 


Method 


Subjects. Subjects were 44 male and female students 
from discussion sections of an introductory social 
psychology course at the University of Illinois. They
were run during the regular class hours. 

Instrument. Two introductory psychology “honors” 
classes of 10 persons each were requested to respond 
with descriptive adjectives to four stimulus cate- 
gories: Negro, white, professional, and working class. 
A total of 107 adjectives were collected. These were 
then rated by 44 subjects on “probability judgment 
scales.” These were scales ranging from 0 to 100 in
steps of five, asking for the number out of 100 such 
persons who would show each trait. Categories to be 
rated were presented in random order to each sub- 
ject; adjective order was constant across categories. 

Analysis. The sample was randomly divided into 
two groups of 22 to allow cross-validation. Paired t
tests were then carried out in both samples on each 
adjective, between the categories Negro and white 
and professional and working class. 
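
The analysis described above can be sketched in a few lines. The sketch assumes the standard paired t statistic (mean difference divided by its standard error) and a simple random split of the 44 raters into two halves; the function names, the fixed seed, and the example ratings are hypothetical and are not taken from the study.

```python
import math
import random
import statistics


def split_halves(subject_ids, seed=0):
    """Randomly divide subjects into two equal halves for cross-validation."""
    ids = list(subject_ids)
    random.Random(seed).shuffle(ids)
    half = len(ids) // 2
    return ids[:half], ids[half:]


def paired_t(ratings_a, ratings_b):
    """Paired t statistic and standard error of the mean difference."""
    if len(ratings_a) != len(ratings_b):
        raise ValueError("each subject must rate both categories")
    diffs = [a - b for a, b in zip(ratings_a, ratings_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two subjects")
    se = statistics.stdev(diffs) / math.sqrt(n)  # SE of the mean difference
    return statistics.mean(diffs) / se, se


sample_1, sample_2 = split_halves(range(44))
# Hypothetical probability judgments (0 to 100) for one adjective.
t, se = paired_t([60, 55, 70, 65], [40, 45, 50, 35])
print(len(sample_1), len(sample_2), round(t, 3), round(se, 3))
```

In the design described, the same test would be run separately in each half, once for the Negro versus white ratings and once for the professional versus working-class ratings of every adjective.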

A hypothesis-testing methodology was chosen in
preference to correlational or factor-analytic methods
for two reasons. First, it was thought necessary to
identify those adjectives which subjects strongly
associate with the categories employed. Second, the de-
scriptive, rather than evaluative, nature of the ob-
tained stereotypes was to be emphasized, and the t-
test analysis seemed most appropriate for this purpose.


TABLE 1


PAIRED t TESTS BETWEEN NEGRO AND WHITE
CATEGORIES DEFINING THE RACIAL
STEREOTYPE


Trait | Mean: Negro | Mean: White | SE | t | % variance
Slow-moving | 39.09 | 26.13 | 4.79 | 2.70* | 12.5
Athletic | 71.59 | 43.40 | 3.77 | 7.46** | 56.4
Musical | 50.90 | 26.59 | 4.31 | 5.63*** | 41.1
Political | 32.27 | 47.50 | 5.38 | 2.83** | 13.7
Self-satisfied | 23.63 | 39.77 | 4.06 | 3.96*** | 25.0
Thrifty | 36.36 | 53.40 | 4.94 | 3.44** | 19.8
Nationalistic | 42.50 | 63.18 | 5.03 | 4.10*** | 26.4
Conservative | 32.50 | 49.31 | 4.39 | 3.82*** | 23.6
Note. For simplicity of presentation, all values are based
on one sample. Values for the second sample are comparable.
N = 22 for all cases.
*p < .02.
**p < .01.
***p < .001.

Before any adjective was accepted as part of 
either the racial or occupational stereotype, it was 
required to be significant at the .05 level or better 
in both samples. In addition, the word had to 
differentiate on a racial or occupational basis only. 
Overlapping words were not used in the final scale 
in the interest of obtaining relatively “pure” stereo-
types and consequent ease of interpretation.3
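
The acceptance rule described in this paragraph can be sketched in a few lines of Python. This is only an illustration: the function name, the tuple layout, and the p values in the usage lines are hypothetical, and reading "the .05 level or better" as p <= .05 is an assumption.

```python
ALPHA = 0.05  # significance criterion stated in the text


def classify_adjective(p_race, p_occupation, alpha=ALPHA):
    """Classify one adjective from its paired t-test p values.

    p_race and p_occupation are (p in sample 1, p in sample 2) for the
    racial and the occupational comparison, respectively.
    """
    # An adjective must reach the criterion in BOTH cross-validation samples.
    race_ok = all(p <= alpha for p in p_race)
    occupation_ok = all(p <= alpha for p in p_occupation)
    if race_ok and occupation_ok:
        return "overlapping"  # differentiates on both bases, so it is excluded
    if race_ok:
        return "racial"
    if occupation_ok:
        return "occupational"
    return "rejected"


# Hypothetical p values, for illustration only.
print(classify_adjective((0.01, 0.03), (0.40, 0.20)))
print(classify_adjective((0.01, 0.03), (0.001, 0.02)))
```

Only adjectives classified as "racial" or "occupational" would enter the final scales under this reading of the rule.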


Results 


The occupational stereotype proved to be 
much stronger and more differentiated than 
the racial stereotype; it contained more ad- 
jectives, and percentages of variance con- 
trolled (Hays, 1963) were generally greater. 
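
The statistics reported in Tables 1 and 2 can be checked with a short calculation. The sketch below assumes that the third numeric column of those tables is the standard error of the mean difference, so that t is the mean difference divided by that value. It also assumes that percentage of variance is an omega-squared estimate of the form (t^2 - 1) / (t^2 + 2N - 1) with N = 22 per category. That formula is not stated in the text; it is used here because it reproduces several of the tabled percentages (for example, 12.5 for slow-moving and 41.1 for musical). Function names are illustrative.

```python
def paired_t(mean_a, mean_b, se_diff):
    """t for a paired comparison: absolute mean difference over its standard error."""
    return abs(mean_a - mean_b) / se_diff


def percent_variance(t, n_per_category=22):
    """Omega-squared estimate computed from t, as a percentage (assumed form)."""
    t_squared = t ** 2
    return 100 * (t_squared - 1) / (t_squared + 2 * n_per_category - 1)


# Slow-moving row of Table 1: means 39.09 and 26.13, standard error 4.79.
t_slow = paired_t(39.09, 26.13, 4.79)
print(round(t_slow, 2), round(percent_variance(2.70), 1))
```

Small discrepancies from the tabled values are to be expected, since the published means and standard errors are rounded.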

Because of the imbalance in numbers of 
adjectives contained in the racial and occupa- 
tional stereotypes, it was decided to select 
those adjectives with the highest percentage 
of variance estimates for inclusion in the final 
questionnaire. Selected words are presented in 
Tables 1 and 2.4

These results led to a qualification of 
Hypothesis a, which was reformulated to


3 Due to space limitations, tables of “overlapping” 
words are omitted. Interested readers may obtain a 
list of these words, with mean differences, t-test 
results, and percentages of variance, from the author. 

4 Because of space limitations, it is impossible to 
present the entire list of adjectives. Interested readers 
may request a copy from the author. 
TABLE 2

PAIRED t TESTS BETWEEN PROFESSIONAL AND
WORKING-CLASS CATEGORIES DEFINING
THE OCCUPATIONAL STEREOTYPE

Trait        | Mean, working class | Mean, professional | SE   | t    | % variance
Coarse       | 45.22               | 20.22              | 5.10 | 4.89 | 34.2
Illiterate   | 13.18               | 1.36               | 1.90 | 6.20 | 45.9
Persistent   | 43.63               | 71.36              | 6.64 | 5.96 | 43.9
Foresighted  | 42.50               | 65.68              | 5.03 | 4.60 | 31.4
Well-dressed | 42.95               | 80.22              | 5.11 | 7.28 | 54.1
Resourceful  | 43.86               | 76.59              | 5.08 | 6.43 | 47.8
Striving     | 45.90               | 73.40              | 4.49 | 6.11 | 45.2
Independent  | 47.72               | 66.81              | 3.46 | 5.51 | 40.0
Complex      | 37.90               | 68.77              | 4.56 | 6.75 | 50.3
Creative     | 23.63               | 51.81              | 5.51 | 5.11 | 36.3
Intelligent  | 42.72               | 82.40              | 5.40 | 7.34 | 54.5
Leaders      | 21.81               | 61.31              | 6.94 | 5.68 | 41.5

Note.—For simplicity of presentation, all values are based
on one sample.


read: The race of the stimulus person will 
control a significant, but not a major, propor- 
tion of variance in the attribution of the racial 
stereotype. 


EXPERIMENT II: STEREOTYPE
ATTRIBUTION STUDY 


Method 


Subjects. A total of 115 white male subjects from 
an introductory psychology course were run, in three 
groups, for 1 hour each. Subjects were fulfilling a 
course requirement. Due to misprinted or wrongly 
assembled questionnaires, 12 subjects had to be dis- 
carded. An additional 5 subjects failed to complete 
the questionnaire, leaving a total of 98. 

Design. A five-way factorial design was used, com- 
bining repeated-measures and independent group 
analyses. The repeated-measures variables were 
race (Negro-white), occupation (professional—work- 
ing class), and social background, indexed by “father’s 
occupation” (professional-working class), embodied 
in eight “complex stimulus persons.” Between-subjects 
variables were one of three different social behavior 
descriptions attached to each questionnaire and 
prejudice, indexed by the scale described below. The 
sample was divided into high-, moderate-, and low- 
prejudiced thirds on the basis of total scores on 
this measure. 


5 The three social-behavior descriptions were sug- 
gested by Pettigrew (1964), in which he describes 
three types of responses by blacks to social rejection: 
assimilation, aggression, and withdrawal. His descrip- 
tions were paraphrased and included in the design 
as “social-behavior descriptions.” This factor was 
included in order to assess the influence of de- 
scribed versus implied social behavior on stereotype 
attribution. 
Instrument. It was decided that subjects should
respond to the eight stimulus persons on two types 
of scales in order to determine if method variance 
effects existed (Campbell & Fiske, 1959). Accord- 
ingly, those words comprising the racial and occupa- 
tional stereotypes were put into an adjective check- 
list form as well as the probability judgment scale. 

Subjects were told to check only those adjectives 
which they felt were “likely to apply to the persons 
presented” on the checklist scale. They were told to 
mark the probability scale in accord with the likeli- 
hood with which they felt the person in question 
would exhibit that trait. 

In addition, a response-set measure was included 
for each scale. This consisted of a stimulus person 
identified only by the initials “R. S.,” to be rated 
exactly as the other stimuli were. This measure was 
included to allow a statistical correction for response 
set, should one prove necessary, and also to assess 
the relationship between prejudice and response set. 

The prejudice measure used was the same as
that of Triandis et al. (1965). It consists of semantic-
differential evaluative scale ratings of nine racial
issues, ranging from “civil rights” to “interracial 
marriages.” 

For ease of scoring, some items of the occupa- 
tional and racial stereotypes were reversed. In the 
final scales, checking an adjective or giving a high 
probability response on the racial stereotype corre-
sponded to the Negro stereotype, the opposite to
the white. Checking or giving a high probability re- 
sponse on the occupational stereotype constituted a 
response in the professional direction; the opposite,
a response in the working-class direction.


The experimental task was presented to the sub-
jects as a “personality judgment inventory.” Subjects
were told they were to rate several real persons on 
various “traits” in order to show how accurate they 
could be in comparison with personality inventories 
which the people presented as stimuli had supposedly 
taken. To enhance the reality of this presentation, 
stimulus persons were referred to as “Mr.,” followed 
by two randomly selected initials; specific occupa- 
tions were given, as well as “father’s occupation.” 
These were typical professional and/or working-class 
occupations. In addition, an age between 35 and 37 
was given to each stimulus person. This narrow age 
range was not felt sufficient to influence the ratings. 
One of three social behavior descriptions was
presented before any ratings were made and
repeated before the second scale type. Subjects were
told that the description applied to all of the
stimulus persons except the “person” whose rating 
served as a response-set measure. 

The order of presentation was checklist, probability
scale, and attitude scale. Stimulus persons
appeared in a different random order within each
type of scale, but order was the same for all subjects.
Thus, order effects could occur within a scale type,
but not between scales.


Results


The scores for each subject were means 
over the 8 words of the racial stereotype and 
the 12 words of the occupational stereotype.
No transformations were performed.

Racial stereotype. Results of the analyses 
of variance of the racial stereotype are pre-
sented in Tables 3 and 4. Table 3 represents
the analysis of the checklist scale; Table 4, the
probability scale.

The two scale types give similar results. 
Race, occupation, and background main ef-
fects are all highly significant; percentage of
variance controlled by each factor is similar
across the two methods. Inspection of Table 7
reveals that the direction of effects is the
same.6

Table 7 also shows the effects for the occu-
pation and background factors to be opposite
in direction due to a significant Occupation 


6 In the interests of saving space, all nonsignificant 
F ratios and all interaction effects not replicated on 
both scale types are omitted. The author feels that 
effects confounded by method variance serve only 
to confuse the issue, even though their omission leads 
to a lack of support for the prejudice hypothesis.


TABLE 3

ANALYSIS OF VARIANCE SUMMARY TABLE FOR THE
RACIAL STEREOTYPE: CHECKLIST SCALE

Source         | F       | % variance
Race (A)       | 13.4**  | .92
Occupation (B) | 48.18** | 5.11
Background (C) | 7.92**  | .39
B X C          | 4.74*   | .220

Note.—All nonsignificant F ratios are omitted. F ratios
were computed using the Effect X Subject interaction term
as the mean square error (df = 89). % variance refers to the
value omega-squared (Hays, 1963).
* p < .05.
** p < .01.

TABLE 4

ANALYSIS OF VARIANCE SUMMARY FOR THE RACIAL
STEREOTYPE: PROBABILITY JUDGMENT SCALE

Source         | F      | % variance
Race (A)       | 14.6*  | .70
Occupation (B) | 43.36* | 4.94
Background (C) | 13.23* | .38
B X C          | 24.57* | .62

Note.—All nonsignificant F ratios are omitted. F ratios were
computed using the Effect X Subject interaction term as the
mean square error (df = 89). % variance refers to the value
omega-squared (Hays, 1963).
* p < .01.




TABLE 5

ANALYSIS OF VARIANCE SUMMARY TABLE FOR THE
OCCUPATIONAL STEREOTYPE: CHECKLIST SCALE

Source             | F        | % variance
Occupation (A)     | 555.29** | 46.16
A X Race (B)       | 9.66**   | .312
B X Background (C) | 6.75*    | .181
A X C              | 35.69**  | 1.36

Note.—All nonsignificant F ratios are omitted. F ratios
were computed using the Effect X Subject interaction term
as the mean square error (df = 89). % variance refers to the
value omega-squared (Hays, 1963).
* p < .025.
** p < .01.


TABLE 6

ANALYSIS OF VARIANCE SUMMARY TABLE FOR THE
OCCUPATIONAL STEREOTYPE: PROBABILITY
JUDGMENT SCALE

Source             | F       | % variance
Occupation (A)     | 386.51* | 52.20
A X Race (B)       | 18.66*  | .40
B X Background (C) | 16.52*  | .30
A X C              | 31.83*  | .60

Note.—All nonsignificant F ratios are omitted. F ratios were
computed using the Effect X Subject interaction term as the
mean square error (df = 89). % variance refers to the value
omega-squared (Hays, 1963).
* p < .01.


X Background interaction. Individuals of pro- 
fessional background and working-class occu- 
pation are rated significantly higher on the 
Negro stereotype than are any other indi-
viduals (checklist, F = 330.6, p < .01; prob-
ability scale, F = 20.25, p < .01).

Occupational stereotype. Tables 5 and 6 
summarize the analyses of the checklist and 
probability judgment scales. Occupation con- 
trols the major proportion of variance on 
both scales. 

Table 7 presents this effect. Professionals 
are accorded more traits, and higher ratings 
on those traits, than are the working class. 
This is consistent with the definition of the 
stereotype. 

The Race X Occupation interaction is due
to the perception of the “Negro profession-
als” as more “professional” than whites. The
difference is significant for both scales
(checklist, F = 6.12, p < .01; probability
scale, F = 8.09, p < .01).

The Race X Background interactions are
consistent with these results. Negroes of pro-
fessional background are rated higher than
TABLE 7

CELL MEANS SUMMED OVER BEHAVIOR AND PREJUDICE FACTORS

Stimulus person                                 | Racial stereotype: Checklist | Racial stereotype: Probability scale | Occupational stereotype: Checklist | Occupational stereotype: Probability scale
Negro, professional, professional background    | .257 | 37.76 | .732 | 74.82
Negro, professional, working-class background   | .248 | 39.24 | .608 | 69.22
Negro, working-class, professional background   | .349 | 46.63 | A714 | 36.95
Negro, working-class, working-class background  | .304 | 43.08 | .251 | 39.15
White, professional, professional background    | .202 | 37.07 | .620 | 68.52
White, professional, working-class background   | .204 | 36.39 | .611 | 68.53
White, working-class, professional background   | .325 | 43.91 | .190 | 36.98
White, working-class, working-class background  | .278 | 39.99 | .269 | 43.35

Note.—Means presented here are unweighted.


comparable whites (checklist, F = 5.09, p < 
.01). The probability scale means show that 
whites of professional background are rated 
lower in professional characteristics than are 
whites of working-class background (F = 
7.904, p < .01), while Negroes of professional 
and working-class background are rated about 
equal (F = 3.004, p > .05). 

The Occupation X Background interaction 
for both scales shows a definite “downward 
mobility effect”; stimulus persons of profes- 
sional background and working-class occupa- 
tion are accorded significantly fewer profes- 
sional characteristics, and are rated lower on 
these characteristics, than are any other 
stimulus persons (checklist, F = 11.93, p <
.01; probability scale, F = 13.15, p < .01).

Response set. Two-way analyses of variance 
were performed on the response-set measures 
for each scale and stereotype, using social 
behavior and prejudice as independent vari- 
ables. Dependent variables were derived as in 
previous analyses. Prejudice did not influence 
response set on any measure. The checklist 
measure of the racial stereotype was influ- 
enced by the social behavior descriptions 
(F = 7.26, p < .01), but this effect was not
replicated. 


Discussion 


The firmest substantive conclusion that 
may be drawn from these data is that race is 
not a very central trait for the sample of 
students participating in these studies. The 
results of both the stereotype definition and 
attribution studies imply that the “black 
stereotype” is, in reality, a lower-class black 


stereotype, and that when contradictory in- 
formation is presented, it takes precedence 
over race. 

The results of the stereotype definition
study support this assertion. The racial
stereotype is much weaker in general than the
occupational and is apparently confounded by
a belief on the subjects’ part that blacks are,
by and large, working-class persons. The
direction of the effects of race and occupation
in both studies shows that working-class
persons are accorded the characteristics
originally ascribed to Negroes and that whites
are accorded characteristics ascribed to
professionals.

This does not contradict the trait centrality 
position. Rather, it points out that careful 
specification of the “central trait” itself is 
necessary. For these subjects, the stimulus
person’s occupation is obviously much more 
“central” than his race, at least for the at- 
tribution of the 20 traits defining the racial 
and occupational stereotypes. 

The hypothesis that prejudiced subjects 
would use race as a central trait was not 
supported. Several alternative explanations 
exist: prejudice may not imply selective at- 
tention to race; the highly prejudiced may 
have such deviant beliefs about blacks that 
they were not represented in the stereotype; 
the range of prejudice obtained in a college 
sample may not be wide enough to produce 
the effect—that is, no very prejudiced people 
were included in the sample; the prejudice 
measure itself may be invalid; the more preju- 
diced in this sample may tend to believe more 
strongly than others that blacks are lower-
class, but when faced with contradictory in-
formation are forced to respond in terms
other than race. Of course, any or all of the
explanations may be valid.

These results do support Jones and Davis 
(1965). As Hypothesis e predicts, categories 
implying long-term behavioral dispositions 
control more variance than race. The highly 
significant effects of occupation and the sig- 
nificant Occupation X Background interac- 
tions (the “downward mobility effect”) show 
that when stimulus persons lose social posi- 
tion, they are rated higher on the Negro and 
lower on the professional stereotype. 

In conjunction with the above, Hypothe-
sis c is fully supported only for the occupa-
tional stereotype, where Race X Occupation
interactions occur, producing an “upward 
mobility effect.” That is, black professionals 
and blacks of professional background are 
rated higher on the occupational stereotype 
than are comparable whites. This agrees with 
the assertion that blacks are usually thought 
of as lower-class; it supports Jones and Davis’ 
theory as well, if it is assumed that most 
subjects believe that it is harder for a black 
to become a professional than it is for a white. 

The lack of a significant behavior-descrip- 
tion effect is probably due to the fact that the 
empirically defined stereotypes are largely 
irrelevant to the styles of behavior contained 
in the descriptions. Any future research along 
these lines should insure criterion relevance 
by including behavior descriptions in the 
elicitation as well as the attribution phase 
of the study. 

A final theoretical point concerns attribu- 
tion theory in general. The results of this 
study suggest that simple additive or aver- 
aging models are not adequate to describe 
stereotype attribution. Rather, any formal 
attribution model, whether it is concerned 
with traits or stereotypes, must consider inter- 
actions between categories which effectively 
define new, more complex categories (such as 
“upwardly mobile blacks”); these may carry 
their own set of implied traits or influence 
the attribution of “standard” stereotypes. 

Despite the negative results obtained in 
this study, individual differences have been 
shown to be important in others and should 


not be neglected in the construction of a 
formal model. Cognitive factors, such as 
rigidity, and attitudes should be included. The 
influence of belief on attitude is well docu- 
mented; the influence of preexisting attitude 
on beliefs has had relatively little investiga- 
tion. 

Methodologically, this study points out the 
need for multivariate techniques in the exami- 
nation of stereotype and trait attribution. 
Wiggins, Hoffman, and Taber’s (1969) study 
of intelligence judgments and the Triandis, 
Feldman, and Harvey (1970) study of 
stereotyping are steps in this direction. 
THE INFORMATIVENESS OF TRAIT DESCRIPTIONS


MIRIAM J. RODIN 1
San Diego State College 


Trait names are the standard stimuli in many impression-formation studies. This 
study compared the informativeness of trait names with that of other forms of 
coding interpersonal information that summarize both more and fewer behavioral 
instances. Encoders described a set of people (targets). Informativeness was mea- 
sured by how well a different group of people (decoders) matched the encoder de- 
scriptions with the targets. The forms of coding were compared under two infor- 
mation source conditions: the decoders either received all information from a single 
encoder or received some information from each encoder. Trait descriptions led to 
greater than chance decoding accuracy only under the multiple source condition. 
Even under that condition, however, the most informative codings transmitted 
about one and one-half times as much information as did the traits. It was concluded 
that trait descriptions are relatively uninformative. A question was raised about 
whether findings based on trait stimuli are applicable to the natural process of 


impression formation. 


Since trait names are the traditional and 
most experimentally convenient form in which 
to present stimulus information to the judges, 
they are used extensively in impression-for- 
mation research (e.g., Anderson, 1965; Ander- 
son & Barrios, 1961; Anderson & Jacobson, 
1965; Feldman, 1968; Hendrick & Costantini,
1970; Wyer & Watson, 1969). The typical 
procedure is to present judges with a set of 
adjectives or trait words that purport to 
describe another person (e.g., friendly, sincere, 
punctual) and then to have judges rate that 
person on a dimension such as likability. 

Of interest here is the nature of the stimulus 
information. Trait descriptions are just one of 
many alternative forms of coding interpersonal 
information. Despite their extensive use, the 
properties of trait descriptions per se have 
received little experimental attention. This 
study focuses on the property of informative- 
ness—the extent to which trait descriptions of 
other persons successfully communicate infor- 
mation about those persons. 

Informativeness is tested by having one set 
of people (encoders) describe (encode) a set of 
stimulus persons (targets), giving the encoder 
descriptions to a new set of people (decoders), 
and measuring how well the decoders can 
match the encodings with the targets. Two 
measures of informativeness are used. Com- 


1 Requests for reprints should be sent to Miriam J.
Rodin, Department of Psychology, San Diego State 
College, San Diego, California 92115. 


munication accuracy measures the number of 
correct matches, or hits. T(x,y) measures the
amount of information transmitted to the de-
coders. T(x,y) differs from communication
accuracy in that it takes consistent errors, as 
well as hits, into account. Consensual errors, 
as opposed to random errors, signify a (mis)- 
informativeness in the codings which is not 
detected by the straightforward accuracy 
measure. 

Trait names are thought to be informative 
because they summarize many behavioral 
instances. In considering alternative forms of 
coding, the number of instances summarized 
is, therefore, a dimension of primary interest. 
Along this dimension, description of another 
person may vary from description in terms of 
a single typical behavior (“He yells at people”), 
to trait description (“He’s aggressive”), to a 
metaphoric description (“He’s an injured alli- 
gator”). A typical behavior description codes 
a smaller sample of behavioral instances than 
a trait but is less open to decoder misinter- 
pretation. A well-chosen metaphor can code 
several traits, but is, of course, far more 
vulnerable to decoder misinterpretation. Given 
these considerations, it seems reasonable to 
hypothesize that, relative to both of the other 
forms of coding, the trait descriptions would 
maximize informativeness: they code more be- 
havioral instances than a typical behavior de- 
scription and are less open to decoder misinter- 
pretation than are metaphoric descriptions. 

The experimental paradigm for measuring 
informativeness allows the experimenter two
methodological options. He may present de- 
coders either with some information from each 
encoder, or with all information from a single 
encoder. On an a priori basis, one could argue
for the communicative superiority of either 
one. When information comes from multiple 
sources, the effect of a poor encoder is reduced. 
On the other hand, when all information comes 
from a single source, the decoder should have 
a far better chance of understanding the en- 
coder’s (implicit) frame of reference. In the 
absence of a good decision rule, and because 
this methodological variable seemed likely to 
affect the results, the study was carried out 
under both the single and multiple source of 
information conditions, even though a formal 
statistical evaluation of the difference between 
them was not possible. 


METHOD 
Participants 


The encoders, decoders, and targets were all faculty 
members in the same department. The 21 people in- 
volved had all been in the Psychology Department for 
at least 2 years when the study was conducted, and 
some had known one another considerably longer. The 
7 men who, in the experimenter’s judgment, were the 
most highly visible were chosen as targets. The re- 
maining 14 people were randomly assigned to either 
the decoder or encoder groups. No one refused to 
participate. 


Encoding Conditions and Procedures 


Each encoder described all seven targets under each
of the four encoding conditions. The encoding condi-
tions varied along a “number of behavioral instances
summarized” dimension. In order from “least” to
“most” along this dimension, the encoders were asked
to (a) describe the target in terms of a typical behavior,
(b) describe him using an adjective or two, (c) answer
the question, “What kind of an animal is he?” (d) an-
swer the question, “What kind of an entrée (main dish)
is he?” The latter two conditions are variants of one
another. Since the informativeness of metaphoric de- 
scriptions might well be a function of the particular 
metaphor(s) used, it seemed wise to include more than
one example of this form of coding. Examples of the
encodings from the four conditions are: “Catching you
in the hall to start a conversation that could only be
brought to a conclusion in a matter of hours”; “dis-
organized, energetic”; “possum”; “leftover ham-corn-
rice casserole.”

Boyd (1969) found greater communication efficiency 
when the targets were encoded in context, namely, 
when the encoders were aware of the array of targets 
before beginning to encode rather than being presented 
with, and coding, one target at a time. Knowing the
entire array beforehand allows the encoders to code for 
distinctive features. In this study, all coding was done 
in context and with instructions intended to maximize 
accurate decoding. 

Instructions for the four conditions were virtually 
identical except for the coding rule. As an example, 
instructions for the “typical behavior” condition were 
as follows: 


Below is a list of people that you know. Describe each 
one in terms of a typical behavior. Choose behaviors 
which will maximize the likelihood that someone else 
who also knows these people will be able to correctly 
match them with your behavioral description.


Decoding 


The decoders’ task was to match the codings with the 
targets. In the straightforward matching situation, in 
which the sets to be matched contain an equal number 
of items, missing one match implies missing two. In 
order to avoid this problem, three dummy items were 
included with the critical seven. The instruction was to 
match each of the 10 items with one of the seven targets 
and to use each target at least once. The dummy items 
were actual descriptions from the encoders, but were 
not scored. 

The decoders decoded twice under each of the four 
coding instructions, once with multiple information 
sources and once with a single information source. 
When the source was multiple, for each condition, a 
randomized 7 X 7 Graeco-Latin square was used to
assign encoder-target combinations to the seven de- 
coders (Fisher & Yates, 1948). This design permitted 
the 49 (for any one condition) target-encoder com- 
binations to be assigned in such a way that the seven 
encodings given to each decoder sampled all targets 
and encoders and repeated none. In the single source 
condition, the seven critical items were from one en- 
coder. The decoders were not told who the encoder 
was. Encodings were assigned randomly with the 
restrictions that no decoder got more than one set of 
information from the same encoder, and all encoders 
were used an equal number of times. Decoding was done 
first with multiple sources, and then, about 3-4 weeks 
later, with a single source. 
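
The multiple-source assignment can be illustrated with a small construction. The sketch below is not the randomized square taken from Fisher and Yates; it is a systematic, unrandomized 7 X 7 Graeco-Latin square built from two orthogonal Latin squares, and the 0 to 6 indices for decoders, targets, and encoders are hypothetical labels. It shows the property the text relies on: each decoder receives seven encodings that cover every target and every encoder once, and all 49 target-encoder combinations are used exactly once across decoders.

```python
N = 7  # number of decoders, encoders, and targets


def graeco_latin_assignment(n=N):
    """Return plan[d]: the (target, encoder) pairs given to decoder d.

    Two Latin squares, (d + k) mod n and (2d + k) mod n, are superimposed.
    For n = 7 they are orthogonal, so every (target, encoder) pair
    appears exactly once in the whole plan.
    """
    plan = []
    for d in range(n):
        # k indexes the seven encodings handed to decoder d.
        row = [((d + k) % n, (2 * d + k) % n) for k in range(n)]
        plan.append(row)
    return plan


plan = graeco_latin_assignment()
for decoder, pairs in enumerate(plan):
    print(decoder, pairs)
```

In practice the rows, columns, and labels of such a square would be randomly permuted before use, which preserves these properties.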


All decoders were given the following instructions: 


This is a study of various ways of coding personality 
descriptions. Encoders have been asked to describe 
people in several different ways. The targets described 
are people both you and the encoders know. The 
encoders were told that their descriptions would be 
given to decoders [You are a decoder.] whose job it
would be to match the descriptions (encodings) with 
the targets. The encoders were asked to describe the 
targets in such a way as to maximize the likelihood 
of correct matching.


Dependent Variable Measures 


The scores for the communication accuracy analyses
of variance are the number of correct matches. T(x,y)
measures the amount of information transmitted from
the encoders to the decoders [$T(x,y) = H(x) + H(y) - H(x,y)$]
(Attneave, 1957). When no information is
transmitted, the responses have no relation to the
stimuli, and T(x,y) equals zero. When information
transmission is perfect, each stimulus evokes its own
unique response, and T(x,y) equals the information
input. T(x,y) is calculated as follows: H(x) measures
the amount (number of bits) of information contained
in the stimuli [$H(x) = \log n - (1/n) \sum_i n_i \log n_i$]. H(y)
measures the amount of information contained in the
responses [$H(y) = \log n - (1/n) \sum_j n_j \log n_j$]. H(x,y) is
a measure of independence between the input and out-
put information [$H(x,y) = \log n - (1/n) \sum_{i,j} n_{ij} \log n_{ij}$].
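
The calculation of transmitted information can be sketched in Python from the three formulas given in this section. The sketch assumes base-2 logarithms, since the text reports values in bits, and takes a stimulus-by-response table of counts as input. The function names and the two example tables are hypothetical and are not the study's data.

```python
import math


def _uncertainty(counts, n):
    """log2(n) - (1/n) * sum(c * log2(c)), skipping empty cells."""
    return math.log2(n) - sum(c * math.log2(c) for c in counts if c > 0) / n


def transmitted_information(table):
    """T(x,y) in bits from a stimulus-by-response table of counts."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]        # n_i, stimulus totals
    col_totals = [sum(col) for col in zip(*table)]  # n_j, response totals
    cells = [c for row in table for c in row]       # n_ij, joint counts
    h_x = _uncertainty(row_totals, n)
    h_y = _uncertainty(col_totals, n)
    h_xy = _uncertainty(cells, n)
    return h_x + h_y - h_xy


# Each of seven stimuli evokes its own unique response.
perfect = [[1 if i == j else 0 for j in range(7)] for i in range(7)]
# Responses bear no relation to the stimuli.
unrelated = [[5, 5], [5, 5]]

print(transmitted_information(perfect))
print(transmitted_information(unrelated))
```

With seven equally frequent stimuli and perfect matching, the result equals the information input, log2(7); with unrelated responses it is zero, as the text describes.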


RESULTS 
Communication Accuracy 


In the multiple source condition, treatment 
effects are significant. The average eta estimate 
is .7; that is, about half of the variance is 
accounted for by differences in form of coding. 
The chance mean expectation of number correct 
on the matching task is 1.3. Only the typical 
behavior and adjective conditions produce 
decoding accuracy significantly better than 
chance. There are no significant decoder, 
encoder, or target effects on communication 
accuracy. 

In the single information source condition, 
form of coding remains a significant effect with 
an average eta estimate of .65. Behavioral 
descriptions are still superior to other forms of 
coding; however, all differences between other 
forms of coding disappear. Only the behavioral 
decodings are significantly more accurate than 
chance. In this condition there are significant 
encoder effects (F = 4.32, df = 6/18); whether 
these are due to differences in how well en- 
coders knew the targets or to differences in 
coding ability is not clear. 

It might be thought that message length is 
the crucial factor in the superiority of be-
havioral codings since these are necessarily
longer than the others. The number of words 
in the behavior codings ranged from 1 to 36; 
the median number was 8 words. Decoding 
accuracy was, however, unrelated to message 
length. Of the 27 correct decodings in the 
multiple source condition, 13 were at or below 
median length, and 14 were above it. 


TABLE 1

MEAN DECODING ACCURACY FOR THE FOUR DIFFERENT
FORMS OF CODING UNDER SINGLE AND
MULTIPLE SOURCE-OF-INFORMATION
CONDITIONS

Source of information | Typical behavior | Adjective | Animal | Entrée
Multiple              | 3.86*            | 2.86*     | 2.00   | 1.00
Single                | 3.57*            | 2.00      | 2.14   | 2.00

Note.—Perfect accuracy would yield a mean of 7. A formal
evaluation of the overall difference in accuracy between single
and multiple sources is not possible. However, an informal
inspection of the row sums (9.72 and 9.71) would seem to
indicate no difference.

* Significantly better than chance accuracy (p < .01).


Amount of Information Transmitted 


T(«,y) is of little interest when decoding 
accuracy does not exceed chance accuracy. 
Trait descriptions did no better than chance in 
the single source condition, and T (x,y) is pre- 
sented, therefore, only for the multiple source 
condition. The information input was 2.807 
bits of information, and this, of course, is the 
theoretical maximum that can be transmitted, 
T (x,y) for the typical behavior descriptions 
was 1.169 bits; for the traits, .743 bits; for the 
animal metaphor, .659 bits; and for the entrée 
metaphor, .740 bits. The behavior descriptions 
transmitted about one and one-half times as 
much information as the trait descriptions. 
The trait and metaphoric descriptions trans- 
mitted about the same amount of information, 
The disparity between accuracy and f(x,y) 
scores for these three conditions can be ex- 
plained by consistent decoder error with the 
two metaphor encodings. 
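
The 2.807-bit input is what one would expect for a choice among seven equally likely targets, since log2(7) = 2.807; this reading is an inference from the note to Table 1 (perfect accuracy yields a mean of 7), not something stated outright. The Python sketch below shows one common way to compute transmitted information from a table of target-by-response counts. The function name and the two example matrices are hypothetical illustrations, not the study's data.

```python
import math

def transmitted_information(confusion):
    """Return T(x,y) in bits for a matrix of counts
    (rows: actual targets, columns: decoder responses)."""
    total = sum(sum(row) for row in confusion)
    row_p = [sum(row) / total for row in confusion]
    col_p = [sum(row[j] for row in confusion) / total
             for j in range(len(confusion[0]))]
    t = 0.0
    for i, row in enumerate(confusion):
        for j, count in enumerate(row):
            if count:  # empty cells contribute nothing
                p = count / total
                t += p * math.log2(p / (row_p[i] * col_p[j]))
    return t

# Theoretical maximum for seven equally likely targets.
input_bits = math.log2(7)

# Hypothetical extremes: perfect decoding and pure guessing.
perfect = [[1 if i == j else 0 for j in range(7)] for i in range(7)]
guessing = [[1] * 7 for _ in range(7)]

# Ratio of the reported behavior and trait values.
ratio = 1.169 / 0.743
```

Perfect decoding transmits the full input and uniform guessing transmits nothing, which brackets the reported values; the ratio of the behavior to the trait figure is what the text rounds to "about one and one-half times."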


Discussion 


Although it seemed reasonable to expect 
that trait descriptions would prove more in- 
formative than the other forms of coding, this 
expectation was belied by the data. Only the 
typical behavior codings were consistently 
more accurate than chance, and they trans- 
mitted about one and one-half times as much 
information as the trait codings. The advan- 
tage of being able to summarize more be- 
havioral instances is evidently offset by the 
increased opportunities for decoder error.

In the standard impression-formation experi-
ment, judges are asked to rate a hypothetical
target described by a trait list. In contrast to 
standard procedure, the present experiment 
used targets who were known to the decoders 
and who were embedded in a small, well-defined 
stimulus array. Since these are factors which 
facilitate communication, it may be concluded 
that forms of coding which are poorly informa- 
tive in the present experimental context would 
be likely to do even less well in the standard 
experiment. 

It is questionable whether the standard 
experimental procedure is an adequate simu- 
lation of the natural process by which people 
form impressions of others. The simulation 
seems weak on at least two counts: the nature 
of the stimulus information and the use that 
judges make of that information. The trait 
information is clearly artificial in the sense that 
one does not form impressions of other people
from bare trait lists outside the laboratory.
This study suggests that trait information is 
relatively uninformative as well, or at least 
raises obvious questions concerning what, if 
anything, it communicates to the judges. In 
addition, the judges in the standard experi- 
ments may be completely bypassing the 
impression-formation process: It is entirely
possible that they arrive at their numerical 
ratings by simply combining the social desira- 
bility values associated with the traits. The 
results of experiments based on standard pro-
cedure undoubtedly reveal a great deal about
the process of combining serial information; 
it is problematic, however, whether they bear 
on the process of impression formation.

CHILDHOOD DEVELOPMENT OF FEAR OF FAILURE MOTIVATION 1

The present study was designed to investigate the importance of certain child-
parent interaction variables for the development of “fear of failure” achievement
[...] of high and low fear of [...] neutral; (c) subjects [...] behavior and
pun[...] of failure motivation than subjects whose mothers rewarded and were
neutral, respectively.


McClelland and his associates (McClel- 
land, Atkinson, Clark, & Lowell, 1953) sug- 
gested that two aspects of the achievement 
motive might be distinguished: “hope of suc- 
cess” and “fear of failure.” The former is said 
to be an approach motive, involving the an- 
ticipation of reward, while the latter is consid- 
ered an avoidance motive, involving the antici- 
pation of punishment. While the bulk of re- 
search completed after the publication of The 
Achievement Motive by McClelland et al. has 
dealt with the former (i.e., need for achieve- 
ment, or n Ach), an increasing amount of re- 
search has been devoted to fear of failure 
motivation in the past decade (e.g., Birney, 
Burdick, & Teevan, 1969; Heckhausen,
1967). While Winterbottom (1958) completed 
an early study of the childhood origins of n 
Ach, the corresponding study of fear of failure 
motivation was not done. The aim of the 
present study is to provide this complemen- 
tary set of data. 

Winterbottom (1958) related n Ach to
early experiences with independence and mas- 
tery training, using twenty-nine 8-year-old 
boys and their mothers (who completed a 
questionnaire), and obtained the following 
results: (a) high n Ach subjects had earlier 


1 This study was supported by the Group Psychol-
ogy Branch, Office of Naval Research, Contract Nonr
3591 (01).

2 Requests for reprints should be sent to Richard
C. Teevan, Department of Psychology, State Uni-
versity of New York at Albany, Albany, New York
12203.

training in her list of independence and mas-
tery behaviors; (b) they had fewer experi-
ences of being restricted in these areas, and 
although restriction training came earlier for 
them, it had been preceded by a good deal of 
training in independence; and (c) their moth- 
ers had a more positive view of their accomp- 
lishments and rewarded them more frequently, 
demonstrating more positive affect in the 
process. 

The present study used a modified version
of Winterbottom’s (1958) questionnaire to 
investigate some of the variables of child- 
parent interaction that might contribute to 
the development of fear of failure motivation. 

The specific hypotheses tested were as fol- 
lows: (a) High fear of failure subjects will 
have experienced significantly earlier maternal 
expectations for independence and achieve-
ment behaviors than low fear of failure sub-
jects. This hypothesis was based on the as- 
sumption, drawn from Winterbottom’s (1958) 
data, that early expectations for such be- 
haviors generate a general concern over 
achievement behavior. (High and low fear of
failure is based on a median break.) (b) High 
fear of failure subjects will have experienced 
more frequent neutral responses and less fre- 
quent reward than low fear of failure subjects 
when their behavior in independence and 
achievement situations satisfied parental ex- 
pectations. (c) High fear of failure subjects 
will have experienced more frequent punish- 
ment and less frequent neutral responses than 


34.0 ede 


low fear of failure subjects when their be- 
havior in independence and achievement situ- 
ations did not satisfy parental expectations, 
Hypotheses b and c are based on the finding 
(Birney et al., 1969) that the high fear of 
failure subject tends to avoid achievement 
situations when offered a choice.


METHOD 
Subjects 


The subjects were 41 male high school juniors and 
seniors from Lewisburg Area High School, Lewis- 
burg, Pennsylvania, and their mothers. Out of 100 
questionnaires sent out to mothers, there were 41 
returns, all of which were used in the present study. 
Since the sons of 20 mothers returning questionnaires 
were high fear of failure, while the sons of the re- 
maining 21 mothers were low fear of failure (there 
were 50 highs and 50 lows in the total sample, based 
on a median split), there clearly was not a bias in 
terms of the fear of failure motivation of the sub- 
jects among those mothers who did return question-
naires.
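
The claim of no response bias can be checked informally. If half of the 100 mothers contacted had high fear of failure sons, the chance that a returned questionnaire comes from such a mother is about .5 under no bias. The Python sketch below computes an exact two-sided binomial probability for 20 "high" returns out of 41. The binomial model is a simplifying assumption made here (returns are really drawn without replacement from 50 and 50), and the function name is hypothetical.

```python
from math import comb

def binomial_two_sided_p(k, n, p=0.5):
    """Exact two-sided p-value: total probability of all outcomes
    that are no more likely than the observed count k."""
    probs = [comb(n, i) * p**i * (1 - p)**(n - i) for i in range(n + 1)]
    observed = probs[k]
    # Small tolerance so that equally likely outcomes are included.
    return min(1.0, sum(q for q in probs if q <= observed * (1 + 1e-9)))

# 20 of the 41 returned questionnaires came from mothers of high
# fear of failure sons.
p_value = binomial_two_sided_p(20, 41)
```

A split of 20 versus 21 is the most probable outcome under this model, so the probability is essentially 1, in line with the authors' statement.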


Apparatus 


The apparatus consisted of the standard materials 
for the administration of the Thematic Apperception 
Test (TAT), including six TAT slides, and the ques- 
tionnaire on achievement expectations. 

The measure of fear of failure achievement motiva- 
tion used in the present study is the hostile press 
system, developed by Birney et al. (1969). This 
projective scoring system is independent of the n 
Ach scoring system, and was validated using goal-
setting behavior in a level of aspiration situation.
(See Birney et al., 1969, for a description of the 
hostile press system.) For the present study, a high 
score on the hostile press scoring system (i.e., a score 
above the median score) was defined as indicative of 
fear of failure motivation. A description of the six 
TAT slides used may also be found in Birney et al. 


Procedure 


The TAT was administered to all subjects accord- 
ing to standard neutral instructions (McClelland et
al., 1953, p. 98). Interscorer reliability for hostile
press was .94. 

The questionnaire was mailed to each subject’s 
mother. The instructions issued to all mothers for 
completing the questionnaire were as follows: 


Listed below are several different types of be- 
haviors which mothers might expect their son to be 
able to do. Since it has been found that in most 
families the mother spends more time with the 
child than the father through the early years, we 
would like only the mothers to fill out these 
questionnaires.

For each of the 20 items listed below, place a 
check beside each behavior that you expected of
your child. If you did not at any time expect a
given behavior of him, then simply leave the space 
blank. After checking a given statement, indicate 
next to the check the age at which—to the best 
of your memory—you expected it of him. 

In filling out the questionnaire, try to answer 
in terms of how you actually felt about a given 
behavior at the time your eleventh- (or twelfth-) 
grade son was still a child, rather than how you 
would feel about that particular behavior now. 
Do not answer in terms of the age at which he 
actually did accomplish a given behavior; answer
in terms of the age at which you expected it of 
him. 

The list of independence and achievement
expectations is given below:

—To first begin walking.

—To learn to tie his shoes right.

—To learn to dress himself alone and do it well.

—To stand up for his own rights with his
brothers and/or sisters, if he has any.

—To stand up for his own rights with other chil-
dren.

—To know his way around the area in which he
lives, so that he can play where he wants with-
out getting lost.

—To be willing to try new things on his own
without depending on his mother for help.

—To try hard things for himself without asking
for help.

—To be able to eat alone without help in cutting
and handling food.

—To be able to lead other children and assert
himself in children’s groups.

—To make his own friends among children his
own age.

—To hang up his own clothes and look after his
own possessions.

—To do well in school on his own.

—To be able to undress and go to bed by himself.

—To have interests and hobbies of his own.

—To earn his own spending money.

—To do regular household chores, and do them
well.

—To be able to stay at home alone during the
day.

—To make decisions, like choosing his clothes or
deciding how to spend his money, by himself.

—To do well in competition with other children.
To try hard to come out on top in sports.


The final portion of the questionnaire consisted of 
two lists of alternative reactions of the mother to 
the child’s behavior in situations, such as those listed 
in the questionnaire. The first list was designed to 
get at the nature of the mother’s response to the
child when his behavior satisfied her expectations.
The following instructions preceded this list: 


Of the six items listed below, pick the three 
which you feel are most characteristic of the way 
in which you reacted to your son when his behav-
ior in situations, such as listed in the previous 20
items, met up with or satisfied your expectations
of him. Place a 1 next to the statement which you 
feel is most characteristic of your reaction in such 
situations, a 2 next to the second most charac- 
teristic reaction, and a 3 next to the third most 
characteristic reaction in such situations. 


This list consisted of three reactions in which the 
mother rewarded the child and three relatively 
neutral reactions: 


— Kiss or hug him to show how pleased you are. 

— Tell him what a good boy he is. Praise him for 
being good. 

— Give him a special treat or privilege. 

— Do nothing at all to make it seem special. 

— Show him you expected it of him. 

— Show him how he could have done better.


The second list was designed to get at the nature 
of the mother’s response to the child when his be- 
havior did not satisfy her expectations. The following 
instructions preceded this list: 


Out of the next six statements, pick the three 
which you feel are most characteristic of the way 
in which you reacted to your son when his behav-
ior in situations, such as those listed in the first
20 items, did not satisfy or meet your expectations 
of him. Again, place a 1 next to the most charac- 
teristic reaction, a 2 to the second most character- 
istic reaction, and a 3 next to the third most 
characteristic reactions in such situations. 


This list consisted of three reactions in which the 
mother punished the child and three relatively neutral 
reactions: 


— Scold him or spank him for doing it.

— Show him you are disappointed in him.

— Deprive him of something he likes or expects, 
such as a special treat or privilege. 

— Don’t show any feeling about it. 

— Point out how he should have behaved. 

— Just wait until he does what you want. 


In both lists, all six possible reactions were mixed 
together, so that the dichotomous nature of the 
possible alternatives would be less apparent.

Along with the questionnaire, each mother received 
a letter which assured anonymity with respect to 
her answers, and was told that since we were inter- 
ested in child-rearing practices in general and not in 
specific individuals and their parents, they need not 
sign their name upon returning the questionnaire. 
This procedure was followed, since mothers might 
otherwise be biased in favor of reactions which were 
“socially acceptable.” It was felt by the authors that 
with this assurance that their answers would not be 
related to them in any way personally, mothers 
would be more likely to give honest answers. 

TABLE 1

MEAN HOSTILE PRESS SCORES FOR SONS OF MOTHERS WHO EXPECTED INDEPENDENCE AND ACHIEVEMENT BEHAVIORS EARLY AND LATE

Group    n     Mean    t         df
Early    16    4.06    21.10*    38
Late     24    2.33

* p < .001.

Actually, unknown to mothers, a code was estab-
lished whereby each subject, his mother, and the
completed questionnaire received a number. In this
way, the motivational characteristics of each subject
could be related to his mother’s questionnaire. How-
ever, since this code did, in effect, violate the guar- 
anteed anonymity of mothers’ answers, a debriefing 
procedure was considered necessary after the experi- 
ment was completed. Mothers were contacted by the 
experimenters, and an explanation was given to each 
mother for why the letter she received was worded
as it was and why the use of a code was considered
necessary. Mothers were once again assured that we
were not interested in and would not publish individ-
ual results. One hundred percent of those mothers
contacted after the completion of the study reacted
positively to the use of the code and had no objec-
tions to their data being made public as long as
their names were withheld. In all of the analyses
performed, two-tailed tests were used.


RESULTS 


The hypothesis of earlier expectations for 
independence and achievement behaviors by 
mothers of high fear of failure subjects was 
confirmed. A t test (Edwards, 1962), shown
in Table 1, indicated that mothers of high fear
of failure subjects expected such behaviors
earlier, while mothers of low fear of failure
subjects expected them later.3
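
For readers who want to relate the entries of Table 1 to one another, the Python sketch below uses the usual pooled-variance t statistic for two independent groups. The table reports no standard deviations, so the sketch only works backward from the printed means, group sizes, and t to the pooled standard deviation those figures would imply. Both function names are hypothetical, and the printed values are taken at face value.

```python
import math

def pooled_t(mean_a, mean_b, n_a, n_b, pooled_sd):
    """Independent-groups t statistic with a pooled standard deviation."""
    standard_error = pooled_sd * math.sqrt(1 / n_a + 1 / n_b)
    return (mean_a - mean_b) / standard_error

def implied_pooled_sd(mean_a, mean_b, n_a, n_b, t):
    """Pooled SD that would reproduce a reported t for the given
    means and group sizes (a back-calculation, not a reported value)."""
    return (mean_a - mean_b) / (t * math.sqrt(1 / n_a + 1 / n_b))

n_early, n_late = 16, 24
df = n_early + n_late - 2  # matches the 38 degrees of freedom in Table 1

# Values as printed in Table 1.
sd = implied_pooled_sd(4.06, 2.33, n_early, n_late, 21.10)
```

The degrees of freedom agree with the table. The implied pooled standard deviation is small relative to the difference between the means, which is simply what a t of this size requires.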

The hypothesis that high fear of failure 
subjects will have experienced more frequent 
neutral responses and less frequent reward 
responses when their behavior in independence 


3 Mothers were said to expect independence and 
achievement behaviors early or late on the following 
basis: a mean age was computed for each behavior, 
taking into consideration all mothers who reported 
expecting that behavior of their child. If the mean 
age reported by a particular mother was below the 
mean, she was categorized as having expected that 
behavior early; if it was above the mean, she was 
categorized as having expected it late. Then, taking
all 20 items into consideration, if more of her expec- 
tations fell into the early category, she was said to 
have expected these behaviors early; if more fell into 
the late category, she was said to have expected them 
late. If an equal number of late and early expecta-
tions were made, she was not placed in either cate-
gory.

and achievement situations was satisfactory
was also confirmed. A Mann-Whitney test
(Hays, 1963), computed for hostile press
scores of “reward” and “neutral” groups,
produced the following results: U = 9135,
z = 2.42, p < .02. The hypothesis that high
fear of failure subjects will have experienced 
more frequent punishment and less frequent 
neutral responses than low fear of failure sub- 
jects when their behavior in independence and 
achievement situations did not satisfy parental 
expectations was not confirmed. While the 
differences were in the predicted direction, the 
relationship did not reach significance. 
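
The early/late classification rule described in Footnote 3 can be sketched in Python as follows. The data layout, the item names, and the example ages are hypothetical. Treating an age exactly equal to the item mean as neither early nor late is an assumption, since the footnote does not cover that case.

```python
from statistics import mean

def classify_mothers(expected_ages):
    """expected_ages maps a mother's id to {item: age at which she
    expected the behavior}; items she left blank are simply absent."""
    items = {item for ages in expected_ages.values() for item in ages}
    # Mean expected age per item, over the mothers who checked that item.
    item_means = {
        item: mean(ages[item] for ages in expected_ages.values() if item in ages)
        for item in items
    }
    labels = {}
    for mother, ages in expected_ages.items():
        early = sum(1 for item, age in ages.items() if age < item_means[item])
        late = sum(1 for item, age in ages.items() if age > item_means[item])
        if early > late:
            labels[mother] = "early"
        elif late > early:
            labels[mother] = "late"
        else:
            labels[mother] = None  # equal counts: placed in neither category
    return labels

# Hypothetical answers from three mothers on two items.
example = {
    "m1": {"walk": 1.0, "tie_shoes": 4.0},
    "m2": {"walk": 1.5, "tie_shoes": 6.0},
    "m3": {"walk": 1.0, "tie_shoes": 7.0},
}
labels = classify_mothers(example)
```

In this example the third mother has one early and one late expectation and so is left unclassified. The same kind of tie would be consistent with Table 1 listing 40 mothers although 41 returned questionnaires, but the article does not say so.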

When the subjects were classified into one 
group which was rewarded for satisfactory be- 
havior and received neutral responses for un- 
satisfactory behavior, and another which 
received neutral responses for satisfactory 
behavior and punishment for unsatisfactory 
behavior, a Mann-Whitney test (Hays, 1963) 
indicated that subjects with a “neutral-pun-
ishment” profile had significantly higher hos-
tile press scores than subjects with a “reward-
neutral” profile (U = 8.5, z = 2.31, p < .05).
The neutral-punishment and reward-neutral
categories were mutually exclusive. Thus, the 
neutral-punishment profile seems to favor the 
development of fear of failure motivation. 
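
The probability levels attached to the two reported z values can be checked against the normal distribution. The Python sketch below does this using two-tailed probabilities, as the Method section specifies. It also gives the standard large-sample normal approximation for U, without a correction for ties. The group sizes behind each U are not reported, so that function is shown only in general form, and both function names are hypothetical.

```python
import math

def two_tailed_p(z):
    """Two-tailed probability of a standard normal deviate at least |z|."""
    return math.erfc(abs(z) / math.sqrt(2))

def mann_whitney_z(u, n1, n2):
    """Normal approximation to the Mann-Whitney U (no tie correction)."""
    mu = n1 * n2 / 2
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (u - mu) / sigma

p_reward_vs_neutral = two_tailed_p(2.42)  # reported as p < .02
p_profiles = two_tailed_p(2.31)           # reported as p < .05
```

Both values fall below the thresholds the authors give (roughly .016 and .021, respectively).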


DISCUSSION 


The data of the present study indicate that 
a major contribution to the development of 
fear of failure motivation is made by each of 
the following variables: (a) early maternal 
expectations of independence and achievement 
behaviors; (b) the receiving of neutral re- 
sponses for satisfactory behavior in indepen- 
dence and achievement situations; and (c) 
the receiving of punishment for unsatisfactory 
behavior in these situations. These findings 
have been confirmed in a replication of this 
study by Teevan and Fisher (1967). Further- 
more, the third hypothesis of the present 
study was confirmed in the replication study, 
adding strength to the emphasis given here 
to the role of punishment for unsatisfactory 
behavior. Thus, it would appear that the 
greater the extent to which a child experiences 
this type of interaction with his mother, the
more likely he is to develop fear of failure
achievement motivation. 

While the Winterbottom (1958) study em- 
ployed an interview technique and the present 
study did not, if the data of the present study 
are combined with those of Winterbottom, it 
may be seen that achievement strivings and 
concern over the attainment of achievement 
goals are contingent upon the mother’s early 
emphasis on independence and achievement 
behaviors. However, these early expectations 
provide only a general concern over achieve- 
ment behavior; they are not related to 
whether a child develops a positive or nega- 
tive attitude toward achievement. The dif- 
ferential development of hope of success and 
fear of failure motivation appears to be a 
function of the mother’s reinforcement pattern 
following the child’s behavior in independence 
and achievement situations. If she is neutral 
toward satisfactory behavior and punishes un- 
satisfactory behavior, the child is likely to 
develop a negative attitude toward achieve- 
ment and become motivated by his fear of 
the negative consequences of failure. If she 
rewards satisfactory behavior, the child is 
likely to develop a positive attitude toward 
achievement and become motivated by his
anticipation of the positive consequences of 
success.

The hypothesis that dissonance may have general motivational properties simi-
lar to those of noncognitive drives was supported. Subjects were offered high
or low choice to perform a dull task (high or low dissonance) and then per-
formed a verbal task which varied response competition (high or low) but
was irrelevant to the dissonance task. The predicted interaction effect on verbal
performance between dissonance and response competition was obtained. To
eliminate alternative explanations for the performance effect, a second experi-
ment varied dissonance arousal via the choice manipulation, and dissonance
reduction via high or low postchoice justification. Under low justification, a
replication of the performance effect was obtained as well as dissonance reduc-
tion via reevaluation of the dull task. Under high justification, no dissonance
effect on performance was obtained, as expected if the drive (dissonance) had
been reduced by high justification.


Theoretically, dissonance is defined as a 
negative state of psychological tension 
which is aroused by the juxtaposition of two 
mutually inconsistent cognitions (Festinger, 
1957). The individual is motivated to reduce 
dissonance by changing one or both of the 
otherwise inconsistent cognitions. For exam- 
ple, when one performs an attitudinally dis- 
crepant act, dissonance may be reduced by 
changing the attitude to be consistent with 
the act (Festinger & Carlsmith, 1959). 

The literature dealing with dissonance 
theory has been primarily concerned with 
attitude or behavior change directly instru- 
mental to the reduction of dissonance (cf. 
Brehm & Cohen, 1962; Kiesler, Collins, & 
Miller, 1969; Zimbardo, 1969). However, 
several investigators (Cottrell & Wack, 1967; 
Pallak, 1970; Waterman, 1969) have sug- 
gested that dissonance arousal may also affect 
behavior or performance unrelated or non- 
instrumental to the reduction of dissonance. 


1 The present research was supported in part by 
a faculty research grant to the first author awarded 
by the Graduate College (University of Iowa) from 
National Institutes of Health Biomedical Sciences 
Research Program funds and in part by a trainee- 
ship to the second author from National Institutes 
of Health Training Grant funds in the area of social 
psychology. Thanks are due to Paul Paulus, who

Such evidence seems broadly consistent with
Festinger’s (1957) original statement which 
included the assumption that dissonance might 
be viewed as a motivational construct with 
properties similar to those of “drive” within 
traditional learning theory. If so, the status of 
dissonance arousal as a motivational con- 
struct would be more clearly defined in rela- 
tion to more traditional areas of psychology. 

Within learning theory, motivational con- 
structs such as drive facilitate response tend- 
encies whether the responses are instrumental 
or noninstrumental to reduction of drive level. 
In task situations in which habit strength of 
the correct response is high, that is, the cor- 
rect response is dominant or has a high prob- 
ability of emission, an increment in drive level 
facilitates the response and thereby enhances 
task performance. However, in task situations 
in which habit strength of the correct response 
is low, that is, where habit strength of an in- 
correct response is high, an increment in drive 
level facilitates the competing response and 
thereby depresses successful task performance 
(cf. Spence & Spence, 1966). 
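
The reasoning in this paragraph can be illustrated with a toy calculation in Python. It assumes the multiplicative rule of Hull-Spence theory, in which a response's excitatory potential is drive times habit strength; the paragraph itself does not state the rule in this form. The function names and all numerical values are hypothetical and serve only to show the direction of the predicted effects.

```python
def excitatory_potential(drive, habit):
    """Multiplicative drive-by-habit rule (E = D x H); an assumption
    borrowed from Hull-Spence theory for illustration."""
    return drive * habit

def correct_margin(drive, habit_correct, habit_competing):
    """Advantage of the correct response over its competitor.
    A larger margin stands for better expected task performance."""
    return (excitatory_potential(drive, habit_correct)
            - excitatory_potential(drive, habit_competing))

# Low competition: the correct response is dominant.
low_comp_low_drive = correct_margin(1.0, 0.8, 0.2)
low_comp_high_drive = correct_margin(2.0, 0.8, 0.2)

# High competition: an incorrect response is dominant.
high_comp_low_drive = correct_margin(1.0, 0.3, 0.7)
high_comp_high_drive = correct_margin(2.0, 0.3, 0.7)
```

Raising drive widens whatever gap already exists between the two habits. It therefore helps when the correct response is dominant and hurts when an incorrect response is dominant, which is the Drive × Response Competition interaction the later experiments look for.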

From the learning literature, two empirical 
effects of motivational constructs have been 
employed in investigations of the general 
motivational effects of dissonance arousal. The 
first is that drive may affect performance or
responses irrelevant to reduction of drive 
level, and the second is that drive level inter- 
acts with level of response competition to 
determine task performance. Let us first con-
sider research dealing with the effects of dis-
sonance arousal on responses unrelated to dis- 
sonance reduction and then move to research 
explicitly dealing with interactions between 
dissonance arousal and response competition. 

Pallak (1970) reported that dissonance 
arousal facilitated incidental paired-associate 
retention whether or not dissonance was re- 
lated to the paired-associate copying task. 
Briefly, subjects in the relevant dissonance 
condition were offered high or low choice 
(high or low dissonance) to perform a 
dull paired-associate copying task. Dis- 
sonance was aroused between the cogni- 
tions “This task is dull and boring” and “I 
chose to perform this task.” However, in the 
irrelevant dissonance conditions, subjects first 
were offered high or low choice to write a 
counterattitudinal essay and then performed 
the dull paired-associate copying task. Thus, 
dissonance was aroused between the cognitions 
“This essay is against my belief” and “I chose 
to write this essay” and was unrelated to the 
paired-associate copying task. Presumably, if 
dissonance could be viewed as a more general 
motivational construct, then dissonance irrele- 
vant to the dull copying task would facilitate 
incidental retention, as other evidence relating 
incidental retention and drive suggests (cf. 
Pallak & Andrew, 1970; Spielberger, Good- 
stein, & Dahlstrom, 1958). 

Consistent with this view of dissonance 
arousal, both relevant and irrelevant disso- 
nance facilitated incidental retention. Thus, 
one could view the choice manipulation as a 
manipulation of level of arousal and the vari- 
ation of relevant or irrelevant dissonance as a 
variation in the source of arousal. The hy- 
pothesis that dissonance may have general 
motivation properties neatly accounted for the 
incidental retention data. Let us move to 
research involving interactions between dis- 
sonance arousal and response competition. 

Waterman (1969) manipulated dissonance
by inducing subjects to write an essay which
was either inconsistent (dissonance) or con- 
sistent (no dissonance) with their initial atti- 
tude. As part of an ostensibly unrelated second 
experiment, subjects performed a paired- 
associate learning task which varied response 
competition by the degree of intralist competi-
tion. The prediction was that if dissonance
had drive or arousal properties, then, during 
acquisition, dissonance subjects should make 
fewer errors under low intralist competition 
and more errors under high intralist competi- 
tion than no-dissonance subjects. Overall, the 
results confirmed the prediction in that a sig-
nificant Dissonance × Response Competition
interaction was obtained for the error data. 
However, no difference in errors was ob- 
tained between the dissonance and no- 
dissonance conditions under low intralist 
competition, providing only partial support 
for the prediction. 

Cottrell and Wack (1967) manipulated the 
subject’s verbal response hierarchy by vary-
ing the number of times response items were
presented during training trials. Following
training, dissonance was manipulated by in- 
forming subjects that the experimenter had 
illegally recruited the subject, and hence the 
subject could not receive credit for the experi- 
ment (dissonance). In the no-dissonance con- 
dition, the subject simply received a filler 
communication and expected to receive re-
search credit. All subjects were then asked
to perform a pseudorecognition task. Presum- 
ably, agreeing to perform the pseudorecogni- 
tion task would arouse greater dissonance in 
the illegal recruitment condition than in the 
control condition. During the pseudorecogni- 
tion task, ambiguous stimuli were rapidly 
presented, and the subject attempted to label 
the stimuli with verbal responses acquired 
during initial training. The results indicated 
that the manipulation of dissonance facili- 
tated dominant responses at the expense of 
nondominant responses. These results were 
interpreted as support for the hypothesis that 
dissonance arousal may have motivational 
properties similar to those of drive. 

Although the extant research is consistent 
with the view that dissonance may have 
general motivational properties, there are 
limitations on this interpretation. Compelling 
support for this proposition demands two 
considerations. Empirically, manipulations of 
dissonance arousal must produce a clear dec- 
rement in performance under high competition 
and a clear enhancement of performance 
under low response competition. The first ex-
periment was designed to provide data to meet
this consideration.

MOTIVATIONAL EFFECTS OF DISSONANCE AROUSAL

The second major consideration is that 
experiments employing a dissonance manipu- 
lation must provide a replication of the 
empirical effect of dissonance. Thus, obtained 
task performance effects must be uniquely due 
to dissonance arousal, that is, to the presence 
of two dissonant cognitions, rather than to 
the inadvertent arousal of some other motiva- 
tional or tension state. For example, one could 
argue that the manipulation of dissonance via
“illegal recruitment” employed by Cottrell
and Wack (1967) resulted in anger, or some
form of emotional arousal other than disso-
nance, and that this alternative arousal
state mediated performance on the pseudo-
recognition task. Similarly, one could argue
that in the absence of any direct evidence 
of dissonance reduction, both the Pallak 
(1970) and the Waterman (1969) experi- 
ments are also open to similar explanations 
based on arousal or tension states other than 
dissonance. 

Empirically, evidence of dissonance reduc- 
tion and, by implication, dissonance arousal, 
typically consists of differential change in one 
or both of the cognitions standing in a dis-
sonant relationship (cf. Brehm & Cohen,
1962; Festinger, 1957, 1964). To the extent
that dissonance arousal mediated the perfor-
mance results reviewed here, one might have 
expected either reevaluation of the paired- 
associate copying task (Pallak, 1970), or 
attitude change regarding the essay topic 
(Waterman, 1969), or increased liking for the
pseudorecognition task (Cottrell & Wack, 
1967). None of the experiments above ob- 
tained empirical evidence of dissonance re- 
duction, and alternative explanations of the 
performance data, based on some other form 
of arousal, remain plausible. 

The procedure used in the present research 
was designed to rule out such an alternative 
by manipulating both dissonance arousal and 
dissonance reduction within the same experi- 
ment. Recall that dissonance results from the 
juxtaposition of two inconsistent cognitions.
Any subsequent information that would be 
consistent with choosing to perform a dull 
task would serve to reduce dissonance and
hence reduce any effect of dissonance on task
performance. Experiment II was designed to
manipulate dissonance arousal (by offering 
subjects high or low choice to perform a dull 
task) orthogonally to dissonance reduction 
(by providing subjects with high or low justi- 
fication following the choice manipulation). 

In the first experiment, all subjects per- 
formed the dull task of pronouncing three 
words repeatedly for 6 minutes. Dissonance
was manipulated by offering subjects high 
or low choice to perform the same dull task 
for another 30 minutes. Following the choice 
manipulation, the subjects performed 90 trials 
of the Stroop Color-Word Task (cf. Hochman, 
1967; Stroop, 1935) designed to provide either 
high or low verbal response competition. The
Stroop task was irrelevant to the pronuncia- 
tion task and provided the main verbal 
performance measure. The prediction was 
straightforward: high-choice subjects should 
increase in number of errors under high re- 
sponse competition and decrease in number of 
errors under low response competition relative 
to low-choice subjects.


EXPERIMENT I 


Method 
Subjects 


A total of 65 female undergraduates participated 
individually in the experiment for research-participation
credit. A total of 4 subjects were not included
in the analyses (3 left at the choice manipulation 
when offered a chance to do so, and 1 subject was 
suspicious of the procedure), leaving a final sample 
of 61 subjects.


Procedure 


Upon arrival, all subjects were told that the experi- 
ment dealt with general speech patterns and how 
these patterns changed over time. The experimenter 
elaborated briefly and explained that speech patterns 
were typically studied by tape recording people as 
they pronounced common words a large number of 
times; that future research would investigate variables 
related to speech patterns; that since the subject was
in a control or base-line condition, she would first 
perform a color-naming filler task (the Stroop task) 
designed simply to hold constant the time elapsed 
in the experimental situation; and that the Stroop 
task was irrelevant to the study of general speech 
patterns. In order to acquaint the subject with the 
procedure required for the Stroop task and to mini- 
mize confusion for the subject following the choice 
procedure below, all subjects immediately performed
90 trials of the Stroop Color-Word Task described
below.

The boring pronunciation task. Following the
Stroop procedure, the experimenter explained that the 
subject would next perform a sample pronunciation 
task to become familiarized with the pronunciation 
task procedure and that the actual pronunciation task 
would take 30 minutes. Three common nouns (cat,
wagon, girl) were presented singly every 3 seconds, 
and the subject was required to say the word aloud. 
Each subject was cautioned not to memorize the 
words or to figure out an order to the words, since 
they were randomly presented and the research dealt 
only with speech patterns. The sample pronunciation 
task lasted 6 minutes. 

The choice manipulation. Following the sample
pronunciation task, the experimenter explained that 
frequently subjects had found that the pronunciation 
task was boring and had asked if they “really had 
to do the second pronunciation task.” In the high- 
choice condition, the experimenter continued by 
stating that the subject had fulfilled her obligation 
for the research credit by reporting for the experi- 
ment; that she would receive participation credit 
whether or not she completed the pronunciation 
task; and that it was entirely the subject’s decision 
to remain or leave. 

In the low-choice conditions, the experimenter con- 
tinued by stating that the subject was obligated to 
remain and complete the pronunciation task since 
she had signed up for the study; that she could not 
receive research participation credit without com- 
pleting the pronunciation task; and that the subject 
should probably feel “conscience bound” to complete 
the pronunciation task since a replacement subject 
was not available. Finally, all subjects signed a
printed “Departmental Policy” form which simply 
reiterated the choice manipulation. 

The dependent measures. Following the choice 
manipulation, all subjects performed a second series 
of 90 trials of the Stroop task, again as a “control 
for time elapsed” prior to the actual pronunciation 
task. The number of errors provided the primary 
dependent measure. 

In the high-response-competition version, items 
consisted of color nouns (“green,” “blue,” “red”) 
printed in incongruent combinations of noun and 
colored ink. Thus, the noun “red” was printed in 
blue or green ink, etc., and subjects were required 
to pronounce the color of ink rather than the color 
noun. In the low-response-competition condition, 
however, items consisted of nouns unrelated to color 
associations (lot, safe, close) which were printed in 
red, blue, or green ink also. Thus, under high re- 
sponse competition, the appropriate response of pro- 
nouncing the ink color, for example, “red,” com- 
peted with the subject’s association to the color 
noun “blue.” Under low response competition, there 
was no such competition. 

In both response competition conditions, each 
noun was printed in each ink color, and each set 
of nine items was repeated in random order for a
total of 10 times. All subjects received a total of
90 items via a Stowe memory drum at the rate of
one item per second. All responses were tape-recorded
and scored later for errors (in all cases, the scorer 
was blind as to choice condition) such as incorrect 
color responses, noun reading, omitted responses, or 
contaminated responses (e.g., “gred” in response to
the noun “green” printed in red ink). We might note 
that an additional series of experiments (Pallak, 
Pittman, Heller, & Munson, 1969) found this adapta- 
tion of the Stroop procedure to be sensitive to 
arousal or drive level manipulated via threat of 
electric shock. 
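
The list construction described above (each noun printed in each ink color, with the set of nine items repeated in a fresh random order 10 times for 90 items) can be sketched as follows. This is only an illustrative sketch: the function name, the seed, and the use of Python's `random` module are assumptions, and the example uses the low-response-competition nouns, since the source does not spell out how the nine high-competition items were formed.

```python
import random


def build_stroop_list(nouns, ink_colors, repetitions=10, seed=None):
    """Return (noun, ink_color) items: every noun in every ink color,
    with the full set reshuffled for each repetition."""
    rng = random.Random(seed)
    base_set = [(noun, ink) for noun in nouns for ink in ink_colors]
    items = []
    for _ in range(repetitions):
        block = base_set[:]   # copy so the base set is never mutated
        rng.shuffle(block)    # new random order within this repetition
        items.extend(block)
    return items


# Low-response-competition nouns and ink colors named in the text.
low_competition = build_stroop_list(
    nouns=["lot", "safe", "close"],
    ink_colors=["red", "blue", "green"],
    seed=0,  # hypothetical seed, used only to make the sketch reproducible
)
```

Each consecutive run of nine items contains all nine noun-ink pairings exactly once, so the 90-item list is balanced across repetitions.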

Following the Stroop procedure, the experimenter 
casually mentioned that he also would like the sub- 
ject to fill out some forms before beginning the 
actual pronunciation task. These scales were designed 
to assess the effectiveness of the choice manipulation, 
perceptions of both the pronunciation and Stroop 
tasks, and perceptions of the experimenter. All
subjects were completely debriefed and excused. 


Results 


The major results are presented in Table 1. 
As may be seen, the choice manipulation was 
quite effective. The analysis of variance of 
the perception of choice means indicated that 
subjects under high choice perceived greater 
choice to perform the dull pronunciation task 
than subjects under low choice (F = 48.18, 
df = 1/57, p < .001).

The error data means presented in Table 1
represent the mean increase or mean decrease
in errors from the first 30 trials (Block 1)
on the Stroop task to the second 30 trials
(Block 2) on the same task. The number
of errors on Block 1 was subtracted from the
number of errors on Block 2. Thus, a positive
score represents an increase, and a negative
score represents a decrease in the number
of errors.


TABLE 1


EFFECTS OF CHOICE AND RESPONSE COMPETITION
ON PERCEPTION OF CHOICE AND CHANGE IN
VERBAL PRONUNCIATION ERRORS


                            Level of response competition
Dissonance      Measure          High          Low

High choice     Choice          67.83        70.46
                Errors          +1.61         -.39
                n                  18           13
Low choice      Choice          18.65        18.77
                Errors           +.41         +.62
                n                  17           13


Note. The higher the mean, the greater the perceived
choice. A positive mean for errors indicates an increase in
errors.
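
The change score used for the error data (Block 2 errors minus Block 1 errors, so that a positive value means more errors) can be illustrated with a short sketch. The function names and the per-subject error counts below are hypothetical; the source reports only cell means.

```python
def error_change(block1_errors, block2_errors):
    """Change score: errors on Block 2 minus errors on Block 1.
    Positive = increase in errors, negative = decrease."""
    return block2_errors - block1_errors


def mean_change(pairs):
    """Mean change score over (block1_errors, block2_errors) pairs."""
    changes = [error_change(b1, b2) for b1, b2 in pairs]
    return sum(changes) / len(changes)


# Hypothetical error counts for three subjects in one cell.
example_cell = [(2, 4), (3, 3), (1, 2)]
example_mean = mean_change(example_cell)
```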

The analysis of the error data indicated a
near-significant main effect for response
competition (F = 3.91, df = 1/57, p < .10)
such that a greater increase in errors was
obtained under high than under low response
competition, as expected. In addition, the
interaction between choice and response com-
petition was statistically significant (F = 5.89,
df = 1/57, p < .05). Under high response
competition, high-choice subjects increased in 
number of errors in comparison to the low- 
choice subjects (1.61 versus .41, respectively; 
t = 2.73, df = 57, p < .05).³ In contrast,
under low response competition, high-choice 
subjects decreased the number of errors in 
comparison to low-choice subjects (-.39
versus .62, respectively; t = 2.27, df = 57,
p < .05). These data replicated the empirical 
interaction effect usually obtained between 
levels of drive, in this case, cognitive disso- 
nance, and levels of response competition on 
task performance. 
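
Footnote 3 states that the two-cell comparisons used the mean square within cell, pooled over all four cells, as the error term. A minimal sketch of that kind of t statistic follows. The function name is an assumption, and the mean-square value in the example is hypothetical because the source does not report it.

```python
import math


def two_cell_t(mean_a, mean_b, n_a, n_b, ms_within):
    """t for a two-cell comparison using a pooled within-cell mean
    square (from the full design) as the error term."""
    standard_error = math.sqrt(ms_within * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / standard_error


# Hypothetical values chosen so the arithmetic is easy to follow:
# difference = 2.0, standard error = sqrt(1.0 * (0.5 + 0.5)) = 1.0.
example_t = two_cell_t(mean_a=3.0, mean_b=1.0, n_a=2, n_b=2, ms_within=1.0)
```

The degrees of freedom for such a test come from the pooled error term (57 in Experiment I), not from the two cells being compared.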

However, no evidence of dissonance reduc- 
tion was obtained on any measure evaluating 
the pronunciation task, the Stroop task, or 
the experimenter. As a result, one could argue 
that these data may not have been mediated 
by dissonance arousal, but rather by some 
other arousal state inadvertently manipulated 
by our choice manipulation. These data are 
consistent with other evidence in the litera- 
ture and are also open to the same alternative 
explanation noted above. 


EXPERIMENT II


The second experiment was designed to 
eliminate explanations based on alternative 
arousal states by demonstrating that the per- 
formance effect reported above resulted from 
dissonance arousal per se (i.e., the juxtaposi- 
tion of two mutually inconsistent cognitions). 
Presumably, one way to reduce dissonance is 
through the addition of consonant cognitions 
(cf. Festinger, 1957, 1964) following a deci- 
sion. In the present research, postdecisional 


3 In order to obtain a more stable estimate of the
variance, the mean square within cell, based on four
cells, was employed as the error term for all two-
cell comparisons. All t tests are two-tailed.

information consistent with the subject’s deci-
sion to perform the dull pronunciation task
would reduce dissonance originally aroused 
between the cognitions “I chose to perform 
this task” and “This task is dull and boring.” 
It follows that if performance on the Stroop 
task above was mediated by dissonance 
arousal, then subjects for whom dissonance 
was reduced by consonant postchoice informa- 
tion should make fewer errors under high 
response competition than subjects for whom 
dissonance was not reduced. In addition, sub- 
jects receiving consonant postchoice informa- 
tion should show little evidence of dissonance 
reduction, via differential pronunciation task 
reevaluation in this case. 

The design for the second experiment 
called for adding a manipulation of dissonance 
reduction, that is, high or low justification, 
postchoice, for the pronunciation task to the 
manipulation of dissonance arousal employed 
previously, that is, high or low choice to per- 
form the dull pronunciation task. We ex- 
pected that high justification would reduce 
dissonance and thereby attenuate performance 
differences between high and low choice con- 
ditions. However, we expected that low justi- 
fication would maintain or perhaps augment 
dissonance arousal and thereby lead to differ- 
ential task performance, as well as differen- 
tial reevaluation of the pronunciation task. 
This latter effect would provide clear evidence 
that dissonance arousal mediated task per- 
formance rather than some alternative arousal 
state. 

Experiment II was designed to provide a 
partial replication of the first experiment by 
employing the high-response-competition ver-
sion of the Stroop task in all conditions.⁴ In


4 The major point of the second experiment was 
the relationship between manipulated dissonance 
reduction on both performance and task evaluation. 
In view of the results from Experiment I, the partial 
replication of Experiment I is an adequate test for 
our hypothesis. Obviously, a more elegant design 
would have included a complete replication of the 
design employed in Experiment I. However, our pilot 
research for this experiment indicated that approxi- 
mately 25 subjects per cell would be needed to pro- 
duce systematic effects on both task performance and 
task evaluation measures. Unfortunately, the avail-
ability of subjects for such a design was becoming
limited by the end of the term.

addition, the initial presentation of the Stroop
task, prior to the choice manipulation, was 
deleted from the procedure. Pilot research 
(Pallak et al., 1969) indicated that prior 
practice on the Stroop task simply reduced 
the error rate on the task and thus made it 
more difficult to demonstrate systematic per-
formance effects. This change in procedure
enabled us to use total errors rather than a 
change score for number of errors in our 
analyses (see below). 


Method 


Subjects and Design 


A total of 120 female undergraduate subjects par- 
ticipated in the study for course credit. Four sub- 
jects left at the choice manipulation, leaving a final 
experimental sample of 116 subjects. All subjects 
received the high-response-competition list from the 
Stroop dependent measure (see above). High and 
low choice to perform the dull pronunciation task 
was manipulated orthogonally to high or low justifi- 
cation received postchoice. 


Procedure 


The procedure closely followed that employed in
Experiment I. Upon arrival, all subjects received the 
speech-pattern cover story, performed the sample 
pronunciation task described above, and then re- 
ceived the same choice manipulation as before. 

The justification manipulation. Once the subject 
had signed the appropriate choice manipulation form, 
the experimenter suggested: “Oh, by the way—maybe 
you’d like to know a little more about the pronun- 
ciation task research.” In the high-justification con- 
dition, the experimenter continued: 


Your pronunciation task results will be extremely 
useful to us for several reasons. First of all, general 
speech patterns do in fact change systematically— 
thus, your pronunciation task results will provide 
us with some very useful information about this 
stage of the research. Second, this stage of the 
research has been going on for some time, and 
we only need a few more subjects. Thus, your 
pronunciation task results will be very useful to 
us since they will allow us to complete the study 
and to begin the data analysis. Your pronuncia- 
tion task results will be very useful in the analysis, 
so it’s fortunate that we had you scheduled today.


In the low-justification condition, the experimenter 
continued: 


Your pronunciation task results actually will not
be very useful to us for several reasons. First,
we've found that while general speech patterns do
in fact change, they don’t do so systematically— 
thus, your pronunciation task results probably
won't provide us with useful information about
this stage of the research. Second, this stage of the
research has been going on for some time, and in 
fact we completed this stage yesterday—so your 
pronunciation task results won’t be very useful 
as the data analysis is already being completed. 
Your pronunciation task results won’t be used in 
the analysis. Since you were already scheduled, 
we thought we’d run you through the procedure 
anyway. 


We might note that this manipulation previously has 
been successfully employed by Freedman (1963) and 
in the studies reported by Zimbardo (1969). 

The dependent measures. Following the justification
manipulation, all subjects then performed 90 trials
of the high-response-competition list from the Stroop
task. The number of pronunciation errors provided
the primary dependent measure. The experimenter
waited several minutes and then casually asked the
subject to fill out “some forms” before beginning
the actual pronunciation task. The forms consisted
of the same 91-point measures employed above and
a 91-point usefulness scale designed to check on the
justification manipulation (“How useful will your
pronunciation task results be to the research on
general speech patterns?”). After completing the
scales, all subjects were completely debriefed and
excused.


Results
The Effectiveness of the Manipulations 


Analysis of the perceived choice means 
(Table 2) indicated that high-choice subjects 
perceived greater choice to perform the pro- 
nunciation task than low-choice subjects (F 
= 59.23, df=1/112, p<.001), with no 
other effects approaching significance. Simi-
larly, the usefulness measure indicated that
high-justification subjects perceived that their
pronunciation task data would be more useful
than low-justification subjects (F = 68.00,
df = 1/112, p < .001), with no other effect
approaching significance. These two verbal
report measures suggest that the manipula-
tions were successful and unconfounded.


TABLE 2


EFFECTS OF CHOICE AND POSTCHOICE JUSTIFICATION
ON PERCEPTIONS OF CHOICE, USEFULNESS, AND
NUMBER OF VERBAL ERRORS, AND EVALU-
ATION OF THE PRONUNCIATION
AND STROOP TASKS


                              High choice             Low choice
                             justification           justification
Measure                     High       Low          High       Low

Perception
  Choice                   62.38     69.56          22.1   [illegible]
  Usefulness         [illegible]     44.53         75.08     30.58
Errors                      8.41     12.88          8.73      6.38
Pronunciation task         38.25     40.01         40.23     30.65
Stroop task                65.25     68.13         74.50     67.31
n                             32        32            26        26


Dependent Measures 


The error data (Table 2) represent the
mean number of errors over the first 60 trials
from the Stroop task. Elimination of practice
trials, prior to the choice manipulation, pro-
duced absolute differences in errors rather
than change in errors (as obtained in Experi-
ment I). However, the error data from the
last 30 trials were less stable than from the
first 60 trials, although the ordering of the
means was identical, and these data were not
included in the analysis.

The analysis of variance of the error data
indicated a main effect for choice (F = 5.78,
df = 1/112, p < .05) such that high-choice
subjects made more errors than low-choice
subjects. More importantly, the interaction
between choice and justification was signifi-
cant (F = 7.06, df = 1/112, p < .01). Under
high justification we expected little difference
between the high and low choice conditions.
Under low justification, however, we expected
a replication of the effect from Experiment I,
that is, more errors in the high than in the
low choice condition. Consistent with this line
of reasoning, the difference in errors between
the high- and low-choice-low-justification con-
ditions was significant (12.88 versus 6.38, re-
spectively; t = 3.54, df = 112, p < .01),
while the difference between high and low
choice-high justification was not (t < 1.00).
In addition, the crucial difference between
high choice-high justification and high choice-
low justification was also significant (8.41
versus 12.88, respectively; t = 2.81, df = 112,
p < .01). Thus, elimination of dissonance
arousal via postchoice justification also elimi-
nated the effect of dissonance on verbal task
performance.
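
The pattern behind the Choice x Justification interaction on errors can be made concrete by computing the simple effect of choice within each justification level and then the difference between those two effects. The sketch below is illustrative only. Three of the cell means (8.41, 12.88, 6.38) are quoted in the text; the fourth (8.73, taken here as the low-choice, high-justification cell) is read from Table 2 and its placement is an assumption. The function names are hypothetical, and the contrast is descriptive, not a significance test.

```python
# Mean errors keyed by (choice, justification).
error_means = {
    ("high", "high"): 8.41,
    ("high", "low"): 12.88,
    ("low", "high"): 8.73,   # assumed cell placement, read from Table 2
    ("low", "low"): 6.38,
}


def choice_effect(means, justification):
    """High-choice minus low-choice mean at one justification level."""
    return means[("high", justification)] - means[("low", justification)]


def interaction_contrast(means):
    """Choice effect under low justification minus the choice effect
    under high justification."""
    return choice_effect(means, "low") - choice_effect(means, "high")


effect_low_just = choice_effect(error_means, "low")    # about 6.50
effect_high_just = choice_effect(error_means, "high")  # about -0.32
contrast = interaction_contrast(error_means)           # about 6.82
```

Under these assumptions the choice effect is large when justification is low and close to zero when justification is high, which is the attenuation the text describes.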

The means for pronunciation task evalua-
tion (Table 2) represent the average response
from two 91-point scales: “How boring was
the sample pronunciation task?” and “How
boring do you think the second pronuncia-
tion task will be?” We expected that the high-
justification manipulation would reduce dis-
sonance for the subject and hence would pre-
clude cognitive work leading to differential
evaluation of the pronunciation task in the
high-justification conditions. We also expected
that the low-justification manipulation would
arouse or maintain dissonance, as the error
data suggest, and lead to differential task
evaluation. Consistent with our analysis, no
difference in task evaluation was obtained
between the high and low choice conditions
under high justification (38.25 versus 40.23,
respectively; t < 1.00, df = 112), while the
difference between the high and low choice
conditions under low justification was signifi-
cant (40.01 versus 30.65, respectively; t =
2.14, df = 112, p < .05). The overall analysis
of this measure indicated a weak interaction
between choice and justification (F = 3.24,
df = 1/112, p < .10). These data suggest
that dissonance arousal mediated the task-per-
formance error data, and hence make expla-
nations based on other arousal states less
likely.

The means (Table 2) for Stroop evaluation 
also represent the average response from two 
91-point scales: “How boring was the Color- 
Naming Filler Task?” and “How important 
was the Color-Naming Filler Task?” The 
analysis indicated an interaction between
choice and justification (F = 4.02, df = 1/112,
p < .05), such that Stroop evaluation in
the low-choice-high-justification condition was
more favorable than in either the high-choice-
high-justification (74.50 versus 62.25, respec-
tively; t = 2.79, df = 112, p < .05) or in the
low-choice-low-justification conditions (74.50
versus 67.31, respectively; t = [illegible], df = 112,
p < .07). One major assumption in using the
Stroop task was that it was unrelated to dis- 
sonance arousal or dissonance reduction. Thus, 
differential performance on the Stroop could 
not logically provide a mode of dissonance 
reduction, and performance effects could not 
be the result of responses instrumental to dis-
sonance reduction. Although the Stroop task
was differentially evaluated, clearly the proc-
ess mediating Stroop evaluation was not one
of dissonance arousal or dissonance reduction,
and in fact was opposite to a dissonance
effect. Thus, our working assumption that the
pronunciation task (the source of disso-
nance) and the Stroop task (the performance
measure) were independent seems justified.

The performance data from the Stroop
task and the pronunciation-task evaluation 
data taken together provide support for the 
hypothesis that the performance effects were 
mediated by dissonance arousal rather than 
some alternative arousal or tension state. 


Discussion 


The results from the present research sup- 
port the hypothesis that dissonance arousal 
may have properties similar to those of more 
general motivational constructs within psy- 
chology. The results from Experiment I repli- 
cated a typical empirical effect of motiva- 
tional constructs such as drive, and an 
interaction between dissonance and response 
competition was obtained. The second experi- 
ment was designed to provide a partial repli- 
cation of this performance effect and to dem- 
onstrate that the obtained performance effect 
uniquely resulted from dissonance arousal 
rather than from some other tension or 
arousal state. 

The results from the second experiment cor-
roborated our analysis. Under low justifica-
tion, postchoice, high-choice subjects made
more errors than low-choice subjects, repli- 
cating the high-response-competition results 
obtained in Experiment I. However, under 
high justification, postchoice, no performance 
difference was obtained between the high- 
and low-choice conditions. In addition, high- 
choice-high-justification subjects made fewer 
errors than high-choice—low-justification sub- 
jects, as expected. We assumed that add- 
ing consonant cognitions (high justification) 
would reduce dissonance for the subject and 
thereby attenuate any performance difference 
between the high- and low-choice-high-
justification conditions.

The results from the pronunciation-task 
evaluation measures were also consistent with 
our analysis. Under low justification, high- 
choice subjects evaluated the dull pronuncia- 
tion task more favorably than low-choice sub- 
jects, a fairly straightforward dissonance 
effect. However, under high justification, dif-
ferential task evaluation was not obtained
between the high and low choice conditions,
exactly what one would expect if dissonance 
had been reduced for the subject by the high- 
justification manipulation. Assuming that 
high justification, postchoice, reduced disso- 
nance, then dissonance arousal should no 
longer mediate either pronunciation task eval- 
uation or Stroop task performance, as the 
data suggested. 

Under high choice, no difference in task 
evaluation was obtained between the high- 
and the low-justification conditions. Normally, 
one would expect to obtain more favorable 
task evaluations under high choice-low justi- 
fication than under high choice—high justifica- 
tion since greater dissonance would be present 
in the former condition. However, we should 
note that task evaluations were more favor-
able in the low-choice-high-justification condi-
tion than in the low-choice-low-justification
condition (40.23 versus 30.65, respectively;
t = 2.06, df = 112, p < .05). This compari-
son suggests that the high-justification ma-
nipulation, in addition to providing consonant
postchoice cognitions, also produced positive 
affect which generalized to the pronunciation 
task. It appears that two processes mediated 
favorable reevaluation of the pronunciation 
task, one of dissonance reduction, as the com-
parison between the high- and low-choice-
low-justification conditions suggests, and one
unrelated to dissonance, that of positive affect,
as the comparison of the low-choice-high-
justification and low-choice—low-justification 
conditions suggests. Apparently, the strength 
of the process under high justification was 
equivalent to that of dissonance reduction 
under low justification, attenuating the dif- 
ference in task evaluation between the high- 
choice-high-justification and high-choice—low- 
justification conditions. Our explanation, 
based on positive affect under high justifica- 
tion, is a tentative one and is in need of 
further empirical attention. It should be clear 
that such a process, however, does not offer 
an alternative explanation for the task per- 
formance results obtained in the present 
research. 

None of the literature dealing with the 
general motivational effects of dissonance 
arousal has obtained evidence of dissonance
reduction, that is, the usual empirical effect
of dissonance arousal. As a result, alternative 
explanations of those results based on arousal 
other than dissonance remain plausible. We 
also did not obtain such evidence in Experi-
ment I but did obtain evidence of dissonance
reduction in Experiment II, as the pronuncia- 
tion-task evaluation results suggested. The 
only major difference between Experiments I 
and II in the present research was the manip- 
ulation of high or low justification postchoice 
employed in Experiment II. One possible ex- 
planation is based on research reported by 
Festinger (1964) and Allen (1965). 
Festinger (1964) suggested that dissonance 
reduction is an active process requiring time 
and cognitive work. Thus, situational variables 
which facilitate cognitive work should facili-
tate dissonance reduction. Allen (1965) re-
ported that subjects distracted from this cog-
nitive work by the demands of an irrelevant 
task failed to show evidence of dissonance 
reduction. In that experiment, subjects were 
offered a choice between two attractive art 
prints to keep. Following the decision between 
the prints, half of the subjects were kept 
occupied by performing a complex concept- 
formation task, while the remaining subjects 
simply waited. The results indicated greater 
dissonance reduction via reevaluation of the 
art prints for subjects who did not perform 
the concept-formation task. That is, less dis- 
sonance reduction took place for those sub- 
jects who were distracted postdecisionally by 
the demands of the concept-formation task. 
This line of reasoning would be compatible 
with the lack of evidence of dissonance re- 
duction in the experiments previously re- 
viewed. For example, one could argue that 
the pseudorecognition task (Cottrell & Wack, 
1967), the paired-associate learning task 
(Waterman, 1969), and the paired-associate 
copying task (Pallak, 1970; Pallak, Brock, 
& Kiesler, 1967) may have distracted the 
subjects from cognitive work presumably di- 
rected toward dissonance reduction. Simi- 
larly, the demands of the Stroop task in Ex- 
periment I also may have distracted the sub- 
jects from similar cognitive work. Further, if 
one assumes that the low-justification manip-
ulation in the present research kept the rele-
vant set of dissonant cognitions salient, then
one might expect evidence of dissonance re- 
duction in Experiment II but not in Experi- 
ment I. Although generally compatible with 
the present results, this is obviously a tenta- 
tive explanation. The precise relationship be- 
tween dissonance arousal, distraction pro- 
duced by task demands, and dissonance re- 
duction is left for further empirical attention. 

Attitude-change theorists typically have 
been interested in relating attitude and be- 
havior change. Dissonance theory, of course, 
has provided a useful account for behavior 
change instrumental to the reduction of dis- 
sonance. By supporting a broader view of 
dissonance as a more general motivational 
construct, the present results should serve as 
a bridge by which attitude-change theorists 
can systematically approach a broader range 
of behavior, that is, a range of behavior not 
restricted to instrumental responses. Mini-
mally, the present research serves to clarify
the conceptual status of cognitive dissonance 
as a motivational construct. 

Accordingly, the present results bear on the
current theoretical controversy between dis- 
sonance theory and the self-perception model 
proposed by Bem (1965, 1967). Bem sug- 
gested that one observes his behavior and 
the situational constraints surrounding the 
behavior and then infers a belief or attitude 
consistent with that behavior. In contrast to 
dissonance theory, Bem’s model does not 
hypothesize a motivational construct, such as 
dissonance, and does not rely on assumptions 
concerning initial attitude or the relationship 
between initial attitude and subsequent be- 
havior. As Bem (1967) noted, the self-percep- 
tion model does account for attitude-change 
results derived from both the “forced compli- 
ance” and “free choice” dissonance paradigms, 
but does not make predictions different from 
dissonance theory for those paradigms. As a 
result, a direct test of dissonance and self- 
perception predictions has not yet been re- 
ported in the literature. The results from the 
present research, however, suggest that the 
assumption of a motivational construct, such 
as dissonance, is tenable and warranted by 
the performance data. In short, Bem’s model 
cannot readily account for noninstrumental
performance effects uniquely attributable to
dissonance arousal. 


The present research supports the implica- 
tion drawn from Festinger (1957), that dis- 
sonance arousal may have more general moti- 
vational properties similar to broader motiva- 
tional constructs in psychology. Empirically, 
evidence of a Dissonance X Response Compe- 
tition interaction on performance irrelevant to 
dissonance reduction was obtained. In addi- 
tion, the results from the second experiment 
suggested that the performance data were 
mediated by dissonance arousal by (a) dem- 
onstrating that when dissonance was reduced 
by manipulation, the effect on performance 
was attenuated and (b) by demonstrating 
that when dissonance was maintained, the 
performance effect was replicated, and evi- 
dence of dissonance reduction was also ob- 
tained. The latter result makes alternative 
explanations of the performance effect, based 
on arousal other than that of dissonance, un- 
likely. Although, as noted, several points war- 
rant further investigation, the present re- 
sults support a broader view of dissonance as 
a motivational construct.


REINFORCING AND ATTRACTING FUNCTIONS OF YIELDING¹


JOHN P. LOMBARDO,² ROBERT FRANK WEISS, AND WILLIAM BUCHANAN
University of Oklahoma 


The reinforcement theory of attraction assumes that agreement leads to at- 
traction, that yielding (agreement preceded by disagreement) leads to greater 
attraction than agreement alone, and that these effects are based on reinforce- 
ment. Yielding led to greater attraction than did simple agreement (p < .001).
Two experiments (N = 168) directly tested the reinforcement theory by demon-
strating that subjects would learn an instrumental response, the reinforcement


for which was the agreement or yieldin; 
more effective reinforcer than agreeme 
(1/latency 


and this effect was shown to be replicable 


ments (extinction) led to a significant d 


(p < .005). 


In several experiments, attraction toward 
another person has been shown to be a func- 
tion of a “sequence” effect in which subjects 
were more attracted toward agreers after 
initial exposure to disagreers (Stapert & Clore, 
1969; Worchel & Schuster, 1966). A similar 
effect was found when the same person began 
disagreeing with the subject and at some point 
changed to agreeing with the subject (Byrne; 
Lamberth, Palmer, & London, 1969; Gerard 
& Greenbaum, 1962; Sigall, 1968; see also 
Aronson & Linder, 1965, for similar results 
using personal evaluations rather than atti- 
tudinal agreement). 

The theory of interpersonal attraction 
through attitude similarity-dissimilarity is 
based on the hypothesis that attitudinal state- 
ments which express views similar to or dis- 
similar from those of an individual constitute, 
respectively, reinforcing and punishing stimuli 
for him (Byrne, 1969). Several experiments 
have supported this hypothesis by showing 
that attitudinal statements had the same ef- 
fect as other known reinforcers; for instance, 


1 The first experiment is based on a doctoral dis- 
sertation submitted to the University of Oklahoma 
in partial fulfillment of the requirements for the PhD 
degree, Experiment II was supported in part by 
funds from the College of Arts and Sciences and the 
Graduate College of the University of Oklahoma. 
The authors wish to thank John S. Ezell and Carl 
D. Riggs for this support. 

? Requests for reprints should be sent to John P. 
Lombardo, who is now at the Department of Psy- 
chology, State University of New York College at 
Cortland, Cortland, New York 13045. 
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) in a discrete-trials instrumental conditio 


g of another person. Yielding was a 
nt as measured by response speed 
ning paradigm (p< .01), 
( < 005). Withdrawal of agree- 
lecrease in response speeds over trials 


agreeing and disagreeing attitudinal state- 
ments were effective in altering response prob- 
abilities when presented after a specific re- 
sponse (Byrne, Griffitt, & Clore, 1968; Lam- 
berth, 1970; Lamberth & Craig, 1970; Reitz, 
Duey, & Mason, 1968), 

The attraction of people to others having 
the same attitudes as themselves has been 
attributed to the effectance motive (Byrne & 
Clore, 1967), which is defined as a learned 
drive to be logical, consistent, and accurate 
in interpreting the stimulus world. According 
to effectance theory, disagreement produces a 
state of arousal or tension (Stapert & Clore, 
1969) through consensual invalidation, while 
attitudinal agreement reduces arousal or pre- 
vents arousal through consensual validation 
(Byrne & Clore, 1967; Byrne, Nelson, & 
Reeves, 1966). Support for a disagreement- 
arousal relationship has been reported in 
experiments using both the galvanic skin response
(Burdick & Burnes, 1958) and self-reports
after exposure to disagreeing attitudes
(Byrne & Clore, 1967; Stapert & Clore, 1969) 
and to disagreements in a line-judging task 
(Gerard & Greenbaum, 1962). 

If the results of the studies cited above 
are examined using learning theory as a 
model, two conclusions are suggested: (a) If 
disagreement has the functional properties of 
drive, reduction of it through agreement fol- 
lowing an instrumental response should serve 
to reinforce that response; (b) a logical ex- 
tension of the sequence effect suggests that 
disagreement (drive arousal) followed by
agreement (drive reduction) has greater reinforcement
value than agreement not preceded
by disagreement. 


EXPERIMENT I 


The purpose of the present experiment was 
to study the reinforcing effects of yielding 
(disagreement followed by agreement) and 
simple agreement on the strength of an instru- 
mental response. The reinforcing effects of the 
conditions of the present experiment were 
modeled on discrete-trials instrumental escape 
conditioning, in which reinforcement was the 
termination of a noxious drive contingent on
an instrumental response. 

Based on the studies cited above and their 
formulation within a learning theory frame- 
work, it was assumed that the three conditions 
in the present experiment were analogous to 
escape conditioning studies varying magnitude 
of reinforcement. For subjects in the present 
experiment, reinforcement was listening to 
another person either yield, agree, or disagree 
and was contingent on an instrumental 
response (pressing a toggle switch). 

Upon the assumption that the present ex- 
periment is analogous to discrete-trials in- 
strumental escape conditioning experiments in 
which magnitude of reinforcement was studied 
(e.g., Bower, Fowler, & Trapold, 1959; 
McAllister & McAllister, 1967), it was hy- 
pothesized that response speed would be an 
increasing function of type of agreement, with 
type of agreement (yield versus simple agreement)
being analogous to magnitude of reinforcement.
The specific hypotheses were: (a)
Subjects would acquire an instrumental re- 
sponse that gave them the opportunity to 
listen to another person either yield or agree 
with them. (b) Asymptotic response speeds 
in acquisition would have a specified rank 
order; that is, asymptotic speed for the yield 
group would be faster than asymptotic speed 
of the agree group, and a third group which 
was disagreed with would have the slowest 
speed. (c) Based on the results of escape 
conditioning studies in which magnitude of 
reinforcement was manipulated, a significant 
Magnitude of Reinforcement X Trials inter- 
action was predicted. (d) Response speeds 
would decrease (extinguish) after reinforcement
was discontinued.


Method


Subjects and Design


One hundred and eight subjects were randomly 
assigned to the yield, agree, and disagree groups. 
Each group had 24 males and 12 females. For all
groups, each trial was started by a confederate
(always the same sex as the subject) reading a topic.
After reading the topic, the confederate in the yield
condition stated an opinion that was contrary to
that of the subject. In the agree/disagree conditions,
the confederate did not commit himself on the topic
until after the subject responded. After the subject
made the instrumental response and gave his opinion
on the topic, the confederate either yielded, agreed,
or disagreed depending on the condition the subject
was in.

In all groups, reinforcements were given on only
50% of the 12 acquisition trials. Pilot research indi- 
cated that it was not feasible to have the confederate 
yield on every trial because subjects became suspi- 
cious when both 100% and 75% of the trials were 
reinforced. As a consequence, a 50% partial reinforcement
schedule was employed in which the confederate
yielded to the subject on only one-half of
the trials. In order to maximize learning, the con- 
federate yielded to the subject on items of high 
interest and maintained the original disagree position 
on low-interest items. This particular reinforcement 
procedure, which was necessary to have an effective 
experimental induction, required congruent procedures
in the other two groups. Thus, subjects in the 
agree group also received the same 50% partial rein- 
forcement schedule and were agreed with on high- 
interest items. The disagree group followed an identi- 
cal schedule, with their disagreements limited to 
high-interest items.


Apparatus 


The subject’s room and the experimenter’s control 
room shared a common wall. The subject was seated 
at a table facing this wall, which included four 
windows of transparent mirror glass. The windows 
were opaque except when illuminated from behind, 
and instructional signals appeared in each window 
upon illumination. The signals were the large printed 
words (a) “listen,” (b) “throw switch if you wish 
to comment,” (c) “talk,” (d) “move dial to final 
opinion.” A panel mounted on the table top con- 
tained the subject’s “comment” switch (a telephone 
toggle switch with a spring return), a microphone, 
and a speaker, as well as an opinion-change indicator 
used in the masking task. Also mounted in the wall 
were two Standard Electric timers, by means of 
which the subject could monitor the length of time 
he and the “other subject,” respectively, had spoken. 

On the experimenter’s side of the wall were the 
controls for turning on the various signals, a loud- 
speaker for monitoring the subject’s comments, a 
microphone, and a control timer. A Lafayette 5720, 
1/100 digital readout stop clock automatically measured
the subjects’ response latency (the dependent
variable) to .01 second (the time between the signal
“throw switch if you wish to comment” and the
time the circuit was broken by the switch being 
thrown). The subject's masking-task opinion-change 
dial was attached to a synchronous motor which 
drove a matching dial in the control room, enabling 
the experimenter to determine when the subject 
was finished with the intertrial masking task and 
therefore ready for a new trial. 


Deception and Masking Task 


The experiment was presented to the subject as a 
study of opinion change. The subjects were told:
“We are interested in how your opinions may be
affected by what someone else says; how your
opinions may be affected by what you yourself say;
and how what you say may affect the opinions of 
someone else.” After each statement by the subject 
and reply by the confederate, the subject indicated 
whether he changed his opinion by moving the 
opinion-change dial. Of 30 subjects given a question- 
naire after the experiment, all believed this decep- 
tion; on an open-ended question, no one even men- 
tioned the switch, which appeared to them as a 
mere incidental to the task of opinion change through 
conversation. The deception was used to mask the
learning task so that the conditioning process would 
not be overridden by the subjects’ normal use of 
their higher mental processes (Spence, 1966). 


Procedure 


Attitude testing. In order to select opinions with
which each subject agreed or disagreed before the 
experiment proper, all students enrolled in the intro- 
ductory psychology courses were given a 50-item 
questionnaire several days prior to the beginning of 
the experiment. The questionnaire dealt with current 
controversial issues such as family size regulation 
by the government, forced integration, the war in 
Vietnam, socialized medicine, etc. Students were asked 
to respond to each item on a 7-point scale, ranging 
from “strongly agree” to “strongly disagree.” After 
completing the 50-item questionnaire, all students 
were given a second sheet with the instructions to
rank order the 15 items they were most interested
in and the 10 items they were least interested in.
Because of the reinterpretation of attraction data 
for the present experiment, item interest was an im- 
portant part of pretesting. For the purpose of the 
present experiment, it was assumed that interest in 
an item determined its drive-arousing properties if 
disagreed with. 

Acquisition. As each subject arrived for the experi- 
ment, the experimenter explained that the other 
“subject” (the confederate, referred to as Subject A) 
had not yet arrived. After a short time both subjects 
received the deceptive rationale and the operating 
instructions over the intercom (the confederate was 
in the control room with the experimenter through- 
out the course of the experiment). It was explained 
that Subject A (the confederate) was in another 
room and had been provided with a list of topics 
they were to discuss. The subject was told that
having the two subjects in separate rooms would
ensure confidentiality of his/her candid opinions on 
controversial issues. Both subjects were asked to 
keep their statements to about 20 seconds if they 
could, but they were never interrupted if they spoke 
more than 20 seconds, since interruption would have 
been punishing (Mandler & Watson, 1966). During 
acquisition all subjects were given six reinforced and 
six nonreinforced trials in four prearranged random 
orders (with the exceptions of Trials 1, 2, and 11; 
Trials 1 and 11 were always reinforced, while Trial 2 
was always nonreinforced).
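
The acquisition schedule described above can be illustrated with a minimal sketch. The function name `make_schedule` and its parameters are hypothetical and are not taken from the original procedure; the sketch draws a single random order, whereas the experiment used four prearranged random orders whose exact sequences are not reported.

```python
import random


def make_schedule(n_trials=12, n_reinforced=6, always_on=(1, 11),
                  always_off=(2,), seed=None):
    """Return a list of booleans, one per trial (index 0 is Trial 1).

    True marks a reinforced trial. Trials in `always_on` are always
    reinforced and trials in `always_off` never are; the remaining
    reinforced trials are drawn at random (an illustrative assumption).
    """
    rng = random.Random(seed)
    fixed = set(always_on) | set(always_off)
    free = [t for t in range(1, n_trials + 1) if t not in fixed]
    # Fill the remaining reinforced slots from the unconstrained trials.
    extra = rng.sample(free, n_reinforced - len(always_on))
    reinforced = set(always_on) | set(extra)
    return [t in reinforced for t in range(1, n_trials + 1)]


schedule = make_schedule(seed=0)
```

With the defaults, any generated order contains six reinforced and six nonreinforced trials, with Trials 1 and 11 reinforced and Trial 2 nonreinforced.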

An experimental trial began with the “listen” 
signal. In the yield condition, the confederate read 
the opinion statement and commented on it. In the 
agree and disagree conditions, the confederate just 
read the topic. When the confederate either finished 
commenting on the topic or reading it, the experi- 
menter operated the control switch which both 
(a) presented the conditioned stimulus, the signal 
“throw switch if you wish to comment,” and (b) 
started the latency timer. When the subject threw 
the comment switch, the latency timer stopped. 
Latency was recorded and converted to speed. If the 
trial was reinforced, the confederate agreed with the 
subject. If the subject did not throw the comment 
lever within 20 seconds, latency was considered to be 
infinite, and speed counted as zero. In this case an
unscored makeup trial was given to equalize the 
number of reinforcements. On very infrequent oc- 
casions (8 of 672 trials in both experiments), it was 
impossible to yield to the subject, because the sub- 
ject changed his mind to the confederate’s position 
after listening to the confederate’s opinion. In such 
rare instances, the regular procedure was to complete 
the trial by the stooge’s affirming his opinion, and 
to then control the actual number of yield trials 
by giving an unscored makeup trial. 
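
The conversion from latency to speed can be sketched as follows. The function name `response_speed` is hypothetical; the scale factor of 100 follows the 100/latency measure reported in the Results, and treating a latency of exactly 20 seconds as a response is an assumption, since the text says only "within 20 seconds."

```python
def response_speed(latency_s, timeout_s=20.0, scale=100.0):
    """Convert a response latency in seconds to a speed score.

    A missing response (None) or a latency beyond the timeout is treated
    as infinite latency, so the speed is scored as zero.
    """
    if latency_s is None or latency_s > timeout_s:
        return 0.0
    if latency_s <= 0:
        raise ValueError("latency must be positive")
    return scale / latency_s


def block_mean(speeds):
    """Mean speed for a block of trials (blocks of two in the figures)."""
    return sum(speeds) / len(speeds)


# Hypothetical latencies, not data from the experiment.
example_block = block_mean([response_speed(2.0), response_speed(4.0)])
```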

Extinction. All subjects were given six extinction 
trials in which three high-interest and three low-interest
items were used. In extinction, the effectance
initially aroused was not reduced following the 
instrumental response, just as in escape conditioning 
where the shock is not reduced following the instru- 
mental response. In the yield condition, the con- 
federate maintained the initial disagreeing position 
after the subject made the instrumental response and 
gave his/her opinion. In the agree and disagree
conditions, the initial effectance aroused by reading
the topic was not reduced because the confederate
disagreed with the subject after the subject made the
response and gave his/her opinion.


Drive Manipulations 


All groups received the first part of the drive 
manipulations as part of the instructions. Each sub- 
ject was told that the confederate had a list of 
attitudinal statements that they would be discussing. 
In an attempt to insure some initial effectance 
arousal, the following sentence was included in the 
instructions: “Due to the nature of the opinions that 
will be expressed during the course of your conversation,
it is possible that there will be some initial
disagreement between the two of you.” For the
agree and disagree conditions, it was assumed that 
this statement plus the fact that the confederate 
disagreed with the subject on Trial 2 would assure 
some arousal from Trial 3 on through acquisition as 
soon as the opinion statement was read. The assump- 
tion of arousal from Trial 3 on was based on the 
hypothesis that agreement and disagreement on 
Trials 1 and 2, respectively, would create a certain 
amount of uncertainty in the subject; that is, the 
subject would be expressing his opinion but had no 
indication of how the confederate felt about the 
issue since he was agreed with and then disagreed 
with. For the yield group it was assumed that the
instructions plus the added initial disagreement by 
the confederate before the subject commented would 
lead to a higher state of arousal than in the agree 
and disagree conditions. 


Results 


Figure 1 shows the mean response speed 
(100/latency) for acquisition and extinction 
in blocks of two trials. Although response 
speeds for the three groups on the first trial 
block are approximately equal, the improved 
speed over trials indicates that the subjects 
learned the instrumental response. However, 
the diverging curves indicate that response 
speeds and therefore learning increased differentially
during the course of acquisition.


FIG. 1. Acquisition and extinction curves of response
speed under three conditions of social reinforcement:
yielding, agreement, and disagreement.


Figure 1 also shows decreasing response
speeds in all groups throughout extinction.


Acquisition


It is apparent from the curves pre- 
sented in Figure 1 that response speeds 
in the yield condition (large magnitude 
of reinforcement) diverge from those of the 
agree and disagree conditions. The prediction 
that asymptotic response speeds would have 
the specified rank order (yield, agree, disagree)
was tested by a Jonckheere test (Jonckheere,
1954; Kirk, 1968; Siegel, 1956). The
Jonckheere is an elegant and ideally suited
test for examining both the rank orderings 
of groups and the differences between them 
in a single operation. Results of a Jonckheere 
test performed on the grand means of the last 
four trials of acquisition indicated that group 
response speeds were in the hypothesized rank 
order and that speeds over the last four trials 
of acquisition for the three groups were sig- 
nificantly different from each other (z = 4.11, 
p < .0003).

A 3 X 6 repeated-measures analysis of variance
(Magnitude of Reinforcement X Discrete
Blocks of Two Trials) indicated a significant
increase in response speeds over trials
(F = 19.40, df = 5/525, p < .001), and that
the diverging curves presented in Figure 1
represented the hypothesized significant Trials
X Magnitude of Reinforcement interaction
(F = 2.36, df = 10/525, p < .01). Further
support for the effects of magnitude of rein- 
forcement on response speeds was denoted by 
a significant magnitude of reinforcement main 
effect (F = 6.52, df = 2/107, p < .005). Re- 
sults of a trend analysis performed on the 
acquisition data of the disagree condition indi- 
cated the presence of both a significant linear 
component (F = 5.05, df = 1/175, p < .05) 
and a significant quadratic component (F = 
10.58, df = 1/175, p < .005). 
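
As an illustration of the ordered-alternatives test used above, the following is a minimal sketch of the Jonckheere statistic with its large-sample normal approximation. It assumes the standard no-ties variance formula and is not the authors' computation; the function name and the sample values are hypothetical.

```python
import math


def jonckheere_z(groups):
    """Jonckheere test for ordered alternatives.

    `groups` is a list of samples ordered from the predicted-lowest to
    the predicted-highest group. Returns (J, z), where J counts pairs
    that fall in the predicted order (ties count one half) and z is the
    normal approximation using the no-ties variance.
    """
    j = 0.0
    for a in range(len(groups) - 1):
        for b in range(a + 1, len(groups)):
            for x in groups[a]:
                for y in groups[b]:
                    if x < y:
                        j += 1.0
                    elif x == y:
                        j += 0.5
    sizes = [len(g) for g in groups]
    n = sum(sizes)
    mean = (n * n - sum(s * s for s in sizes)) / 4.0
    var = (n * n * (2 * n + 3)
           - sum(s * s * (2 * s + 3) for s in sizes)) / 72.0
    return j, (j - mean) / math.sqrt(var)


# Hypothetical speeds ordered disagree, agree, yield (not the study's data).
j_stat, z_stat = jonckheere_z([[1, 2], [3, 4], [5, 6]])
```

A large positive z indicates that the groups fall in the predicted order more often than chance would produce, which is the sense in which a single test examines both the rank ordering and the differences between groups.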


Extinction 


Response speed was a decreasing func- 
tion of the number of extinction trials, 
as presented in Figure 1. Results of a 
3 X 3 repeated-measures analysis of variance 
(Magnitude of Reinforcement X Discrete 
Blocks of Two Trials) indicated a highly significant
magnitude of reinforcement main effect
(F = 11.75, df = 2/105, p < .001) and
a significant trials effect (F = 6.36, df =
2/210, p < .005). However, a nonsignificant
Magnitude of Reinforcement X Trials interaction
was found (F < 1), indicating that extinction
for the three conditions occurred at
approximately the same rate.
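
The degrees of freedom reported for these analyses follow from the design, as the sketch below shows. It assumes the standard partition for a groups-by-trials design with repeated measures on trials; the function name `mixed_anova_df` is hypothetical.

```python
def mixed_anova_df(n_subjects, n_groups, n_blocks):
    """Degrees of freedom (effect, error) for a groups X blocks design
    with repeated measures on the blocks factor."""
    between_error = n_subjects - n_groups
    within_error = (n_blocks - 1) * between_error
    return {
        "groups": (n_groups - 1, between_error),
        "trials": (n_blocks - 1, within_error),
        "interaction": ((n_groups - 1) * (n_blocks - 1), within_error),
    }


# Experiment I: 108 subjects in 3 groups.
extinction = mixed_anova_df(108, 3, 3)   # three blocks of two trials
acquisition = mixed_anova_df(108, 3, 6)  # six blocks of two trials
```

For extinction this gives 2/105 for the main effect and 2/210 for trials; for acquisition it gives 5/525 for trials and 10/525 for the interaction, which agree with the values reported in the text.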


EXPERIMENT II 


Because the theoretical explanation of Ex- 
periment I derives from research on inter- 
personal attraction, it seemed highly desirable 
to establish a direct connection between the 
present research and attraction research. Fur- 
thermore, gathering attraction data based on 
the procedure of the first experiment afforded 
the opportunity to see if the results of the 
first experiment were replicable. Therefore, 
Experiment I was essentially repeated, with 
a measure of attraction substituted for the 
extinction procedure in an attempt to study 
the effects of magnitude of reinforcement on 
attraction. 

The theoretical assumptions used to predict 
the sequence effects on speed scores in Experi- 
ment I should also serve to predict attraction. 
Attraction scores on the Interpersonal Judgment
Scale (IJS) (Byrne, 1961) should
exhibit the same rank as did the yield, agree, 
and disagree speed scores in Experiment I; 
that is, subjects would be most attracted to 
the confederate in the yield condition, least 
attracted to the confederate in the disagree 
condition, and attracted an intermediate de- 
gree to the confederate in the agree condition. 
Following the first experiment, it was further 
hypothesized that (a) subjects would acquire
the same instrumental response used in the 
first experiment in order to listen to a person 
yield or agree; (b) asymptotic response speeds
in the present experiment would have the 
same rank order as in Experiment I, that is, 
yield, agree, disagree; and (c) there would 
be a significant Magnitude of Reinforcement 
X Trials interaction. 


Method 


Subjects and Design 


Sixty subjects were randomly assigned to the 
yield, agree, and disagree groups. Each group had
16 males and four females.


Modifications in Procedure


The apparatus and procedures were identical to those
used in Experiment I, except for three modifications:
(a) The number of acquisition trials was reduced 
from 12 to 10 (however, in all groups, reinforce- 
ments were still given on only 50% of the 10 acqui- 
sition trials in four prearranged random orders, with 
the exception that Trials 1 and 9 were always rein- 
forced). (b) Extinction trials were not included. 
Attraction could not be effectively measured in the 
first experiment since any reinforcement effect would 
be nullified by the extinction procedure. (c) The
IJS was given immediately after acquisition. Im- 
mediately after the last acquisition trial, the experi- 
menter asked “both” subjects to remain in their 
rooms because he wanted some additional information
from them. The experimenter then asked the
subject to rate the confederate on the IJS. The IJS 
consists of six 7-point scales dealing with evaluative 
items such as intelligence, knowledge of current 
events, morality, and adjustment. The last two 7-point
scales, likability and desirability as a work
partner, constitute the measure of attraction, and 
they are added together to yield an attraction score 
which can range from 2 to 14. 
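
The scoring rule for the attraction measure can be written out as a short sketch. The function name `attraction_score` and its argument names are hypothetical; only the two-item sum and the 7-point range come from the description above.

```python
def attraction_score(likability, work_partner):
    """Sum the two 7-point IJS attraction items into one score (2 to 14)."""
    for rating in (likability, work_partner):
        if not 1 <= rating <= 7:
            raise ValueError("each item must be rated from 1 to 7")
    return likability + work_partner


# Hypothetical ratings, not data from the experiment.
lowest = attraction_score(1, 1)
highest = attraction_score(7, 7)
```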


Results 


Figure 2 shows the mean response speed 
(100/latency) for acquisition in blocks of two 
trials. As in the first experiment, response 
speeds for the three groups on the first trial 
block are approximately equal, and improved 
speed over trials indicates that the subjects 
learned the instrumental response. The diverg- 
ing curves indicate that response speed and 
therefore learning increased differentially 
during the course of acquisition—as it did in 
the first experiment. 


Speed 


It is apparent from the curves presented 
in Figure 2 that the various magnitudes of 
reinforcement led to diverging curves as in 
the first experiment. A Jonckheere test per- 
formed on the grand means of the last four 
trials indicated that the three groups were 
significantly different from each other and 
were in the predicted rank order: yield, agree, 
disagree (z = 2.60, p < .004).

A 3 X 6 repeated-measures analysis of variance
(Magnitude of Reinforcement X Discrete
Blocks of Two Trials) indicated a significant
increase in response speeds over trials
(F = 8.12, df = 5/285, p < .001) and that
the diverging curves presented in Figure 2
represented the hypothesized significant Magnitude
of Reinforcement X Trials interaction
(F = 2.61, df = 10/285, p < .01). The analysis
also indicated a significant magnitude of
reinforcement main effect (F = …, df =
2/57, p < .01). A trend analysis performed
on the acquisition data of the disagree group
indicated the presence of a significant quadratic
trend (F = 8.86, df = 1/95, p < .01).


Attraction


Separate one-way analyses of variance of
IJS item scores as a function of magnitude of
reinforcement were computed. Table 1 shows
the means, standard deviations, and F ratios
of the IJS items for the three experimental
groups. The degrees of freedom for all one-way
analyses of variance were 2 and 57. The
effects of magnitude of reinforcement on the
attraction scores were statistically significant
in the expected direction (p < .001). The
Newman-Keuls test of individual comparisons
of magnitude of reinforcement on the
attraction scores (all comparisons were tested
at the .01 level) indicated that all comparisons
between the three experimental groups
were significant.


TABLE 1

MEANS, STANDARD DEVIATIONS, AND F RATIOS OF
EVALUATION AND ATTRACTION (IJS) SCORES FOR
THREE MAGNITUDES OF REINFORCEMENT

                    Magnitude of reinforcement
IJS items       Low           Moderate      Large         F
                M      SD     M      SD     M      SD
Attraction      5.90   2.40   9.95   1.97   12.70  1.51   11.00**
…               3.60   1.22   …      …      …      …      …
Adjustment      4.60   1.15   …      …      …      …      …
…               5.75   .95    …      …      …      …      …
…               5.15   .96    5.20   .87    5.25   .94    <1

* p < .005.
** p < .001.


IJS Evaluative Items 


Examination of Table 1 indicates that significant
differences were found between groups
on the evaluative items dealing with morality 
and adjustment. The Newman-Keuls test (all 
comparisons tested at the .01 level) performed 
on the morality and adjustment items indicated 
a nonsignificant difference between the yield 
and agree conditions on both items. However, 
the disagree condition was significantly different
from both the agree and yield conditions
on both items.


DISCUSSION


Acquisition 

In general, the shape of the curves pre- 
sented in Figures 1 and 2 and the significant 
Magnitude of Reinforcement X Trials inter- 
action found in both experiments appeared 
to support the assumption that the present 
paradigm was analogous to the instrumental 
escape conditioning paradigms of Bower et al. 
(1959) and McAllister and McAllister (1967). 
The acquisition results of both experiments 
also indicate that the present experiments are 
analogous to those familiar experiments (e.g., 
Franchina, 1969; Trapold & Fowler, 1960) 
in which magnitude of reinforcement is 
varied simultaneously with the intensity of 
drive, so that subjects who have the termination
of a high drive as reinforcement receive
a larger magnitude of reinforcement than
those who receive the termination of a low 
drive as their reinforcement. A recent experi- 
ment by Lombardo, Libkuman, and Weiss 
(1970) supported the disagreement-drive-arousal
assumptions of the present experiments.
The experiment used the classic paired-associate
learning task of Spence, Farber, and
McFann (1956) in a 2 X 2 factorial design
with agreement-disagreement and simple or
complex paired-associate learning tasks as the
factors. The findings clearly indicated that
disagreement possesses general drive properties,
in that disagreement (drive arousal)
interacted with task complexity, facilitating
performance when the correct responses were
dominant (simple task) and impeding performance
when incorrect responses were
dominant (complex task).

The results of both Experiment I and Ex- 
periment II support the hypothesis that sub- 
jects would acquire an instrumental response 
when given the opportunity to listen to an- 
other person either yield or agree with them, 
and further supported the hypothesis that 
subjects in the yield conditions would exhibit 
faster response speeds than subjects in the
agree and disagree conditions throughout acquisition.
Differential response speeds in the
present experiments could only be attributed
to (a) type of agreement (yield versus
agree) or (b) the subjects’ interest in the 
items presented to them in the different con- 
ditions. Any differences between groups must 
be due to one of these variables since the 
number of reinforcements was equated for
all groups in both experiments. 

Since both item interest and number of 
reinforcements were equated in the yield and 
agree groups, the observed differences in re- 
sponse speed could only be attributed to the 
differences in the magnitude of the drive 
aroused and consequently reduced by the con- 
federate’s agreement with the subject in the 
yield groups.³ Therefore, it appears that reading
an attitudinal statement and presenting
an opinion contrary to the subject’s was suc- 
cessful in inducing a higher state of arousal 
than simply reading the opinion statement 
and not commenting on it. The present ex- 


IJt may be noted that no control condition for 
the yield condition exists in which the stooge begins 
by agreeing rather than disagreeing. In principle 
there is a remote possibility that subjects might argue 
against their own positions simply to demonstrate 
efficacy in disputation and be rewarded by evidence 
of their effectiveness in such disputation (having the 
confederate yield). In practice, however, extensive 
unpublished data from our laboratory involving such 
conversations indicate that subjects do not in fact 
argue against someone who initially agrees with them. 
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periments also provided support for the 
disagreement-arousal and arousal-reduction 
hypotheses of Stapert and Clore (1969). 
However, the results of both Experiment. I 


and Experiment II appeared to extend the 


arousal hypothesis beyond that of arousal 
produced only through disagreement. Evi- 
dence for extension of the arousal hypothesis 
comes from the increasing response speeds 
across trials of the agree condition. Acquisi- 
tion and maintenance of the instrumental 
response appears to support the assumption 
that the instructions were effective in arousing 
drive, and that agreement by the confederate 
teduced a heightened state of arousal in the 
subject created by not knowing whether or 
not the confederate would agree or disagree 
with him, 

Further support for the assumption that 
disagreement produced a state analogous to 
that produced by the presentation of an 
aversive stimulus, and that agreement reduced 
the noxious state, came from the acquisition 
data of the disagree groups in both experi- 
ments, that is, speeds during acquisition. This 
result supported the assumption that the 
mechanism for acquisition of the instrumental 
response was the termination or offset of an 
aversive stimulus. Response suppression in 
the disagree groups could only have been 
caused by the aversive nature of disagreement 
on topics of high interest to the subject, be- 
cause the number of reinforcements in the 
agree and disagree conditions were equated. 
Support for this conclusion comes from sev- 
eral experiments by Byrne and his associates 
(e.g., Byrne et al., 1968; Byrne, Young, &
Griffitt, 1966). Both of these studies showed 
the aversive nature of disagreeing opinion 
statements by showing that disagreeing opin- 
ion statements presented after a response led 
to suppression of that response. 

The observed differences between the agree 
and disagree conditions can further be ex- 
plained by a slight reinterpretation of experi- 
ments dealing with item interest and attrac- 
tion. Clore and Baldridge (1968) have shown 
that subjects were more attracted to others 
who agreed with them on high-interest items 
and less attracted to others who agreed with 
them on low-interest items. In the context 
of the present experiments and their results,
this suggests that the interest a subject has
in a particular opinion statement determines 
both its effectiveness as a reinforcer if agreed 
with and its drive-arousing properties if dis- 
agreed with. Therefore, one would expect sub- 
jects receiving reinforcements (agreements) 
on high-interest items to respond faster than 
subjects receiving the same number of agree- 
ments but on items of low interest to the 
subject. Based on this assumption and the 
results of the experiments of Byrne et al. 
(1968) and Byrne, Young, and Griffitt 
(1966), the initial increase in response 
speeds of the subjects in the disagree condi- 
tion followed by a decrease in response speeds 
would be expected as the negative effects of 
disagreement on high-interest items negated 
any reinforcing effects of agreement on items 
of low interest. 

To fully demonstrate the reinforcing prop-
erties of a stimulus, it is also necessary to
show that withdrawal of that stimulus or 
nonpresentation of it after the response leads 
to a reduction in the strength of that response. 
In Experiment I it was necessary to show that 
withdrawal of agreement would lead to de- 
creasing response speeds. Examination of the 
extinction data presented in Figure 1 and the 
significant trials effect in the analysis of the 
extinction data does in fact show that with- 
drawal of agreement led to response decrement 
and further supported the hypothesis that 
agreement was reinforcing. 

Although our experiments were based on 
Byrne’s effectance theory, it is possible to 
interpret the results in terms of dissonance 
arousal and dissonance reduction through so- 
cial support: “The existence of disagreement 
among members of a group on some issue or 
opinion, if perceived by the members, cer- 
tainly produces cognitive dissonance,” and 
one “way of reducing the dissonance would 
be to influence those persons who disagree 
to change their opinions so that it more closely 
corresponds to one’s own [Festinger, 1957, pp.
178, 182].” Furthermore, several experiments
have supported the assertion that dissonance
has the functional properties of drive (Cottrell
& Wack, 1967; Waterman, 1969). Therefore, 
if disagreement produces dissonance, and dis- 
sonance has the functional properties of drive, 
reduction of it through agreement should
reinforce the instrumental response that im-
mediately preceded reduction of the disso-
nance. The yield condition of both experi-
ments, by having the confederate initially
disagree with the subject, could have led to
dissonance arousal in the subject. If this is 
so, then the subject’s reply could be inter- 
preted as behavior that is “directed to return- 
ing the organism to a state in which the drive 
is at a lower level of arousal [Brehm & 
Cohen, 1962, p. 224].” Once the subject re- 
plies that the confederate yields, it can be 
assumed that the state of arousal created by 
dissonance through disagreement was reduced, 
and that reduction of dissonance through 
yielding should have reinforced the instru- 
mental response that immediately preceded 
dissonance reduction. 

Dissonance processes are apparent not only 
in the instrumental learning situation itself, 
but also on the evaluative items of the TJS. 
The significantly lower attraction scores and 
the lower morality and adjustment scores for 
the disagree condition could be attributed to 
a source derogation effect (Festinger, 1957) 
as a result of the confederate leaving the sub- 
ject in a high state of drive. That is, given 
the opportunity, subjects not only say they 
dislike the confederate, but they derogate him 
by describing him as immoral and malad- 
justed. 


Attraction 


The theoretical model of the attitude- 
attraction relationship (Byrne, 1969; Clore & 
Byrne, in press) assumes that attitude state- 
ments elicit implicit positive and negative 
affective responses which in turn mediate 
explicit evaluative responses. The model fur- 
ther states that positive or negative affect may 
become conditioned to strangers by pairing 
them with agreeing or disagreeing attitude 
statements. By pairing a stimulus person with 
the attitude statements, the person becomes 
a conditioned stimulus for the affective re- 
sponses. The direction and intensity of the 
affective responses toward a given stimulus 
person are determined by the number of simi-
lar attitudes relative to the total number of
attitudes, and the experienced quality of 
these implicit responses is reflected in the 
subject’s self-reported like or dislike of the 
stimulus person. Since the process underlying
attraction is based on conditioning, attraction
toward others (the rate of acquisition and
the resulting conditioned affective response) 
should vary if the magnitude of reinforce- 
ment associated with that stimulus person 
is varied. In the present paper the magnitude 
of reinforcement associated with a stimulus 
person was manipulated by varying the 
amount of drive reduction associated with 
that stimulus person by varying the interest 
the subjects had in the items (agree versus 
disagree conditions) and the form of agree- 
ment (yield versus agree). 

If the attraction results of Experiment II 
are examined within the framework of the
reinforcement model proposed by Byrne
(1969) and Clore and Byrne (in press), it
is immediately apparent that disagreement 
(drive arousal) followed by agreement (drive 
reduction) has greater reinforcement value 
than agreement not preceded by disagree- 
ment. Since both the number of reinforce- 
ments and item interest were equated in the 
yield and agree conditions, the significantly 
higher attraction scores of the subjects to the 
confederates in the yield condition could only 
be attributed to the differences in the magni- 
tude of reinforcement that the subject re- 
ceived in the form of drive reduction by the 
confederates’ yielding. This assumption gains 
ample support from the significantly faster 
response speeds of the yield condition in 
acquisition of both experiments. 

The extremely powerful effect of the yield 
manipulation on attraction becomes very ap- 
parent when the expected attraction score 
based on a 50% reinforcement schedule is 
found using the equation proposed by Byrne 
and Nelson (1965). Based on the equation 
Y = 5.44X + 6.62, the expected attraction 
score is 9.34, which is virtually identical with 
the attraction score of 9.95 found in the agree 
condition. However, the expected attraction 
score of 9.34 is substantially lower than the 
mean attraction score of 12.70 found in the 
yield condition. In fact, the mean attraction
score of 12.70 for the yield condition is
slightly higher than the expected attraction
score of 12.06 based on 100% reinforcement
(complete agreement on all items). It there-
fore appears that yielding on items of high
interest to the subject completely overcame
any negative effects of disagreement on the
low-interest-item trials and in fact led to a 
condition that was similar to 100% reinforce- 
ment (agreement on all items). The attraction 
results of Experiment II, therefore, lend con- 
siderable support to the “sequence” effect first 
proposed by Stapert and Clore (1969) and 
further show that it is an extremely power- 
ful effect when applied to a discrete-trials 
situation. 
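
The arithmetic behind these comparisons can be checked with a short sketch. It assumes that X in the Byrne and Nelson (1965) equation is the proportion of agreements, expressed between 0 and 1, which is consistent with the expected scores of 9.34 and 12.06 quoted above. The function name and the dictionary of observed means are illustrative only; the means themselves are the Experiment II values reported in the text.

```python
# Linear function quoted in the text from Byrne and Nelson (1965):
# Y = 5.44X + 6.62, with X assumed to be the proportion of agreements.
SLOPE = 5.44
INTERCEPT = 6.62


def expected_attraction(proportion_agreement: float) -> float:
    """Return the expected attraction score for a proportion of agreements."""
    if not 0.0 <= proportion_agreement <= 1.0:
        raise ValueError("proportion_agreement must lie between 0 and 1")
    return SLOPE * proportion_agreement + INTERCEPT


# Mean attraction scores reported in the text for Experiment II.
observed = {"agree": 9.95, "yield": 12.70, "disagree": 5.90}

half = round(expected_attraction(0.5), 2)  # 50% reinforcement schedule
full = round(expected_attraction(1.0), 2)  # 100% reinforcement

for condition, mean in observed.items():
    # Difference between each observed mean and the 50% expectation.
    print(f"{condition:>8}: observed {mean:.2f}, "
          f"observed minus 50% expectation {mean - half:+.2f}")
print(f"expected at 50%: {half:.2f}; expected at 100%: {full:.2f}")
```

Read this way, the agree mean sits close to the 50% expectation, while the yield mean exceeds even the 100% expectation, which is the comparison the paragraph above draws.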

In order to explain the significantly lower 
mean attraction score of the disagree condi- 
tion, item interest must be considered. When 
the subject’s interest in the items is con- 
sidered, the mean attraction score of 5.90 for 
the disagree condition is within the bounds 
of attraction theory and can be explained by 
it. In an experiment by Clore and Baldridge 
(1968), item weight (interest) was manipu- 
lated as in the present experiments. In their 
experiment, subjects were presented with 
questionnaires that were constructed in such 
a way that the bogus stranger either agreed 
with the subject on high-interest topics and 
disagreed on low-interest topics, or agreed on 
low-interest topics and disagreed on high- 
interest topics. These two conditions are simi- 
lar to the agree and disagree conditions of 
the present experiments. The results of 
their 50% reinforcement conditions showed a 
striking resemblance to the attraction data 
obtained in Experiment II of the present 
paper, both in the significant difference be- 
tween agree and disagree groups and even in 
the actual numerical values of attraction 
scores. The agree conditions of Clore and 
Baldridge and our Experiment II yielded 
nearly identical attraction scores of 9.43 and 
9.95, respectively, as compared with attrac- 
tion scores of 5.43 and 5.90 in the disagree 
conditions. The present experiments, there- 
fore, lend considerable support to the conclu- 
sion of Clore and Baldridge that item interest 
is a potent variable affecting the agreement- 
attraction relationship. 

The findings of the present investigations 
support the assumptions of the reinforcement 
theory of attraction that agreement leads to 
attraction, that yielding leads to greater 
attraction, and that these effects are based 
on reinforcement. The theory was supported 
in experiments employing both measures
of interpersonal attraction and direct tests
of reinforcement effects in instrumental 
conditioning.


MARRIAGE DIMENSIONS AND PERSONALITY


K. BARTON AND R. B. CATTELL


One hundred and eighty-six married graduate students were given a factored
Marital Role Questionnaire, and also Cattell’s Sixteen Personality Factor 
(16 PF) Questionnaire. For each of the 12 marriage dimensions, subjects were 
grouped according to their scores. The “high” and “low” groups were defined 
as those subjects scoring 1 standard deviation above and below the mean on 
a particular dimension. An “intermediate” group was comprised of subjects 
falling between 1 standard deviation above or below the mean. Analyses of 
variance were conducted with three levels of marital dimensions and two levels 
of sex as the independent variables. The dependent variables were each of the
16 personality factors taken in turn. Results indicate several significant main
effects and interactions, and these are discussed in detail.


In the research on marriage, a considerable 
number of studies have involved an examina- 
tion of the relationship between quality of 
marriage and the personality of the partners. 
Some studies have involved the interperson- 
ality differences and similarities between 
spouses. Thus, Udry (1963, 1965, 1967) in- 
vestigated what he termed “complementarity 
in mate selection” and concluded that there 
was no evidence that “one’s need structure 
leads one to perceive traits in the mate op- 
posite to one’s own (1963, p. 289).” Udry 
also examined the influence of the “ideal mate
image” on mate selection and mate perception, 
and found little similarity between an “ideal 
mate” and the actual mate chosen in marriage. 
Udry attempted to use personality variables 
to predict which engaged couples would finally 
marry, but a 5-year follow-up revealed very 
little predictive power in the personality 
variables. 

Other studies have involved attempts to 
define the “quality” of marriages in terms of 
their stability and then endeavor to predict 
this stability from the personality attributes 
of the spouses. Thus, Cattell and Nesselroade 
(1967) divided marriages into “stable” and
“unstable” categories (instability being de-
fined operationally as “any known step being
taken toward dissolution”) and compared the
personality traits of husbands and wives in 
both groups. In general, it was found that 
for the stable group, there were significantly 
more positive correlations between personality 
factors for husband and wife, than for the 
unstable group. 

What is obviously needed in this type of 
research is a study using a better definition 
of marriage dimensions (rather than the gross 
distinction between stable and unstable) and 
established personality factors as predictors. 
The present study attempts to meet the above 
requirements and uses the Marriage Role 
Questionnaire (MRQ) designed by Tharp 
(1963), which measures some 12 dimensions 
or factors in marriage. This questionnaire was 
refactored by Barton, Cattell, and Kawash 
(1971), using an oblique promax rotational 
method with a new sample of married couples, 
and Tharp’s original factors were confirmed. 
The MRQ was originally designed to measure 
a wide variety of possible marriage behaviors, 
and the individual items with factor loadings 
can be found in the appendix of Tharp’s 
study. Personality measures were obtained 
from the Cattell Sixteen Personality Factor 
(16 PF) Questionnaire, as in the Udry and 
Cattell and Nesselroade studies. The 16 PF 
is a factorial test with empirically derived 
dimensions and a history of 21 years of 
research use. 

The major hypothesis is that personality
variables are important in predicting marital
dimensions. The exact nature of the relation-
ship among personality and marriage vari-
ables is not hypothesized, but interaction ef-
fects were considered probable, and the analy-
sis was designed to reveal them should they
exist.


METHOD 
Subjects 


One hundred and eighty-six married graduate stu- 
dents from the University of Illinois were used as 
subjects. Only subjects who would allow their spouse 
to participate in the experiment were accepted, and
each couple was paid $10 for participation in the 
study. The reason that only couples were accepted 
was that in order to make the factor analysis 
comparable to the one in Tharp’s (1963) study, the 
population of subjects had to be as similar as pos- 
sible. As the following Procedure section shows, the 
marital relationship is ignored in the analyses of 
variance. 


Tests 


All subjects were given the MRQ and the 16 PF 
test. Husbands and wives took each test in the same 
session, in order to eliminate any discussion of the 
test which could have influenced either of the spouse’s 
results. A brief description of the personality factors 
is given in Table 1 and the MRQ factors in Table 2. 

The MRQ, as used in this study, contained several 
extra items compared to the original version designed 
by Tharp. All of these additional items referred to 
possible marital problems, involving the contempla- 
tion of divorce or separation; and from the Barton 
et al. (1971) refactoring, it was found that they all 
loaded together on a factor termed Marriage
Instability.


Procedure


All subjects were divided into three groups for
each of the 16 personality variables. The three groups
were labeled “low,” “intermediate,” and “high” for 
each personality variable. The low group was de- 
fined as those persons scoring 1 standard deviation 
below the mean for each personality factor. The 
high group consisted of those subjects who scored 
1 standard deviation above the mean for any given 
personality factor, and the intermediate group con- 
sisted of those subjects whose score on any given 
personality factor was between 1 standard deviation 
above and below the mean. Thus, the personality 
scores were treated as independent variables. The 
dependent variables consisted of the factor scores on
each of the MRQ dimensions. With 16 independent
variables and 12 dependent variables, it was neces- 
sary to perform a total of 192 analyses of variance. 
Because conducting so many analyses on the same
data inflates the “real” alpha level, it was
decided to interpret only results that reached the .01
level. The factor of sex was also incorporated into
the analyses of variance, thus making 2 × 3 factorial
designs with two levels of sex and three levels of 
each personality variable. 
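
The grouping rule and the multiple-testing concern described above can be illustrated with a minimal sketch. Several points are assumptions rather than details given in the text: scores lying exactly 1 standard deviation from the mean are counted as low or high, the sample (n - 1) standard deviation is used, and the familywise error figure treats the 192 tests as independent, which analyses run on the same subjects are not. The scores are hypothetical and the function name is illustrative.

```python
import statistics


def classify(scores):
    """Label each score low, intermediate, or high relative to mean +/- 1 SD."""
    mean = statistics.mean(scores)
    sd = statistics.stdev(scores)  # sample SD; an assumption, not stated in the text
    labels = []
    for score in scores:
        if score <= mean - sd:
            labels.append("low")
        elif score >= mean + sd:
            labels.append("high")
        else:
            labels.append("intermediate")
    return labels


# Hypothetical scores on one personality factor (not data from the study).
example_scores = [2, 4, 5, 5, 6, 6, 7, 9]
groups = classify(example_scores)
print(list(zip(example_scores, groups)))

# Number of analyses of variance described in the Procedure.
n_personality = 16
n_marriage = 12
n_tests = n_personality * n_marriage

# Expected count of chance findings at the .01 level, and the probability of
# at least one such finding if the tests were independent (a simplification).
alpha = 0.01
expected_chance_hits = n_tests * alpha
familywise_error = 1 - (1 - alpha) ** n_tests
print(n_tests, expected_chance_hits, familywise_error)
```

Under these assumptions roughly two of the 192 main-effect tests would be expected to reach the .01 level by chance alone, which is why a strict criterion still calls for cautious interpretation.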

Since, psychologically speaking, second-order per-
sonality factors can be considered as wider influences
contributing to the primary factors and accounting 
for their intercorrelations, it was decided to compute 
values for the four major second-order factors in 
the 16 PF and perform similar analyses to those 
involving the primary factors. These four second- 
order personality factors are (a) exvia (extraversion), 
(b) anxiety, (c) cortertia (cortical alertness, or the 
tendency to think rather than feel), and (d) inde- 
pendence. A detailed description of each of these 
factors can be found in the handbook for the 16 PF
(Cattell, Eber, & Tatsuoka, 1970); but, in brief, (a)
exvia loads on the primary factors of A+, E+, F+,
H+, and Q2-; (b) anxiety loads on C-, H-, L+,
[remaining loadings illegible in source]; (c) cortertia
loads on [loadings illegible in source]; (d) inde-
pendence loads on E+, L+, M+, Q1+, and Q2+
(see Table 1 for factor descriptions).
Actual weights are given in the handbook.


TABLE 1
SIXTEEN PF FACTORS

Factor | Low score | High score
A | Reserved, detached, critical, aloof, stiff | Outgoing, warmhearted, easygoing, participating
B | Low intelligence, dull | High intelligence, bright
C | Emotionally unstable, affected by feelings, easily upset, changeable | Emotionally stable, mature, faces reality, calm
E | Submissive, humble, mild, easily led, docile, accommodating | Dominant, assertive, aggressive, competitive, stubborn
F | Sober, taciturn, serious | Happy-go-lucky, gay, enthusiastic
G | Expedient, disregards rules | [illegible in source], moralistic, staid
H | Shy, timid, threat-sensitive | Venturesome, uninhibited, socially bold
I | Tough-minded, self-reliant, realistic | Tender-minded, sensitive, clinging, overprotected
L | Trusting, accepting conditions | Suspicious, hard to fool
M | Practical, “down-to-earth” concerns | Absent-minded, imaginative, bohemian
N | Artless, forthright, unpretentious, genuine but socially clumsy | Shrewd, astute, polished, socially aware
O | Self-assured, placid, secure, complacent, serene | Apprehensive, self-reproaching, insecure, worrying, troubled
Q1 | Conservative, respecting traditional ideas | Experimenting, liberal, free-thinking
Q2 | Group dependent, a “joiner” and sound follower | Self-sufficient, resourceful, prefers own decisions
Q3 | Undisciplined, lax, follows own urges, careless of social rules | Controlled, exacting will power, socially precise, compulsive, following self-image
Q4 | Relaxed, tranquil, torpid, unfrustrated, composed | Tense, frustrated, driven, overwrought


TABLE 2
MATCHING OF FACTORS AND BRIEF DESCRIPTION

Barton & Cattell factor | Factor match in Tharp study | Factors with no match
I | Sexual Gratification (en): Loads on items indicating fulfillment in sex and affection. |
II | Togetherness and Role Sharing (ex): Loads on items indicating agreement between spouses in such diverse areas as finance, recreation, religion, friends, children, manners, general philosophy, etc. |
III | Wife Adequacy (ex); prefer term Home Devotion: Loads on items indicating that the wife has a high interest in keeping the home clean and neat and that she does most of the housework. |
IV | Participation in Community Affairs (en): Items indicate much social interaction with the community and friends. |
V | Social-Intellectual Equality: Items indicate perceived equality between spouses on IQ and social strata. |
VI | | Marriage Instability: Items indicate that separation and/or divorce have been contemplated. Other item reflects visits to marriage counselors and a statement of general dissatisfaction with the marriage.
VII | Social Integration (en): Items indicate that all members of the family (including the children) do things together often. |
VIII | Work Performance (en): Items reveal whether or not each member of the family has his own set jobs to do. |
IX | Social Influence (en): Items show which spouse has the most social influence on the other, i.e., who influences most the choice of friends, etc. |
X | Masculine Dominance (en); prefer term Spouse Independence: Items show who controls the money, number of children planned, etc. |
XI | Wife Adequacy (en): Items refer to the way in which wives participate in such matters as who earns income, etc. |
XII | Division of Influence (en); prefer term Male Dominance: Items reveal which spouse is the main influence in such matters as spending, sex, child rearing, etc. |

Note. For description of the actual items loading each factor, see Tharp, 1963.
en = Tharp’s “enactment” factors.
ex = Tharp’s “expectation” factors.


RESULTS


All main effects significant at the .01 level
are summarized in Table 3. The interaction
effects are shown in detail in Figure 1, and
second-order main effects can be found in
Table 4. The interaction effects are reported
for the interest of the reader, but care should
be taken in their interpretation, as with so
many interactions being tested, one would
expect by chance alone to find two significant.
There were no significant interaction effects
with the second-order personality factors.


TABLE 3

MRQ factor | Personality variable | Group M, L | Group M, I | Group M, H | No. in cell, L | No. in cell, I | No. in cell, H | F ratio
I: Sexual Gratification | C (Ego strength) | -.11 | .06 | .20 | 17 | 147 | 22 | 5.48
 | O (Guilt proneness) | .42 | [illegible] | -.73 | 19 | 145 | 22 | 7.50
II: Togetherness or Role Sharing | H (Venturesome) | [illegible] | .06 | .33 | 41 | 116 | 29 | 6.23
 | O (Guilt proneness) | .46 | .07 | -.67 | 19 | 145 | 22 | 7.51
 | Q4 (Ergic tension) | .25 | .09 | -.45 | 21 | 123 | 42 | 5.42
III: Home Devotion | A (Outgoing) | -.18 | -.03 | .55 | 33 | 125 | 28 | 6.59
 | G (Superego) | [illegible] | .00 | .74 | 48 | 115 | 23 | 15.20
 | Q3 (Compulsive) | [illegible] | [illegible] | .76 | 38 | 122 | 26 | 18.33
VII: Social Integration | N (Shrewdness) | .30 | .10 | -.37 | 33 | 105 | 48 | 7.54
X: Spouse Independence | E (Dominance) | -.49 | .01 | .33 | 33 | 106 | 47 | 10.59
 | F (Enthusiasm) | -.42 | .18 | .42 | 53 | 125 | 8 | 4.99
 | H (Venturesome) | -.12 | -.06 | .50 | 41 | 116 | 29 | 6.95
XII: Male Dominance | Q1 (Experimenting) | -.65 | -.02 | .14 | 9 | 96 | 81 | 4.97

Note. N = 186. L = low, I = intermediate, H = high. Group M = mean factor score.


TABLE 4
SECOND-ORDER PERSONALITY FACTORS AND THE MRQ (MAIN EFFECTS)

MRQ factor | Second-order personality factor | Group M, L | Group M, I | Group M, H | No. in cell, L | No. in cell, I | No. in cell, H | F ratio | p
I: Sexual Gratification | Anxiety | .54 | .00 | -.07 | 19 | 160 | 7 | 15.55 | <.001
 | Independence | -.87 | .03 | .05 | [illegible] | [illegible] | [illegible] | [illegible] | .039
II: Togetherness or Role Sharing | Anxiety | .69 | -.03 | -.14 | 19 | 160 | 7 | 7.72 | .001
III: Home Devotion | Independence | .95 | -.02 | .02 | [illegible] | [illegible] | [illegible] | 3.74 | .026
IV: Participation in Community Affairs | Exvia | -.27 | .09 | .54 | 43 | 137 | 6 | 3.00 | .052
 | Anxiety | [illegible] | .01 | -.62 | 19 | 160 | 7 | 3.57 | .030
V: Social-Intellectual Equality | Anxiety | .50 | -.02 | -.37 | 19 | 160 | 7 | 8.77 | <.001
VI: Marriage Instability | Cortertia | [illegible] | .04 | .54 | 19 | 156 | 11 | 4.32 | .015
VIII: Work Performance | Anxiety | -.01 | -.04 | .85 | 19 | 160 | 7 | 4.37 | .014
IX: Social Influence | Anxiety | .09 | -.01 | -.90 | 19 | 160 | 7 | 4.93 | .008
X: Spouse Independence | Cortertia | -.51 | .03 | .46 | 19 | 156 | 11 | 6.32 | .002
XII: Male Dominance | Exvia | -.09 | .03 | -.39 | 43 | 137 | 6 | 6.59 | .002
 | Anxiety | .24 | .00 | -.56 | 19 | 160 | 7 | 3.11 | .047

Note. L = low, I = intermediate, H = high.


[Figure 1: three panels plotting mean factor scores for males and females across the low, intermediate, and high groups: Sexual Gratification factor by Superego (G); Sexual Gratification factor by Q2 (group dependent versus self-sufficient); Home Devotion factor by Ego Strength (C). The p values printed on the figure are .006 and .007.]

FIG. 1. Significant interaction effects.


MRQ Factor I (Sexual Gratification)


As can be seen from Table 3, subjects
scoring high on ego strength (emotional sta-
bility) reported high sexual gratification, as
did subjects scoring low on guilt proneness.
On the other hand, subjects high on guilt
proneness or low on ego strength reported low
sexual gratification. Figure 1 shows the inter-
action effects between sex and superego, with
sexual gratification factor scores as the depen-
dent variable. A further breakdown of this
interaction, using Duncan’s multiple-range
test (Winer, 1962), shows that the sex dif-
ference in the “high superego” group was
significant at the .01 level. Males with high
superego reported high sexual gratification,
whereas females with high superego reported
low sexual gratification. A similar breakdown
of the interaction between sex and Q2 (Self-
Sufficiency) revealed that sex differences in
the “low Q2” group were significant at the
.01 level. Group-dependent males scored low
on sexual gratification, while group-dependent
females reported a high degree of sexual
gratification.

At the second-order level, anxiety is highly 
related to sexual gratification as can be seen 
from Table 4. Low-anxious subjects tended to 
report a significantly higher degree of sexual 
gratification than high-anxious subjects. 


MRQ Factor II (Togetherness and Role
Sharing)


This factor involves a high degree of agree- 
ment between spouses on such important 
topics as religion, interests, and general phi- 
losophy of life. Table 3 shows that for both 
husband and wife, highly guilt-prone subjects 
scored low on this MRQ factor, whereas self- 
assured subjects scored high. Tense (high Q4) 
subjects also tended to score low on this 
marriage factor, whereas relaxed subjects 
scored high. Venturesome (high H) subjects 
scored high also, whereas shy people tended 
to score low. 

In terms of second-order personality fac- 
tors, anxiety was related to this marital 
factor, in that high-anxious subjects reported 
less “togetherness and role sharing” than did 
low-anxious subjects. 


MRQ Factor III (Home Devotion) 


Subjects with high superego (conscientious-
ness) and/or high self-sentiment (compulsiv-
ity) tended to be high on this devotion to
home factor, whereas subjects low on either
or both of these personality traits tended not
to report themselves as devoted to the home.
Also, subjects high on Factor A, indicating
a warm and outgoing nature, scored higher
on the devotion to home factor than did sub-
jects low on Factor A (see Table 3). Figure 1
indicates the interaction effect between sex
and ego strength; and Duncan’s multiple-
range test, when applied to this interaction,
revealed sex differences in both the low- and
high-ego-strength groups (p < .01). Specifi-
cally, males with low ego strength scored low
on “home devotion,” and females with low
ego strength scored high on “home devotion,”
whereas this effect was reversed in the high-
ego-strength group. At the second-order level,
low-independent subjects reported higher de-
votion to home than did high-independent
subjects.


MRQ Factor IV (Participation in Community 
Affairs) 


There were no significant effects involving 
this marital factor and the primary person-
ality factors. However, at the second-order
level, both exvia and anxiety were related to
MRQ(IV). Subjects scoring high on exvia 
reported higher participation in community 
affairs than did subjects scoring low on exvia. 
The opposite was true for the anxiety factor 
(see Table 4). 


MRQ Factor V (Social-Intellectual Equality)


Again, there were no significant effects with
the primary personality factors, but at the
second-order level, high-anxious subjects reported
low “social-intellectual equality,” whereas
low-anxious subjects reported high social-
intellectual equality (see Table 4).


MRQ Factor VI (Marriage Instability) 


At the second-order level, marriage instabil-
ity scores were higher in subjects who were
high on cortertia (i.e., tended to use cognition
rather than feelings in problem solving) than
in subjects low on cortertia (see Table 4).


MRQ Factor VII (Social Integration) 


This marital factor involved much contact 
with children, and Table 3 indicates that 
subjects high on this factor tended to be 
naive or artless rather than shrewd or 
sophisticated. 


MRQ Factor VIII (Work Performance) and 
MRQ Factor IX (Social Influence)


Table 4 indicates that highly anxious sub- 
jects tended to be high on “work perform- 
ance” but low in “social influence.” The 
opposite was true for low-anxious subjects. 


MRQ Factor X (Spouse Independence) 


In terms of the primary personality factors, 
subjects scoring high on “spouse indepen- 
dency” tended to be dominant, enthusiastic, 
and venturesome. At the second-order level, 
such subjects obtained high cortertia scores. 
The opposite was true for subjects low on 
“spouse independence.” 


MRQ Factor XII (Male Dominance)


Because of the low N in the “low Q1”
group, interpretation of the findings in Table 3
should be cautious. Results indicated that
subjects high on Q1 (experimenting or radi-
cal) reported a high degree of male dominance
in their marriage. At the second-order level,
subjects with high exvia and/or high anxiety
scores reported less “male dominance” than
subjects with low anxiety or exvia scores (see
Table 4).


CONCLUSIONS 


In drawing conclusions from response-
response analyses of the kind reported
above, one has to recognize the possibility
that some of the relationships may
be due to perceived similarities in the
concepts about which the subjects are report-
ing. With this limitation in mind,
the following conclusions are suggested.

The results obtained, relating to the per- 
sonality correlates of “sexual gratification” 
make good intuitive sense, in that subjects 
with high ego strength would be expected to 
be able to solve sexual problems in a more 
satisfactory way than would subjects with 
low ego strength or emotional stability. Again, 
with guilt proneness, it can easily be under- 
stood why a high proneness to guilt could 
lead to a low degree of sexual gratification. 
However, care must be taken not to assume 
any direction of causation in this relation- 
ship, as it is possible that a low degree of 
sexual gratification could, indeed, cause the 
guilt proneness. Research involving a manipu-
lative technique would solve this problem of
directionality. 

With respect to the factor of “togetherness 
and role sharing,” the fact that anxiety, guilt 
proneness, and shyness are not conducive to 
a high degree of agreement on many issues 
is hardly surprising. The highly anxious or 
guilty person might well be expected to find 
difficulty in the interpersonal relations in- 
volved in togetherness and role sharing, and 
shyness would be expected only to decrease 
the amount of communication. 

It is easy to understand why subjects who 
are conscientious and/or highly compulsive 
would score highly on the “home devotion”
factor, since the very nature of the task of 
keeping the home a neat place might involve 
these factors to a large degree. That depen- 
dent subjects score higher on home devotion 
than do independent subjects might be due 
to the fact that dependent subjects have need 
to please their spouses more than independent 
subjects, and they do so by attending to the 
details of homemaking. The fact that a person 
high on Factor A tends to get a higher home 
devotion score indicates that an easygoing, 
warmhearted nature seems to be related to 
home devotion. It is easy to see how such a 
disposition could help in combating the repeti- 
tivity and routine involved in keeping a home 
neat and tidy. The existence of an interaction
effect between sex and ego strength on the
home devotion factor is a little puzzling,
and no interpretation will be attempted;
as was mentioned earlier, it could well be a
chance artifact.

No comments will be made on the person- 
ality correlates of the rest of the MRQ 
factors, since results seem relatively self- 
explanatory. However, in terms of MRQ 
Factor VI (Marriage Instability), the finding 
that subjects high in cortertia score higher 
on this “instability” factor perhaps deserves 
further comment. It is probable (and perhaps 
many marriage counselors would say very 
possible) that the problems in marriage that 
are reflected in a high “marriage instability” 
score could be solved if both partners were 
able to communicate on the emotional level 
rather than try to solve their difficulties 
through a purely cognitive approach. If this 
were true, then subjects high on cortertia, be- 
cause they function on a rational cognitive 
level, rather than using an emotional ap- 
proach, might well be expected to both have 
more marital problems and be less able to 
solve them. 

These conclusions are tentative in nature, 
and more research is obviously necessary to 
establish that the above relationships are valid 
and, further, to determine the direction of
causality. This study represents an attempt
to identify perceptual covariances between 
self-descriptions and marital descriptions. The 
next logical step is to relate partner differences 
in personality to marital characteristics. 


COOPERATION IN A THREE-PERSON PRISONER’S
DILEMMA ¹


GERALD MARWELL ²
Department of Sociology, University of Wisconsin

DAVID R. SCHMITT
University of Washington

Subjects played either a standard two-person Prisoner’s Dilemma game or an
equivalent three-person version. Despite the fact that an economic model
would predict equal rates of cooperation, triads were substantially less
cooperative. The transition matrices indicated that it was more difficult in the
three-person setting to establish a pattern of cooperation that could be
subsequently maintained.


The number of participants in an interaction
is important for the behavior of those
involved in such social phenomena as bargaining
and cooperation. In this regard,
Weil’s (1966) detailed argument for the de- 
velopment of N-person versions of the two- 
person games that have been extensively 
explored in recent research is instructive. For 
example, real-life analogues to the Prisoner’s 
Dilemma should be as common in multiperson
as they are in two-person settings.
Yet, we have no assurances that empirical
generalizations from the vast amount of 
Prisoner’s Dilemma research are applicable 
beyond the dyad. 

The research reported below is concerned 
with the relationship between behavior in the 
Prisoner’s Dilemma and the number of per- 
sons involved in the interaction. Specifically, 
it compares behavior in two- and three-person 
settings. In order to make this comparison, a 
three-person Prisoner’s Dilemma game was 
developed in which the rewards for specific 
behaviors are exactly the same as those in the 
two-person game. The basic question is 
whether the amount of cooperative behavior 
is identical in the two- and three-person 
settings—as would be expected from an eco- 
nomically rational model—or if the number 
of players has an independent effect on cooperation.
Although there have been previous
studies of interaction in three or more person
games (Bixenstine, Levitt, & Wilson, 1966;
Rapoport, 1962; Rapoport, Chammah, Dwyer,
& Gyr, 1962), none have involved explicit
comparisons of cooperative behavior in equivalent
games with different numbers of players.

¹ This research was supported by Grant GS-1695
from the National Science Foundation. James Besser
assisted as the experimenter. Weldon Johnson and
Shalom Schwartz offered useful suggestions on an
earlier draft.

² Requests for reprints should be sent to Gerald
Marwell, Department of Sociology, Social Science
Building, University of Wisconsin, Madison, Wisconsin
53706.

Our basic hypothesis is that there will be 
less cooperation the greater the number of
players, and that this effect will be consider- 
ably stronger than any extrapolation of re- 
sults from behavior in the two-person game 
would predict. A major conclusion from pre- 
vious research on cooperation in another 
setting (Marwell, Schmitt, & Shotola, 1971;
Schmitt & Marwell, 1971) is that long-term 
cooperation is a relatively fragile phenomenon 
whenever the possibility of interpersonal ex- 
ploitation is present. If either player takes 
advantage of this possibility and exploits his 
partner, the effects on the victim’s subsequent 
behavior tend to be severe and often irre- 
versible. The victim either seeks to exploit in 
turn, even beyond the point of “getting even,” 
or he tries to avoid further association with 
the exploiter. In the Prisoner’s Dilemma, 
competitive behavior is necessarily also ex- 
ploitative, and the cycle of exploitation and 
defection can be readily established. The 
larger the number of players in the game, the 
more likely it is that at least one player will 
begin this exploitative cycle. We would there- 
fore predict that regardless of how many 
people there are in a game, the introduction 
of exploitation as a response into the game 
by a single player tends to produce large 
amounts of exploitation by all players.

Three-Person Prisoner’s Dilemma Game


The development of a three-person Prison- 
er’s Dilemma game was approached through 
the construction of equations by which any 
two-person, two-strategy game could be expanded
into a rationally equivalent N-person,
two-strategy game. Games were defined as 
rationally equivalent when a subject playing 
against others who used random strategies 
would, on the average, receive the same out- 
come from a given strategy choice, regardless 
of the number of other players. 

With this definition, the only part of the 
Prisoner’s Dilemma that must be changed in 
expanding from a two- to an N-person game 
is the outcome matrix. All other aspects of the 
game, such as order of play, level of com- 
munication, and type of available behaviors, 
remain identical. The outcome matrix itself 
remains symmetrical, in that all players have 
the same relationships with all other players. 
Players are “interchangeable.” In other words, 
in the three-person game, Person A’s out- 
comes remain the same if Person B plays 
“left” (L) and Person C plays “right” (R) 
or if B plays “right” and C “left.” 

A symmetrical matrix means that any 
N-person, two-strategy outcome matrix may 
be fully described by the outcomes for any 
individual player. These outcomes are the 
amount he earns for each of his two strategy 
choices under any possible combination of 
strategies which may be specified for the 
remaining N − 1 players. Since we are considering
only two-strategy games, the behavior
of all other players combined may be
summarized by a single number which we
shall call $P_L$. This number represents the
proportion of other players who have chosen 
one of the two strategies (here called L for 
left). 

We are now prepared to expand any 2 × 2
matrix into an N-person game with similar
outcome characteristics. For this expansion,
we must know only the outcomes for each
person in each cell of the original, to-be-expanded,
2 × 2 matrix. Knowing these figures,
we may specify the outcomes in the
expanded matrix for each player. Let us
denote these outcomes for the “player” in
the 2 × 2 matrix as shown in Figure 1.

FIG. 1. To-be-expanded 2 × 2 matrix.


In the expanded game the player’s outcomes
(U) may be specified by the following two
equations: (a) for playing “left”:

$$U_L = aP_L + b(1 - P_L);$$

(b) for playing “right”:

$$U_R = cP_L + d(1 - P_L).$$


These equations are simply those which
would allow the player to choose his best
strategy in a two-person game if $P_L$ were defined
as the probability of the other person
choosing left.

Figure 2 illustrates the use of these equations
in expanding the two-person Prisoner’s
Dilemma matrix shown as Matrix A into the
three-person game shown as Matrix B. Note
that the three-person game retains the key
characteristics of the Prisoner’s Dilemma.
Joint maximization is still antithetical to pure
self-maximization, as for any given $P_L$, noncooperation
(playing R) always gives the
larger outcome. Attempting to achieve joint
maximization (cooperating, playing L) is still
most risky. No matter what he plays (L or R),
the player’s outcome is an increasing function
of the number of other cooperators (L players).
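
The expansion can be checked by evaluating the two equations directly. The sketch below is illustrative only: the function name `expand_outcomes` is hypothetical, and the two-person values a = 4, b = 0, c = 6, d = 2 are an assumption, chosen because they reproduce Person 1's outcomes in Matrix B of Figure 2.

```python
from fractions import Fraction

def expand_outcomes(a, b, c, d, n_players):
    """Return the player's outcomes (U_L, U_R) for every possible
    number of other players choosing L (hypothetical helper)."""
    others = n_players - 1
    table = {}
    for k in range(others, -1, -1):        # k = number of others playing L
        p_l = Fraction(k, others)          # P_L, proportion of others playing L
        u_l = a * p_l + b * (1 - p_l)      # outcome for playing left
        u_r = c * p_l + d * (1 - p_l)      # outcome for playing right
        table[k] = (u_l, u_r)
    return table

# Assumed two-person values (a, b, c, d) = (4, 0, 6, 2), expanded to three persons.
three_person = expand_outcomes(4, 0, 6, 2, n_players=3)
for k, (u_l, u_r) in three_person.items():
    print(f"{k} others play L: U_L = {u_l}, U_R = {u_r}")
```

With these assumed values, playing R pays 2 points more than playing L at every value of P_L, and both outcomes rise with the number of other players who choose L, which are the properties listed above.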

More important, however, is the fact that
the three- (or N-) person game, with economic
rationality the only consideration, should call
for an identical strategy on the part of the
player as he would have employed in the two-person
game. This may be more clear if we
consider the following relationship: By using
the specified equations, we have created a
three- (N-) person game in which each play
is equivalent to a set of two (N − 1) two-person
games on the two-person matrix from
which it was derived, but with the following
exceptions:

1. The player may make only one move (L
or R) which holds for the entire set of two
(N − 1) games, while his partner may preselect
any combination of L and R moves he
desires.

2. There is no feedback to either player as
to the other person’s moves or of either player’s
outcomes until both (all N − 1) games
are completed. All of this information is
given at the end of the set.

3. The outcome for each of the two (N − 1)
games equals the outcome from the 2 × 2
matrix divided by two (N − 1). Alternatively,
we might change the expansion equations
so that the right-hand terms are multiplied
by two (N − 1), in which case this exception
would not hold.

A.                  Person 2
                  L         R
             L      4         6
                  4         0
Person 1
             R      0         2
                  6         2

B.                      Person 2, Person 3
                  LL        LR        RL        RR
             L   4   4     2   6     6   2     4   4
                   4         2         2         0
Person 1
             R   2   2     0   4     4   0     2   2
                   6         4         4         2

FIG. 2. Illustrative expansion of 2 × 2 Prisoner’s
Dilemma to three persons. (Section A: To-be-expanded
2 × 2 matrix. Person 1’s outcome in lower
left-hand corner of each cell; Person 2’s outcome in
upper right-hand corner. Section B: Three-person
expanded matrix. Person 2’s action shown as first
letter in column heading; Person 3’s action shown
as second letter. Person 1’s outcomes shown in
center of bottom line of each cell; Person 2’s outcomes
in upper left-hand corner; Person 3’s outcomes
in upper right-hand corner.)

It should be clear that if a very large num- 
ber of these three-person games (or two- 
game “sets” of two-person games) were 
played, the strategy of the player, defined by 
the proportion of times he plays L or R, 
should closely approximate his strategy for a 
large number of two-person games on the 
same matrix. If the logical player’s strategy 
in our three- and two-person games should 
be logically identical, any difference in the 
behavior of persons playing these two games
may be attributed to the social-psychological 
effects of the number of persons playing. 
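
The equivalence claim can be made explicit with a short derivation. It assumes, following the definition of rational equivalence given earlier, that each of the N − 1 other players chooses L with the same probability; the symbols q (that probability) and K (the number of others choosing L) are introduced here only for illustration.

```latex
% K = number of the N-1 other players who choose L, so P_L = K/(N-1).
\begin{align*}
E[P_L] &= \frac{E[K]}{N-1} = \frac{(N-1)\,q}{N-1} = q, \\
E[U_L] &= a\,E[P_L] + b\,\bigl(1 - E[P_L]\bigr) = aq + b(1-q), \\
E[U_R] &= c\,E[P_L] + d\,\bigl(1 - E[P_L]\bigr) = cq + d(1-q).
\end{align*}
```

Neither expectation depends on N, so under this assumption the average outcome of each strategy is the same in the two-person and the expanded games.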


METHOD 
Subjects 


Subjects were 60 male University of Wisconsin
freshmen and sophomores who volunteered to work
as paid participants. Potential applicants were told
that the task was noneducational and that the work
would be of interest primarily to those who wanted
to earn money. Twenty-four of the subjects were
randomly assigned into 12 two-person groups, while
the remaining 36 subjects were composed into 12
three-person groups.


Game and the Setting 


The Prisoner’s Dilemma games played by the
subjects in the two treatments were defined by the
matrices shown in Figure 2. Outcomes in the
matrices represented real money, with each point
being worth 2¢.

Subjects worked in separate rooms connected to 
the experimenter’s room by an intercom system and 
containing the following task equipment:

1. An upright metal box with a signal light, two
toggle switches, and either two or four outcome indicator
lights was mounted on a small work table (see
Figure 3). The switches were labeled “left” and
“right” and constituted the subject’s means of participation
in the game. The signal light went on
whenever a new trial began and went off when the
subjects were to return their switches to a “ready”
position. The outcome indicator lights placed under
the switches were lit after all players had made
their choices and displayed the other players’ choices.
In the two-person game there were two lights; in
the three-person game there were four.

2. A game matrix was posted on the wall in front
of the subject. Each matrix was arranged with two 
columns, one for each of the subject’s choices. Each 
subject was always referred to by a color, and the 
outcome matrices were color coded. 

3. A television receiver that showed either two or 
three labeled counters displaying the accumulated 
earnings of each player was located next to the 
panel. 



Procedure


Subjects reported to the same waiting room but 
were escorted separately to their own rooms. Tape- 
recorded instructions were then played over the 
intercom. At specified points during the instructions, 
the tape was stopped, and the subjects were asked if 
they had any questions. 

The instructional procedures were designed to 
provide complete understanding of the game and a 
neutral set. The objective of understanding was 
particularly important in the three-person game. The 
difficulty of understanding the matrix under these
conditions seemed to be much greater than in the
two-person variant. Thus, at the end of the instructions
for both games, test trials were run in which
subjects were given a specified cell in the matrix
(each of which was labeled with a small black letter,
A-H), and asked to behave so as to produce that
cell as the outcome. In this manner, every one of
the cells was displayed. Subjects had to get at least
four consecutive trials correct before the experiment
proceeded. A second test was then run in which the
subjects were told the amount of money they should
try to earn. For example, the subjects were told to
flick the switch they would use if they wanted to
gain 6 points. The only way this could be achieved
was by flicking the R switch, so that any L choice
was judged a misunderstanding. Using 6 and 0 points,
several trials were run, and four consecutive trials
had to be correct by all subjects before the experiment
proceeded.

FIG. 3. Diagram of subject’s panel and wall-mounted game matrix for three-person
Prisoner’s Dilemma game. (Each subject’s outcomes on the matrix were
color coded.)

The only “motivational” part of the instructions 
was the following statement: 


The way you make money in this game is by 
making decisions. The basic decision which you 
have to make is whether to push down the left 
switch or the right switch on the box in front 
of you. What you make will depend on the 
switch you select and the switch(es) selected by 
(both of) the other player(s). 


When the light signaled a trial, each player made 
his choice. After all players had flicked one switch, 
the experimenter switched on one of the outcome 
indicator lights on each subject’s panel and an- 
nounced the choice of each player. He then an- 
nounced the earnings of each player on that trial, 
simultaneously adding the appropriate amount to 
each counter. He then shut off the signal light until 
all subjects had returned their switches to the ready
position, at which point a new trial began. A total
of 150 trials were run for each group. 


RESULTS AND DISCUSSION


The basic data clearly support the hypothe- 
sis that rates of cooperation are inversely re- 
lated to the number of people involved in the 
interaction. Figure 4 shows the average 
amount of cooperation for individuals within 
dyads and triads by 10-trial blocks. Figure 5 
shows the average amount of cooperation for 
dyads and triads as a whole—both players 
choosing left in dyads, all three playing left 
in triads—by 10-trial blocks. Figure 5 also 
contains a curve representing the cooperative 
behavior of simulated dyads from within the 
triads. These are the dyads that result if we 
randomly discard one subject from each triad 
and then analyze the behavior of the remain- 
ing pair in exactly the same way as we 
analyze the behavior of the actual dyads. The 
numbers in the figure are derived by averaging 
the behavior of all three possible dyads within 
each triad. The particular utility of these 
figures is that they represent how players in 
a two-person game play if they face the addi- 
tional contingency of each playing a third 
person simultaneously. Thus, differences be- 
tween these data and the behavior of actual 
dyads may be interpreted as the effects of 
the presence of this third party. 
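
The construction of the simulated dyads can be sketched in code. This is a minimal illustration under stated assumptions, not the authors' procedure as implemented: the function name `simulated_dyad_cooperation` and the sample choices are hypothetical, and each trial is coded as a three-letter string giving the choices of Persons 1, 2, and 3.

```python
from itertools import combinations

def simulated_dyad_cooperation(trials):
    """Average rate of joint cooperation over all three possible
    dyads within one triad (hypothetical helper).

    trials: list of strings such as "LLR", one letter per player.
    """
    pairs = list(combinations(range(3), 2))   # (0, 1), (0, 2), (1, 2)
    rates = []
    for i, j in pairs:
        # A simulated dyad cooperates on a trial when both members chose L.
        joint = sum(1 for t in trials if t[i] == "L" and t[j] == "L")
        rates.append(joint / len(trials))
    return sum(rates) / len(rates)

# Invented example block of four trials for one triad.
example_block = ["LLL", "LLR", "LRR", "RRR"]
rate = simulated_dyad_cooperation(example_block)
print(f"simulated dyad cooperation: {rate:.3f}")
```

In this invented block the triad as a whole cooperates on one trial in four, whereas the simulated dyads cooperate on one trial in three on average, since a pair can be jointly cooperative even when the third player defects.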


FIG. 4. Percentage of cooperative choices by subjects
in dyads and triads. (Curves: subjects in dyads, subjects in triads; horizontal axis: blocks of 10 trials.)


As both Figures 4 and 5 clearly indicate, 
two- and three-person groups began with only 
slightly different levels of cooperation. In the 
last 50 trials, however, differences between 
individuals in the dyads and in triads were 
substantial and significant (p < .005, one-tailed
t test). The difference between actual
and simulated dyads in level of group cooperation
was also substantial and significant
(p < .025, one-tailed t test) in the last 50
trials.

FIG. 5. Percentage of cooperative choices by dyads,
triads, and simulated dyads within the triads.

FIG. 6. Number of dyads and triads making cooperative
choices on 80% of trials in 10-trial
blocks. (Vertical axis: number of cooperative groups; horizontal axis: blocks of 10 trials.)

As Figure 6 shows, the difference between 
dyads and triads was the result of the increas- 
ing number of dyads, which became basically 
cooperative over time. Seven of the 12 dyads 
were completely cooperative for at least the 
last 30 trials, with 6 of them completely cooperative
for at least the last 60 trials. One
additional group was predominantly cooperative
during the last 30 trials, while the remaining
four groups were predominantly conflictive.³
In sharp contrast, only two of the
triads showed any pattern of established cooperation.
In one of these groups the pattern
emerged only in the last 20 trials. In the
other group it began with some 70 trials left.
The remaining triads were overwhelmingly
conflictive. Hence, almost the entire rise in
cooperative rate shown in the last 20 trials in
Figures 4 and 5 was accounted for by the
change in behavior of one triad. If all groups
that ever show a 10-trial block of complete
cooperation are classified as “cooperative,”
there is a significant difference between two-
and three-person groups in their probability
of being cooperative (p < .05, chi-square).

³ The rate of cooperation in dyads is higher than
the 30%-50% usually found in Prisoner’s Dilemma
research. From Boyle and Bonacich (1970) we find
that the matrix values used have a low “index of
cooperation,” which in their data would predict
more than 60% cooperation. We also had training
on the task more extensive than most studies, which
we feel is conducive to higher levels of cooperation.

While the data support the hypothesis that 
in the Prisoner’s Dilemma, cooperation in 
triads is markedly less than that in dyads, 
several different explanations of the results 
are possible. Perhaps the most parsimonious 
explanation is one based on individual propensities
to cooperate. This would assume that
a dyad or triad becomes cooperative when- 
ever it is exclusively composed of individuals 
who are “cooperatively oriented.” If one (or 
more) of the players is “competitively ori- 
ented,” he continues to exploit the other 
player or players until they must respond with 
self-protective noncooperation as well. Triads 
therefore may be less cooperative than dyads 
because they have a 50% greater chance of 
containing a competitively oriented player. 
An analysis of the data does not support 
this explanation. If cooperatively oriented 
subjects were randomly distributed among 
the dyads, 76% would have to be coopera- 
tively oriented for 58% of the dyads to even- 
tually become completely cooperative—the 
proportion found in this experiment. If the 
same proportion of individuals playing the 
three-person game were cooperatively oriented, 
and if they were randomly distributed among 
groups, we could expect 44% of the triads to 
eventually become completely cooperative. 
Instead, 17% of the triads were cooperative,
a significant difference from the expected value
(p < .05, chi-square).

TABLE 1
TRANSITION MATRIX FOR DYADS: TRIAL n + 1

Trial n      LL      LR      RL      RR       n
LL         87.5      DA     5.0     1.9     818
LR         19.0    18.5    28.1    34.4     221
RL         12.7    25.5    22.3    39.4     251
RR          6.2    15.3    18.7    59.8     498

Note. L = left, R = right. Data are in percentages.

In sum, these data lead us to reject an ex- 
planation of the difference in cooperativeness 
between two- and three-person groups based 
solely on the probability of finding three co- 
operative individuals, Instead, we must con- 
clude that some characteristic of three-person 
interaction inhibits the emergence of coopera- 
tion. 
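
The expected values in this argument follow from simple probability. The sketch below assumes, as the rejected explanation does, that a group becomes cooperative only when every member is cooperatively oriented and that members are drawn independently; the variable names are illustrative.

```python
import math

dyads_cooperative = 0.58            # proportion of dyads that became completely cooperative
# If a dyad cooperates only when both members are cooperatively oriented,
# the proportion p of such individuals satisfies p**2 = 0.58.
p = math.sqrt(dyads_cooperative)

expected_triads = p ** 3            # all three members must be cooperatively oriented
observed_triads = 2 / 12            # two of the 12 triads established cooperation

print(f"implied proportion cooperatively oriented: {p:.2f}")
print(f"expected cooperative triads: {expected_triads:.2f}")
print(f"observed cooperative triads: {observed_triads:.2f}")
```

The printed values (0.76, 0.44, and 0.17) correspond to the 76%, 44%, and 17% cited in the text.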

Let us consider, then, the original “reac- 
tion to defection” explanation which pre- 
dicted that an individual player would react 
to defection by one of the other players in a 
triad the same way as he would react to a 
simultaneous defection by both other players, 
thus making the cycle of defection and non- 
cooperation more likely to occur in the triad. 
This assertion, and other inferences of this 
explanation, may be tested by reference to the 
transition matrices for the dyads and triads 
(see Tables 1 and 2). In terms of the transi- 
tion matrix for triads, the explanation pre- 
dicts that subjects who are cooperative at 
Time 1 would be equally likely to switch to 
noncooperation at Time 2 if either one or both 
of their partners had defected at Time 1. The 
data do not fully support this assertion. Following
a defection by both partners, the cooperative
subject switched to noncooperation
on the next trial 55% of the time. (To calculate
this figure from Table 2, several cells must
be combined and averaged.) Following a defection
by one partner, the subject switched to
noncooperation 42% of the time. These data
do not wholly refute the reaction-to-defection
hypothesis, however, as an economic model
might predict twice as much switching to noncooperation
when two partners defect, and the
difference between the two figures is much
less than that.

TABLE 2
TRANSITION MATRIX FOR TRIADS: TRIAL n + 1

Trial n     LLL     LLR     LRL     RLL     LRR     RLR     RRL     RRR       n
LLL        88.1     2.3     3.9     3.1      .7      .7      .7       0     127
LLR         5.3    35.3     1.6     2.6    15.9    23.8     3.5    11.5     113
LRL         7.8     2.6    18.4    10.5    15.7     7.8    18.4    18.4      38
RLL         4.0     9.4     9.4    24.3     2.7    12.1    21.6    16.2      74
LRR          .9    11.2     2.3     3.2    33.8    13.6     7.5    27.2     213
RLR          .3     8.3     1.1     6.3     8.6    32.0     9.4    33.5     253
RRL           0     1.9     1.2     7.7     5.1    12.9    29.0    41.9     155
RRR          .2     1.7      .7     1.4    10.1    10.0     5.0    70.5     815

Note. L = left, R = right. Data are in percentages.

The reaction-to-defection explanation is 
further undermined, however, by the data for 
the dyads. This transition matrix shows that 
cooperative subjects were even more likely to 
respond with noncooperation to defection by 
their partner than was a player in a triad 
faced with two defecting partners. The differ- 
ence is only 9%, with subjects in dyads re- 
sponding noncooperatively 64% of the time. 
The direction of this difference, however, 
makes it difficult to support an explanation 
that specifies that defection has a greater sup- 
pressive effect on cooperation in triads. 
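
The switching rates quoted above can be recomputed from the transition matrices. The sketch below is one plausible reading of “combined and averaged,” namely an average over every cooperating player weighted by the row frequency n. The weighting scheme, the helper names `row` and `switch_rate`, and the restored decimal points in the Table 2 rows are assumptions.

```python
TRIAD_STATES = ["LLL", "LLR", "LRL", "RLL", "LRR", "RLR", "RRL", "RRR"]
DYAD_STATES = ["LL", "LR", "RL", "RR"]

def row(states, values, n):
    """Pair next-trial percentages with their column labels."""
    return dict(zip(states, values)), n

def switch_rate(rows, n_defectors):
    """Percentage of times a player who chose L on trial n chose R on
    trial n + 1, given that exactly n_defectors partners chose R on
    trial n. Averages over players, weighted by row frequency."""
    total, weight = 0.0, 0
    for state, (cells, n) in rows.items():
        if state.count("R") != n_defectors:
            continue
        for i, choice in enumerate(state):
            if choice != "L":
                continue
            # Sum the next-trial cells in which player i plays R.
            to_r = sum(pct for nxt, pct in cells.items() if nxt[i] == "R")
            total += n * to_r
            weight += n
    return total / weight

# Rows transcribed from Tables 1 and 2 (trial-n states with at least one
# cooperator and one defector). The 8.3 (RLR row) and 7.7 (RRL row) cells
# are inferred from row totals; neither is used in the sums below.
triads = {
    "LLR": row(TRIAD_STATES, [5.3, 35.3, 1.6, 2.6, 15.9, 23.8, 3.5, 11.5], 113),
    "LRL": row(TRIAD_STATES, [7.8, 2.6, 18.4, 10.5, 15.7, 7.8, 18.4, 18.4], 38),
    "RLL": row(TRIAD_STATES, [4.0, 9.4, 9.4, 24.3, 2.7, 12.1, 21.6, 16.2], 74),
    "LRR": row(TRIAD_STATES, [0.9, 11.2, 2.3, 3.2, 33.8, 13.6, 7.5, 27.2], 213),
    "RLR": row(TRIAD_STATES, [0.3, 8.3, 1.1, 6.3, 8.6, 32.0, 9.4, 33.5], 253),
    "RRL": row(TRIAD_STATES, [0.0, 1.9, 1.2, 7.7, 5.1, 12.9, 29.0, 41.9], 155),
}
dyads = {
    "LR": row(DYAD_STATES, [19.0, 18.5, 28.1, 34.4], 221),
    "RL": row(DYAD_STATES, [12.7, 25.5, 22.3, 39.4], 251),
}

print(round(switch_rate(triads, 2)))   # triad, both partners defected
print(round(switch_rate(triads, 1)))   # triad, one partner defected
print(round(switch_rate(dyads, 1)))    # dyad, partner defected
```

Under these assumptions the three printed values are 55, 42, and 64, matching the percentages reported in the text, including the 9-point gap between dyads and triads faced with two defecting partners.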

Since the data do not support either our 
original explanation or the individual-based 
alternative, we are compelled to consider a 
third possibility. The one which seems most 
compatible with the data centers on the in- 
hibiting effect of any defection for the devel- 
opment of cooperation. The impetus for con- 
sidering such a proposition is apparent from 
the graphs in Figures 4 and 5, both of which 
clearly suggest that the difference between 
dyads and triads lies in the increasing coop- 
eration in dyads as compared with triads. 

The transition matrices reveal that in dyads 
a noncooperative subject whose partner coop- 
erated himself switched to cooperation on the 
next trial 42% of the time. In triads, by con- 
trast, a noncooperative player faced with one 
cooperative partner switched to cooperation 
only 19% of the time. In fact, if faced with 
two cooperative partners, the player himself 
cooperated on the next trial only 20% of the 
time. In short, cooperative gestures toward a 
noncooperative partner in the three-person 
game had much less chance of eliciting coop- 
eration in return. 


Beyond this there was a greater chance in 
the dyad that an individual player would try 
to initiate cooperation after a jointly non- 
cooperative trial. One or both players switched 
to cooperation 40% of the time after joint
defection in the dyad, while randomly se- 
lected pairs of players from the triads did so 
only 24% of the time. Again, there was a 
great advantage toward establishing coopera- 
tion in the dyads, and the two advantages 
reinforced one another. In a dyad it was both 
more likely that the player would be faced 
with a situation in which his partner indicated 
a desire to cooperate, and more likely that he 
would reciprocate with a cooperative act of 
his own. In the triad, on the other hand, a 
noncooperative subject who had one coopera- 
tive partner was still faced with at least one 
noncooperative partner on the next trial 94% 
of the time. Even when both partners were 
cooperative on Trial 1, one or both defected
on the next trial 66% of the time. It is per- 
haps at this point that the effect of the num- 
ber of persons in inhibiting cooperation comes 
most strongly into play. 

Taken together, these data from the transi- 
tion matrices suggest that the lower rate of 
cooperation within the triad is primarily the 
result of the difficulty of establishing mu- 
tually reinforcing relationships rather than 
the destruction of these relationships through 
defection. Difficulty in establishing trust re- 
sides in the necessity for coordinating coop- 
erative responses from more players in a set- 
ting where any defections are costly. In this 
sense, the comparison between two- and three- 
person groups might not be a good indicator 
of the effects of adding more players. In a 
four-person game, for example, the defection
of only one player would not be costly enough
to make cooperation nonprofitable for the remaining
three. Cooperative subgroups might
therefore emerge with concomitantly high 
rates of individual cooperation. 


EFFECT OF FEELING GOOD ON HELPING:
COOKIES AND KINDNESS ¹


ALICE M. ISEN ²
Franklin and Marshall College

AND

PAULA F. LEVIN ³
Swarthmore College


Two experiments with adult subjects investigated the effects of a person’s 
positive affective state on his or her subsequent helpfulness to others. “Feeling
good” was induced by having received cookies while studying in a library 
(Study I) and by having found a dime in the coin return of a public tele- 
phone while making a call (Study II). In Study I, where the dependent 
measure involved volunteering in reply to a student’s request, a distinction 
was made between specific willingness to help and general willingness to engage 
in any subsequent activity. In Study II, the dependent measure was whether 
subjects spontaneously helped to pick up papers that were dropped in front 
of them. On the basis of previous research, it was predicted that subjects who 
were thus made to “feel good” would be more helpful than control subjects.
Results support the predictions.


Recent investigations of determinants of 
helping have begun to focus on the role of 
mood state in producing differences in help- 
fulness. The first studies that indicated the 
relevance of a potential helper’s internal af- 
fective state used reports of success and fail- 
ure at a task as their independent variable. 
A study by Berkowitz and Connor (1966) 
indicated a relationship between success and 
helping, when the beneficiary was dependent 
on the subject. A later study (Isen, 1970) 
also indicated a link between success and 
helping, where there was no relationship be- 
tween the people involved and where helping 
was a low-cost, naturalistic, behavioral mea- 
sure. It was postulated that in just such a 
situation (i.e., nonsolicited, low-cost helping), 
an important determinant of helpfulness may
be the potential helper’s positive affective 
state or “warm glow of success” (Isen, 1970). 
In addition, even though their success/failure
manipulation was not aimed specifically at 
affecting internal mood state, Berkowitz and 
Connor (1966) also made reference to a “glow 
of goodwill” in their discussion. In both of 
these studies which suggested that “feeling 
good” may be a determinant of helping, posi- 
tive affective state was induced via a report 
of success. However, report of success may 
not be an entirely satisfactory way of ma- 
nipulating mood, since induced affective 
state may be confounded with estimates of 
competence. 

Several recent studies have indicated that 
manipulation of affective state in ways other 
than via success/failure also results in dif- 
ferential helping, thus lending credibility to 
the hypothesis that a relationship between 
feeling good and helpfulness does exist. Two 
naturalistic experiments seem to indicate that 
good feeling aroused through positive verbal 
contact results in increased aid, both solicited 
(Berkowitz & Macaulay, as cited in Ader- 
man, 1971) and nonsolicited (Isen, Becker, & 
Fairchild).⁴

⁴ A. M. Isen, D. M. Becker, and P. Fairchild.
Effects of nature of verbal contact on helping.
Manuscript in preparation.

Studies by Aderman (1971) and Aderman
and Berkowitz (1970), while conducted in an
experimental setting, manipulated mood state
in several novel ways. In the Aderman and 
Berkowitz study, the subject’s mood state was 
varied by having him observe one of several 
interactions between two male college stu- 
dents, one who needed aid and one who was 
a potential helper. The experimental condition 
varied according to the person with whom 
the subject was instructed to empathize, the 
helping response of the second person (helped 
or did not help), and the reaction of the 
helped person (thanked the helper or did not). 
The subject then filled out a mood question- 
naire and finally was given an opportunity to 
comply with the experimenter’s request for 
help. The results of the experiment, though 
complex, tended to support the idea that feel- 
ing good can be related to increased helping 
under some circumstances (empathy with the 
thanked helper), while feeling bad can be 
associated with increased helping under other 
circumstances (empathy with the nonhelped 
person in need). 

In the study by Aderman (1971), elation 
or depression was induced in subjects by 
having them read sets of mood statements. 
Aderman found that following the reading of 
the cards, subjects in the elation condition 
wrote more numbers for the experimenter, 
when this task was presented as a favor 
rather than a requirement of the experiment. 
In addition, elation subjects volunteered more 
often for a future experiment. Such findings 
do lend credence to the “glow” hypotheses, 
yet one complexity of the findings is that the 
“help” was solicited. 

A further question remaining is whether 
success, or good mood, leads specifically to 
helping or, more generally, to increased activ- 
ity and/or productivity. In other words, does 
the good feeling lead to an increased desire 
to do something nice for someone else, or 
would subjects who have been made to feel 
good, as opposed to those who have not, 
engage in more of any subsequent activity?

Using a 2 × 2 design, we performed an
experiment that attempted to answer the two 
questions posed above: First, whether feeling 
good leads to increased helping; second, 
whether, following the induction of good feel- 
ing, the response to an opportunity to help 
differs from the response to an opportunity to
engage in some other activity. We predicted 
an interaction between the two independent 
variables such that those subjects who were 
feeling good would subsequently be more will- 
ing to help but less willing to hinder (dis- 
tract) than those not made to feel good. 


STUDY I
Method 


Subjects. The study, which spanned five sessions, 
was conducted in the libraries of a university and 
two colleges in the Philadelphia area. Fifty-two male
college students who were studying in individual 
carrels served as subjects. 

Procedure. At the beginning of the session, a co- 
ordinator randomly assigned rows of carrels to the
feeling good or to the neutral condition. The assign- 
ment to condition was based on rows, rather than 
on individuals, in order to insure that subjects would 
be unaware that two conditions existed. 

To induce good mood, confederates distributed 
cookies along the rows that had been assigned to 
the feeling good condition, while they merely walked 
by the rows in the neutral condition. This task was 
performed by a male and female pair of confederates 
in two of the sessions and by a female confederate 
in the remaining three sessions. 

The coordinator also randomly divided subjects in 
each condition into “help” or “hinder” groups. The 
experimenter was told this assignment of “help” and 
“distract” subjects, but was kept unaware of
whether a particular row was “cookie” or “no
cookie.” Similarly, the coordinator was careful to
withhold from the confederates information as to 
help or distract condition of the subjects. 

A few minutes after the confederates returned 
following the distribution or nondistribution of 
cookies, the experimenter approached each subject 
individually and asked if, and for how many 20- 
minute sessions, the subject would serve as a con- 
federate in a psychology experiment. In the help 
condition, the purpose of the experiment was given 
as an investigation of creativity in students at 
examination times, as opposed to other times during 
the year. The confederate was needed in this experi- 
ment to act as helper to subjects who would be 
attempting to conceive of novel uses for ordinary 
items. The confederate’s aid, which involved holding 
and manipulating the items, was described as “some- 
thing which the subjects usually found very helpful 
to them.” In the distract condition, the job of the 
confederate was described as a distracter of a ran- 
domly chosen, unwitting student who happened to 
be studying in the library. As distracter, he would 
stand near the subject and drop books, make noises, 
rattle papers, all while the experimenter unobtru- 
sively recorded the subject’s reactions. The purpose 
of such an experiment was given as an investigation 
of distractibility of students at examination time as 
opposed to other times during the year. In addition,
the experimenter cautioned each subject in the dis- 
tract condition by saying, “I think it only fair to tell 
you before you decide to act as distracter, that the 
subjects find the distraction to be an unpleasant an-
noyance.” Thus, in the help condition the role that
the subject was invited to play was clearly that of 
a helper, one appreciated greatly by the creativity 
subjects; in contrast, the role that a confederate 
would play in the distract condition was clearly 
described as that of an annoying distracter of 
unsuspecting students studying in the library. 

A debriefing and discussion period followed each 
subject’s reply. Subjects’ reports indicated that the 
independent and dependent manipulations were plau- 
sible, and that they had not been associated in the 
subjects’ minds prior to the debriefing. 


Results 


Since the five sessions yielded comparable 
results, the data were combined. Table 1 
shows the proportion of subjects volunteering 
in each condition and the means and vari-
ances of number of minutes volunteered. A t
test for proportions was performed on the 
number of subjects volunteering in each con- 
dition. This test revealed the predicted inter- 
action between receiving a cookie or not and 
volunteering to help or to distract. That is, 
subjects receiving cookies volunteered to help 
more, but to distract less, than those not 
receiving cookies (t = 1.96, p < .05). 
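The interaction tested here can be read as a difference of differences among the four proportions in Table 1. The following is a minimal sketch of that idea, assuming an unpooled normal-approximation standard error; the source does not state which formula was used, so the function name and the variance estimate are illustrative assumptions.

```python
from math import sqrt

# (number volunteering, group size) for each cell, as reported in Table 1.
cells = {
    ("cookie", "help"): (9, 13),
    ("cookie", "distract"): (4, 13),
    ("no_cookie", "help"): (6, 12),
    ("no_cookie", "distract"): (9, 14),
}


def interaction_z(cells):
    """Statistic for the difference-of-differences contrast on proportions.

    Assumption: each cell contributes p(1 - p)/n to the variance of the
    contrast (unpooled estimate). The original analysis may have differed.
    """
    p = {key: x / n for key, (x, n) in cells.items()}
    contrast = (p[("cookie", "help")] - p[("cookie", "distract")]) - (
        p[("no_cookie", "help")] - p[("no_cookie", "distract")]
    )
    se = sqrt(sum(p[key] * (1 - p[key]) / n for key, (_, n) in cells.items()))
    return contrast / se


z = interaction_z(cells)
print(round(z, 2))
```

Under these assumptions the statistic comes out at roughly 2, in the neighborhood of the reported value; a different variance estimate would shift it somewhat.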

For the data on number of minutes volun- 
teered in each condition, an F test of the 
variances indicated that the data did not meet 
the assumptions of the analysis of variance 
(F = 20.69, df = 11/13, p < .01). A two-way 
analysis of variance was performed, however, 
on the transformed scores, which did meet the 
assumptions of the test. This analysis indi- 


TABLE 1 


Stupy I: MEANS AND VARIANCES OF AMOUNT OF 
Time (IN Minutes) VoLUNTEERED AND PRO- 
PORTION OF SUBJECTS VOLUNTEERING 
IN EACH CONDITION 


Condition Help Distract 
Cookie 
M 69.00 20.00 
is 6923.08 400.00 
{69 (9/13) 31 (4/13) 
kie 
Spee 16.70 78.60 
$ 563.64 11659.34 
P .50 (6/12) 64 (9/14) 
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cated no main effect, but did show the pre- 
dicted significant interaction (Fa < 1, Fs < 1, 
Fas = 7.71, p< .01), with cookie subjects 
volunteering more time to help, but less to 
distract, than no-cookie subjects. 
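The F value reported for the test of variances appears to match the ratio of the largest to the smallest cell variance in Table 1. That reading is an inference from the tabled numbers, not something the text states. A minimal sketch of the check, with hypothetical names:

```python
# Cell variances of minutes volunteered, as reported in Table 1.
variances = {
    ("cookie", "help"): 6923.08,
    ("cookie", "distract"): 400.00,
    ("no_cookie", "help"): 563.64,
    ("no_cookie", "distract"): 11659.34,
}


def variance_ratio(larger, smaller):
    """Ratio of two sample variances, larger over smaller."""
    if smaller <= 0:
        raise ValueError("variances must be positive")
    return larger / smaller


# Assumption: the reported F compares the two no-cookie cells
# (n = 14 and n = 12, hence 13 and 11 degrees of freedom).
f_ratio = variance_ratio(
    variances[("no_cookie", "distract")],
    variances[("no_cookie", "help")],
)
print(round(f_ratio, 2))
```

A ratio this large is the reason the raw minutes could not be analyzed directly and a transformation was applied first.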


Discussion 


The results of this experiment indicate 
that in terms of both number of subjects 
volunteering and amount of time volunteered, 
subjects who have unexpectedly received 
cookies help more, but distract less, than do 
those who have not received cookies. Thus, 
feeling good, induced naturalistically and in a 
way other than via report of success, seems 
to lead to increased helping, and to helping 
specifically, rather than to general activity. 

Although this finding provides evidence for 
the “warm glow” hypothesis—people who feel 
good themselves are more likely to help 
others—an alternative interpretation is pos- 
sible. Following from a modeling or a norma- 
tive explanation, cookie subjects might have 
been more helpful simply because they had 
just been exposed to a helpful model (the 
person passing out the cookies) who may have 
reminded them of norms of kindness to others. 
Furthermore, a few aspects of the dependent 
measure complicate the warm glow interpreta- 
tion. Although the independent manipulation 
was more naturalistic than that found in many 
experiments, the dependent measure was one
of solicited helpfulness. In addition, help
was only volunteered, rather than actually
performed. 

Thus, a second study was conducted to 
determine whether nonsolicited, low-cost help- 
fulness increases following the induction of 
good feeling, without the good mood being 
directly brought about by another person.
The question was, Does feeling good lead
to increased helping, even if there is no 
helpful model? In the “dime” study, which 
is directed at this question and which is pre- 
sented below, good feeling was induced in a 
subject by the discovery of an unexpected 
dime in the coin return slot of a pay tele- 
phone. The dependent measure was that of 
helping a young woman pick up papers which
she had just dropped. 


STUDY II
Method 


Subjects. Subjects were 24 female and 17 male 
adults who made calls from designated public tele- 
phones located in enclosed shopping malls in sub- 
urban San Francisco and Philadelphia. Excluded from 
the subject pool were those shoppers who were not 
alone and those who were carrying packages. 

Procedure. Telephone booths were “set up” in the
following manner. The experimenter made an in-
complete call, ostensibly took her dime from the
return slot, and left the booth. In actuality, the
dime was left in the coin return slot for a randomly 
selected half of these trials. Thus, subjects using 
such telephones received an unexpected 10¢ when they 
checked the coin return before, during, or upon 
completion of their calls; these subjects constituted 
the experimental group. The control group was made 
up of individuals who used a telephone that had 
not been “stocked” with a dime and who therefore 
did not receive unexpected money. 

The experimenter set up the experimental and 
control telephones without informing the confederate 
as to condition. This was done in order to eliminate 
any possible systematic bias in the confederate’s per- 
formance of the paper dropping. The experimenter 
also checked to make sure that all subjects did look 
in the coin return slot. Only a few subjects failed 
to meet this requirement, and these were not in- 
cluded in the data analysis. This was done in order 
to avoid ultimately obtaining a sample of subjects 
which was inadvertently selected for attention. For 
this reason, no subject who was at an “experimental” 
condition telephone and simply failed to see his dime 
was included in the control group. 

During the call, the confederate was able to ob- 
serve the outline of the subject unobtrusively by 
pretending to “window shop,” while actually watch- 
ing the subject’s reflection in one of the store win- 
dows. The aim of this surveillance was simply to 
know when the subject was leaving the telephone. 
When the subject did leave, the confederate started 
in the same direction as the subject and, while 
walking slightly ahead and to the side of him or 
her, dropped a manila folder full of papers in the 
subject’s path. The dependent measure was whether 
the subject helped the female confederate pick up
the papers.


TABLE 2

STUDY II: NUMBER OF PEOPLE HELPING
IN EACH CONDITION

                   Females                 Males
Condition     Helped   Did not help   Helped   Did not help
Dime             8          0            6          2
No dime          0         16            1          8
Results


Table 2 shows the number of males and 
females helping in each condition. A Fisher 
exact test on the data of the females indicated 
a significant relationship between getting a 
dime and helping (p < .005). A similar find-
ing (p = .025) was obtained for the males.
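A Fisher exact test of this kind can be sketched directly from the hypergeometric distribution. The female counts below follow the text (all 8 dime subjects helped, none of the 16 controls did). The male counts are an assumption reconstructed from the statements that 17 males took part, that two experimental males failed to help, and that one control male helped. The sketch computes a one-tailed probability and is meant only to show the computation, not to reproduce the reported values.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact probability for the 2 x 2 table

        [[a, b],
         [c, d]]

    i.e. the probability, with all margins fixed, of a top-left count
    at least as large as the one observed.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    total = comb(n, row1)
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, upper + 1)
    )
    return tail / total


# Rows: dime / no dime. Columns: helped / did not help.
p_females = fisher_one_tailed(8, 0, 0, 16)
p_males = fisher_one_tailed(6, 2, 1, 8)  # cell counts assumed, see above
print(p_females, p_males)
```

With margins this lopsided, the female table is the single most extreme arrangement possible, which is why its probability is so small.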


Discussion 


These results indicate that differential un- 
solicited helping occurs even when good mood 
is induced in an impersonal manner. The
finding appears to be less pronounced for 
males than for females, but the smaller num- 
ber of male subjects may be responsible in 
part for this apparent difference. Because our 
society has specific norms applying to this 
particular helping situation for males, one 
might have expected the behavior of the 
males, more than that of females, to reflect 
not only the independent manipulation but 
also these norms for courtesy. The data show 
that while no females in the control condition 
helped, one male in the same condition did 
help. However, it must also be noted that 
two males in the experimental condition failed 
to help, while no female experimental sub- 
jects failed to help. Thus, while it is true 
that the behavior of the males may be more 
complex than that of the females, the simple 
courtesy expectation is not supported. 

The results of the two studies taken to- 
gether provide support for the notion that 
feeling good leads to helping. Because feeling 
good has been generated in a variety of ways 
and settings, and since the type of helping 
measure and the source of the subject popula- 
tions have also varied, this relationship seems 
to have some empirical generality. We recog- 
nize, however, that the question of why feel- 
ing good leads to helping, or more properly, 
what mediates the relationship between the 
two, remains to be answered. Moreover, such 
an answer may provide some insights into the 
more general and important issue of how the 
observed determinants of helping, such as 
success, feeling good, feeling bad in some 
circumstances, guilt, verbal contact, and the 
presence or absence of other people, relate to 
one another. That is, while these states or 
events may seem unrelated as determinants of
helping, they may have some common aspects 
in that capacity. If so, then the determina- 
tion of helping may be more parsimoniously 
understood in terms of broader concepts, such 
as maintenance of positive affective state, 
perception of costs and rewards, or both, and 
this possibility is now under investigation. 
TEMPORAL SITUS OF THE EFFECTS OF ANTICIPATED
PUBLICITY UPON COMMITMENT AND RESISTANCE 
TO COUNTERCOMMUNICATION 


MICHAEL F. HOYT AND RICHARD CENTERS 1


University of California, Los Angeles 


In opposition to the rehearsal after commitment hypothesis previously offered 
by Hovland, Campbell, and Brock, it was hypothesized that the greater 
resistance to countercommunication reported by those authors to exist amongst 
publicly committed subjects, as distinguished from those whose commitment 
was made anonymously, might merely be an artifact resulting from greater 
moderateness of opinion initially displayed by publicly committed subjects. 
An experimental test of an opposing anticipatory moderation before commit- 
ment hypothesis, involving 132 subjects divided into four groups who received 
varying public and anonymous commitment treatments, found the anticipatory 
moderation before commitment to be demonstrably superior to the rehearsal
after commitment interpretation.


An examination of the social-psychological 
literature reveals an enduring and variegated 
concern with the effects of public versus pri- 
vate commitment upon subsequent attitude 
change.2 Lund (1925), in discussing his law
of primacy in persuasion, suggested the possi- 
bility that the “desire to be consistent” may 
have been an explanation for his finding that 
after reading an argument and having once 
committed themselves via a questionnaire 
response, subjects were relatively little influ- 
enced by subsequent counterargument. Lund 
speculated that “Once we have committed 
ourselves [after hearing only one side of an 
issue] we frequently dare not change our po- 
sition lest we be challenged with our former 
statements [p. 190].” In a related study, 


1 The authors wish to acknowledge their gratitude 
to Helen R. Siporin for her valuable suggestions. 
Additional thanks are due to Richard Lance 
Christie, David James, and Sandra Lanto, who 
served as essay raters for the experiment to be 
reported here. 

Requests for reprints should be sent to Michael 
F. Hoyt, Department of Psychology, Yale Univer-
sity, 333 Cedar Street, New Haven, Connecticut
06510. 

2 Sherif and Hovland (1961) distinguish two types 
of involvement or commitment, the first being a task 
involvement induced in the subject by the experi- 
mental instructions and the consequences of his 
performance, and the second being a more intense 
ego involvement “aroused by the stimulus item 
related to his stand on an issue . . . [p. 197].” It is 
commitment in both these senses, but particularly 
in the latter, that is the concern of this paper. 


Hovland and Mandell (1957) failed to find 
this resistance to attitude change under condi- 
tions of an anonymous (private) commitment. 
To test their hypothesis that with a stronger, 
more public form of commitment an indi- 
vidual required to state his position publicly 
after being exposed to a communication 
would tend to retain this position in the face 
of a subsequent countercommunication, Hov-
land, Campbell, and Brock (1957) presented


successively two communications on opposite sides 
of the same issue, separated in time by the experi- 
mental conditions of commitment or noncommit-
ment. The topic of the communication was “The
reduction of the legal voting age to 18 years—the 
current draft age.” One communication was strongly 
in favor of the change and the other strongly in 
favor of retaining the present voting age of twenty-
one years.

The communications were presented to four groups, 
two experimental and two control, in a balanced 
order. . . .

Opinions on the issue discussed were measured one 
week before the experimental session, and twice 
during the experiment. One of the latter measures 
was taken after the first communication and one 
after the second [pp. 24-25]. 


The opinion questionnaire given 1 week 
before the experimental session was used to 
insure that each experimental and control 
group would be assigned approximately the 
same number of subjects who were initially 
for, and initially against, the proposed change. 
The first opinion measure was the “commit-
ment,” or control, activity. In the public com-
mitment groups, the subjects were asked to
write a short paragraph presenting their own 
frank opinions on the issue and were told that 
their opinions would be printed in full in the 
next issue of their school newspaper, and they 
were further cautioned to sign their names 
exactly as they should appear under their 
statements. (In the earlier of the two essen- 
tially identical studies, a somewhat weaker 
public commitment manipulation was em- 
ployed: the subjects were told that their state- 
ments would appear in a “Public Affairs 
Pamphlet.”) The control groups were simi- 
larly asked to prepare a paragraph presenting 
their opinions, and were told that their papers 
would be kept anonymous and were explicitly 
instructed not to sign their names.

The paragraph opinion measures were 
quantified by having three judges assign each 
paragraph a rating based on a 5-point Likert- 
type scale, the same 5-point scale being used 
in


the final opinion measure . . . , a second questionnaire
administered immediately after the second communi-
cation. This questionnaire included the . . . item on
voting age used in the first questionnaire as well as
a number of unrelated questions [Hovland et al.,
1957, p. 27].


Amount of opinion change was thus defined 
as the difference between the paragraph and 
questionnaire scores. 

The obtained data indicated that subjects 
who made a public declaration of their posi- 
tion were less likely to change their opinion 
in the face of counterpropaganda than were 
those subjects who merely stated their posi- 
tions anonymously. 

Hovland et al. (1957) interpreted these 
results as meaning that the effect of public 
commitment is to fix the opinions of the indi- 
vidual so that he is relatively resistant to 
change. They argued that an individual, an-
ticipating socially mediated rewards, “prob-
ably rehearses his position in preparation for
the imminent publicity in such a way that he 
becomes resistant to countercommunication 
[pp. 31-32].” In short, they contended that 
since “commitment makes changing the be- 
lief dangerous, costly, awkward, or at least 
harmful to self-esteem [McGuire, 1964, p. 
194],” having made a statement, an indi- 
vidual desires to appear consistent in the 
public eye, rehearses his self-proclaimed posi-
tion, and thus resists change.


An Alternate Interpretation


Although they correctly emphasize the im- 
portance of subjects considering social conse- 
quences, Hovland et al. seem to have, in the 
present writers’ view, temporally misplaced 
the occurrence of this consideration. Rather 
than making a statement and then preparing 
to defend it against the onslaughts of social
criticism, it seems more likely that a subject 
would weigh the possible consequences of his 
statement before committing himself publicly 
and would thus tend to make more moderate 
and less vulnerable statements in the first 
place. This is somewhat similar to the hy- 
pothesis advanced by Deutsch and Gerard 
(1955) in their study of the phenomena of 
conformity in judgment of line length, where- 
in they suggested that individuals may have 
some motivation to conform to the expecta- 
tions of others, such as the desire to ingrati- 
ate themselves and/or to avoid ridicule. This 
motivation, they hypothesized, would be re- 
duced when the individual perceived that his 
judgment could not be identified. 

If anticipatory moderation within the Hov- 
land et al. research is, in fact, the case, then 
their finding that subjects in the public com- 
mitment condition were more resistant to 
change might have been merely an artifact 
produced by greater moderation in the public 
condition. That is, publicly committed sub- 
jects may change less simply because they 
initially confine themselves to statements 
which they are very certain that they are able 
to defend and justify, thus leaving little room 
for change. 

Several investigations, such as those of 
Deaux (1968), Deutsch, Krauss, and Rose- 
nau (1962), McGuire (1961), McGuire and 
Millman (1965), McGuire and Papageorgis 
(1962), Papageorgis (1967), and Sears, 
Freedman, and O’Connor (1964), have of- 
fered evidence of anticipatory opinion change. 
Evidence that rehearsal may serve as a cog- 
nitive bolstering mechanism operating to in- 
crease resistance has also been forwarded by 
several writers, including McGuire (1964), 
Gerard (1964), and Freedman and Sears 
(1965). It thus appears that there are some 
conditions—including, the present writers sug-
gest, those of Hovland et al. (1957)—under
which an anticipatory moderation explanation 
best accounts for the observed resistance, and 
there are other situations in which resistance 
is better explained in terms of rehearsal. 

In addition to our criticisms of the inter- 
pretation given by Hovland et al. for their 
findings, it is the present writers’ opinion 
that a methodological flaw also confounds 
their findings. Equal numbers of subjects 
initially for and initially against the lower- 
ing of the voting age were assigned to each 
treatment group in the Hovland et al. study. 
These subjects were presented pro or con 
communications, stated their own opinions, 
and then received second pro or con com- 
munications in a balanced sequence. Thus, all 
subjects who initially heard pro-issue argu- 
ments later heard anti-issue arguments, and 
vice versa. This procedure was followed re- 
gardless of the nature of the subjects’ own 
stated opinion. The consequences of this 
method are obvious: some subjects initially 
heard a pro argument, stated a similarly pro 
opinion, and then heard an oppositional 
counterargument advocating an abandonment 
or lowering of their initially stated belief; 
other subjects heard a pro argument, stated 
an anti-issue opinion, and then heard an anti- 
issue second communication that, from their 
perspective, was congruent with and advo-
cated a strengthening of their initially stated
opinion. A similar directional bifurcation of
advocacy existed, of course, in the conditions 
in which the involved subjects received the 
communications in the negative—affirmative 
counterbalanced sequence. In short, for some 
subjects, the countercommunications pre- 
sented by Hovland et al. constituted an at- 
tack on those persons’ stated beliefs, while 
for others these second communications func- 
tioned as an affirmation or reinforcement of 
previously stated views. One might expect 
that subjects receiving opposition to their 
initially extreme opinion would change, 
whereas subjects receiving confirmation for 
their extreme opinion would retain their posi- 
tion. 
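The confound described above can be made concrete with a small sketch. Because the second communication always took the side opposite the first, whether it attacked or affirmed a subject's stated opinion depended only on the side that subject happened to take. The function and argument names are hypothetical, and coding opinions as signed scores (positive for pro, negative for anti, zero for neutral) is an assumption made for illustration.

```python
def second_message_role(first_message, stated_opinion):
    """Classify the second communication relative to the subject's stance.

    first_message  -- "pro" or "anti": side argued by the first communication
    stated_opinion -- signed score: > 0 favors the issue, < 0 opposes it
    """
    if first_message not in ("pro", "anti"):
        raise ValueError("first_message must be 'pro' or 'anti'")
    # Balanced order: the second communication argues the opposite side.
    second_message = "anti" if first_message == "pro" else "pro"
    if stated_opinion == 0:
        return "neutral"
    stance = "pro" if stated_opinion > 0 else "anti"
    return "congruent" if stance == second_message else "oppositional"


for first in ("pro", "anti"):
    for opinion in (2, -2):
        print(first, opinion, second_message_role(first, opinion))
```

Two of the four combinations come out congruent, which is the sense in which the "countercommunication" functioned as a reinforcement for some subjects.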

The purpose of the present study was to 
replicate and extend the Hovland et al. pro-
cedure in order to determine which explana-
tion, rehearsal or anticipatory moderation,
best accounts for the Hovland et al. finding 
that publicly committed subjects changed 
their opinions less than did anonymously 
committed ones, and also to examine the pos- 
sible effects of subjects receiving oppositional 
versus affirmative second communications. 


METHOD 
Subjects 


One hundred and thirty-two undergraduates en- 
rolled in introductory psychology courses at the 
University of California, Los Angeles, were engaged 
as subjects in what was represented to them as a 
“Study in Student Opinion.” They served in order 
to partially fulfill an experimental participation 
requirement. Subjects were randomly assigned to 
four nearly equal-sized groups; the subjects within 
each treatment group in effect formed two sub- 
groups—one comprised of subjects whose initial 
opinion statements resulted in their receiving oppo- 
sitional countercommunications, and the other sub- 
group being comprised of subjects who subse- 
quently received second communications that were 
congruent with the subjects’ initially stated opinions.3


Design 


The basic paradigm employed in the present study 
was essentially a replication of that followed by 
Hovland et al. Within each treatment condition two 
diametrically opposed communications regarding 
“ROTC on campus” were presented in a balanced 
order; that is, half of the subjects first received the 
pro-ROTC communication and then, after writing a 
paragraph stating their own opinion, received the 
anti-ROTC communication, while, for the remaining 
subjects, the order of presentation was reversed. 

Following this, all subjects were then administered 
a 10-item questionnaire containing items dealing 
with a wide variety of current campus and national 


3 Concluding the experiment, the subjects were 
asked what they felt was the purpose of the study. 
Almost all demonstrated their innocent acceptance 
of the study’s ostensible purpose. Nine subjects, how- 
ever, whose frequencies in each treatment group 
were almost identical, were excluded from the sample 
because they correctly perceived the study as an 
attitude-change experiment. In addition, because 
attitudes toward “ROTC on campus” were the topic 
of the present study, eight subjects were excluded 
who answered affirmatively questions asking if they 
were or had ever been a member of ROTC or any 
similar organization, or if they were or had ever 
been a member of any group opposed to ROTC 
being on campus. These exclusions were considered 
justifiable, in that membership in such organizations 
was considered indicative of an opinion too fixed to 
be reasonably vulnerable to modification. Finally, 
six additional subjects were excluded from two of 
the treatment conditions for reasons noted below. 
issues. Embedded in the questionnaire was an item
reading “The ROTC program should [not] be re- 
tained on campus.” Subjects responded to this item, 
as well as to the other 9 similar ones, by indicating 
their position on a 5-point Likert-type scale, the 
possible choices (and their scale values) ranging 
from agree strongly (+2) to disagree strongly 
(—2). Counterbalancing of all questionnaire items, 
including the critical one regarding ROTC, insured 
that one-half of the subjects in each treatment con- 
dition (described below) received each item worded 
positively (e.g., “should immediately withdraw”),
while the other half received negatively worded
(e.g., “should not be legalized”) items. Addition-
ally, the mixing of an equal number of positive and 
negative items on each questionnaire guarded against 
such possible biases as a yeasayers-naysayers effect. 

As in the Hovland et al. study, degree of attitude 
change was defined as the difference between the 
paragraph and questionnaire scores of each subject. 
Three independent judges rated each subject’s para- 
graph statement, assigning it a score based on the 
same 5-point scale employed in the questionnaire 
described above, the positivity or negativity of the 
rating being in reference to the pro or con nature 
of the ROTC communication the subject had just 
read. The mean of the three judges’ ratings was used 
as the subject’s paragraph score. Subsequent analy-
sis revealed the agreement between the judges to be
adequately high, the obtained interrater reliability
coefficients ranging from .73 to .81.
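The scoring just described can be sketched as follows. The paragraph score is the mean of the three judges' ratings on the -2 to +2 scale, and change is the difference between that score and the questionnaire score. The direction of subtraction (questionnaire minus paragraph) and the assumption that both scores have already been aligned to the same pro or con direction are illustrative choices, since the text does not spell them out.

```python
from statistics import mean


def paragraph_score(judge_ratings):
    """Mean of the three judges' ratings of one paragraph (-2 to +2 scale)."""
    if len(judge_ratings) != 3:
        raise ValueError("expected ratings from exactly three judges")
    if any(not -2 <= r <= 2 for r in judge_ratings):
        raise ValueError("ratings must lie between -2 and +2")
    return mean(judge_ratings)


def attitude_change(judge_ratings, questionnaire_score):
    """Change score for one subject.

    Assumption: questionnaire minus paragraph, so a negative value means
    the subject moved away from the initially stated position.
    """
    return questionnaire_score - paragraph_score(judge_ratings)


# Hypothetical subject: rated +2 by all three judges, then answers +1.
print(attitude_change([2, 2, 2], 1))
```

Note that a subject whose paragraph is rated near the scale midpoint has less distance to travel in either direction, which is the ceiling on change that the anticipatory moderation argument relies on.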

All experimental materials were presented in ex- 
ternally identical “student opinion booklets,” their 
“between the covers” differences constituting the 
four treatment conditions now to be described. 

1. The preinformed public commitment group 
underwent a replication of the Hovland et al. “com- 
mitment” procedure, including the critical announce- 
ment of the public condition before the subject 
stated his own opinion. This treatment should, 
according to the present writers’ hypothesis, result 
in maximal moderation before commitment. After 
reading an initial pro- or anti-ROTC communica- 
tion, preinformed public commitment subjects read 
the following instructions: 


You will now be asked to write, on the next 
page, your own frank opinion regarding the 
“ROTC on Campus” issue. This week, the DAILY 
BRUIN, UCLA’s school newspaper, is preparing a 
major series of articles regarding students’ opinions 
about the ROTC issue. We have agreed to assist 
the BRUIN reporters by providing them with all 
the opinions and respective authors’ names gath- 
ered in this survey. Therefore, we wish to advise 
you that the opinion you are about to give, along 
with your name, will be printed later this week 
in the DAILY BRUIN. 

Please now turn to the next page and write a 
short paragraph stating your own frank opinion. 
After completing your statement, please print your 
name in the indicated space exactly as it should 
appear in the DAILY BRUIN article. Also sign 
your name below your paragraph, thus granting
permission for publication of your statement in 
the BRUIN. 


After reading these directions, the subject turned 
the page to be confronted by a sheet emblazoned 
with the newspaper’s masthead, blanks for printing 
and signing his name, and a large space in which 
to write his opinion. Upon completing his public 
statement, the preinformed public commitment sub- 
ject proceeded to the ROTC communication arguing 
in the direction counter to the communication he 
had already read, the opinion questionnaire, and 
the manipulation check. 

Thirty-two subjects comprised this group.4 By 
virtue of their initial opinion statements, 18 pre- 
informed public commitment subjects formed the 
“oppositional” subgroup that subsequently received 
counterarguments attacking their initially stated 
ROTC opinion, and the remaining 14 preinformed 
public commitment subjects formed the “congruent” 
subgroup that received second communications con- 
firming their initial ROTC position. Subjects who 
initially took neutral (“neither agree nor disagree”) 
stances were, in all four treatment conditions, ran- 
domly assigned to either the oppositional or con- 
gruent subgroups within their respective treatment 
group. 

2. In the postinformed public commitment condi- 
tion, the public nature of the subject’s statement 
was not indicated to him until after he had com- 
mitted himself, a condition that should, according 
to Hovland et al.’s hypothesis, maximally result in
rehearsal after commitment. A problem this treat- 
ment presented, of course, was inducing subjects to 
publicly commit themselves to the statements they 
had already made. This difficulty was met by pre- 
senting to the subject, after the initial communica-
tion, the following: “You will now be asked to
write, on the next page, your own frank opinion
regarding the “ROTC on Campus” issue. Please now
turn to the next page and write a short paragraph
stating your own frank opinion.”

After having finished writing his statement on 
the blank page provided, the subject turned a further 
page to read: 


This week, the DAILY BRUIN, UCLA’s school
newspaper, is preparing a major series of articles
regarding students’ opinions about the ROTC issue.
We have agreed to assist the BRUIN reporters
by providing them with all the opinions and 
respective authors’ names gathered in this survey. 
Therefore, unless you have an extremely serious 
objection to our using your statement, as you 
have already presented it, would you please sign 
the standard release form provided below: 

I, __________ (print your name here), hereby
grant permission to the UCLA DAILY BRUIN
to reprint my “ROTC on Campus” statement
with my name attached in its series of articles
regarding student opinions about the ROTC issue.
(signed)


4 Two additional subjects were excluded for not
including their names with their statements.


After completing this page, postinformed public 
commitment subjects went on to the second com- 
munication, questionnaire, and manipulation check. 

Twenty-nine subjects comprised this group.5 Sub- 
jects’ initial opinion statements resulted in an oppo- 
sitional subgroup consisting of 15 subjects and a 
congruent subgroup comprised of 14 subjects. 

3. The uninformed group (n = 36) received no in-
formation regarding publicity, one way or another,
throughout the experiment. Rather, this group 
simply received the same instructions given the 
postinformed public commitment subjects prior to 
their paragraph writing, but without further details. 
The suspicion and subsequent moderation this lack 
of instruction might produce was considered a 
variable worth examining. 

Seventeen uninformed subjects made initial state- 
ments that consequently placed them in the opposi- 
tional subgroup, and 19 subjects comprised the 
corresponding congruent subgroup. 

4. Subjects in the explicitly private condition (n
= 35) differed from the uninformed group in that
they were specifically instructed not to indicate 
their names and were explicitly assured of absolute 
anonymity. This group corresponded to Hovland’s 
(1957) control treatment. This condition’s opposi-
tional subgroup had 20 subjects; 15 subjects com-
prised its congruent subgroup.6


Predictions 


It will be recalled that the authors’ general hy- 
pothesis was that subjects’ anticipatory moderation 
before commitment, rather than Hovland’s sug- 
gested rehearsal after commitment, best accounts 
for the reported effects of public commitment on 
subsequent resistance to countercommunication. 
Specifically, it was predicted that (a) the six post- 
informed public commitment, uninformed, and ex- 
plicitly private subgroups should not differ amongst 
themselves on the initial paragraph pretest; (b) pre- 
informed public commitment subjects should express 
a more moderate initial opinion than subjects in 


5 Four additional subjects were excluded for not 
granting permission for their statements and names 
to be used. 

6 Hovland et al. were able to use the results of a 
questionnaire administered 1 week before the ex- 
periment to insure that each group would be as- 
signed approximately equal numbers of students 
initially for, and initially against, the proposed 
change. Practical difficulties made this pretest im-
possible in the present study. However, postexperi-
mental examination of the obtained data revealed an 
equality in the distribution of pro- and anti-ROTC 
sentiments, both overall and within each treatment 
group, suggesting that the lack of preexperimental 
control for subjects’ initial opinions constituted no 
serious difficulty within the present study. 


other groups; and (c) preinformed public commit-
ment subjects should also show less opinion change
than those in other conditions, but only when the
second communication is oppositional.

Empirical confirmation of these predictions would 
be seen, of course, as directly supportive of the 
moderation before commitment explanation and 
directly contrary to Hovland’s rehearsal explana- 
tion, whereas disconfirmation of b would at least 
portend failure for the moderation proposition. 


RESULTS 


To allow for greater ease in examining the 
data, the pretest and posttest scores of our 
subjects have been simplified by standardiz- 
ing the direction of the counterbalanced mea- 
sures. This collapsing results in the original 
5-point Likert-type scales being converted 
into 3-point scales with the following values: 
2 = extreme opinion, 1 = moderate opinion, 
and 0 = neutral opinion. 

Opinion change was computed in the usual 
manner. Whereas the previously described 
opinion scores are expressed on a 3-point 
range in terms of extremeness of opinion, 
opinion-change measures take into account 
both extremity and direction of expressed 
opinion: degree of opinion change for each 
subject is equal to the difference, on the full 
5-point range, between his pretest and post- 
test opinion scores. 
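
The scoring just described can be sketched in a few lines. This is only an illustration: the function names, the coding of the 5-point scale as 1 to 5 with 3 as neutral, and the sign convention for change (positive means movement away from the side taken at pretest) are assumptions, not details given in the text.

```python
def extremity(likert: int) -> int:
    """Collapse a 5-point Likert response (assumed coded 1..5, 3 = neutral)
    to the 3-point extremeness scale: 2 = extreme, 1 = moderate, 0 = neutral."""
    if not 1 <= likert <= 5:
        raise ValueError("expected a response coded 1..5")
    return abs(likert - 3)


def opinion_change(pretest: int, posttest: int) -> int:
    """Signed change on the full 5-point range.

    Assumed convention: positive values mean the subject moved away from
    the side taken at pretest; a neutral pretest is treated like the
    upper side, so its change is simply pretest minus posttest.
    """
    side = 1 if pretest >= 3 else -1
    return side * (pretest - posttest)
```

Under these assumptions a subject who moves from 5 to 2 has extremity scores of 2 and 1 but an opinion change of 3, which is why change is computed on the full range rather than from the collapsed scores.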

As seen in Table 1, Prediction a is con-
firmed; no significant pretest differences exist
between the various postinformed public
commitment, uninformed, and explicitly pri-
vate subgroups.


TABLE 1

OPINION MEASURES FOR EACH SUBGROUP

Subgroup             n    Pretest        Posttest       Opinion
                          (mean score)   (mean score)   change

Oppositional subjects
Preinformed          18    .908          [illegible]     .260
Postinformed         15   1.445           .667           .867*
Uninformed           17   1.392           .647           .745*
Explicitly private   20   1.517           .700           .817*

Congruent subjects
Preinformed          14    .880          1.071          -.239
Postinformed         14   1.429          1.571          [illegible]
Uninformed           19   1.439          1.632          -.198
Explicitly private   15   1.513          1.667          -.233

* p < .001.


As suggested by Prediction b, and in 
contradistinction to the logic of the re- 
hearsal proposition, preinformed public com- 
mitment subjects did, in all comparisons, ex- 
press a more moderate opinion than did other 
subjects. Specifically, amongst oppositional 
subgroups, preinformed public commitment 
subjects were more moderate than postin-
formed public commitment (t = 2.46, df =
31, p < .01), uninformed (t = 2.52, df = 33,
p < .01), and explicitly private (t = 3.24,
df = 36, p < .001) subjects; and amongst
congruent subgroups, preinformed public
commitment subjects were similarly more
moderate than postinformed public commit-
ment (t = 3.07, df = 26, p < .005), unin-
formed (t = 3.79, df = 31, p < .001), and
explicitly private (t = 3.57, df = 27, p <
.001) subjects. If the rehearsal proposition
were valid, one would not expect these sig- 
nificant differences, especially between the 
respective preinformed public commitment and 
postinformed public commitment subgroups. 
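
As a consistency check on the comparisons above, the reported degrees of freedom can be recomputed from the subgroup sizes given in the Method section. The sketch assumes an independent-samples t test with pooled variance (df = n1 + n2 - 2); the text does not name the test, and the dictionary and function names are hypothetical.

```python
# Subgroup sizes as reported in the Method section.
OPPOSITIONAL = {"preinformed": 18, "postinformed": 15,
                "uninformed": 17, "explicitly private": 20}
CONGRUENT = {"preinformed": 14, "postinformed": 14,
             "uninformed": 19, "explicitly private": 15}


def pooled_df(n1: int, n2: int) -> int:
    """Degrees of freedom for a two-sample t test with pooled variance."""
    return n1 + n2 - 2


def df_against_preinformed(sizes: dict) -> dict:
    """df for each comparison of the preinformed subgroup with another subgroup."""
    base = sizes["preinformed"]
    return {name: pooled_df(base, n)
            for name, n in sizes.items() if name != "preinformed"}
```

Under that assumption the function yields 31, 33, and 36 for the oppositional comparisons and 26, 31, and 27 for the congruent ones, matching the df values reported in the text.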

Prediction c is also supported. Looking at
within-subgroup opinion change, we find all 
oppositional subgroups, except the prein- 
formed public commitment, showing signifi- 
cant change (p < .001, in all cases). Con- 
gruent subgroups, on the other hand, without 
exception show insignificant opinion change. 
This latter finding is, of course, somewhat 
unexpected; one can easily account for the 
lack of significant change amongst congruent 
postinformed public commitment and explic- 
itly private subjects in terms of ceiling ef- 
fects, but no satisfying explanation for why 
the congruent preinformed public commit- 
ment subjects should not change is forthcom- 
ing. Receiving a congruent second communi- 
cation, why shouldn’t these subjects abandon 
any cautionary moderateness? Whatever the 
answer, these data cannot be regarded as 
straightforward support for either explana- 
tion here under test. 

Considering now subgroup differences re-
garding amounts of opinion change, we find,
in line with Prediction c, that amongst oppo-
sitional subgroups, preinformed public com-
mitment subjects changed significantly less
than postinformed public commitment (t =
3.15, df = 31, p < .005), uninformed (t =
3.09, df = 33, p < .005), and explicitly pri-
vate (t = 3.63, df = 36, p < .001) subjects, whereas
no significant opinion-change differences ob-
tain between congruent subgroups. Although
the unexpectedly slight opinion change evi-
denced by preinformed public commitment
congruent subjects makes unclear the mean-
ing of the insignificant opinion-change dif-
ferences between congruent subgroups, the ob-
tained significant differences between opposi-
tional preinformed public commitment and
postinformed public commitment subgroups
clearly support our criticism of the Hovland
explanation.


DISCUSSION 


Although it was the conclusion of the Hov- 
land et al. (1957) investigation that re- 
hearsal after commitment was responsible for 
publicly committed subjects’ greater re- 
sistance to countercommunication, the data 
obtained in the present study suggest that a 
more probable and plausible explanation is 
that of anticipatory moderation before com- 
mitment. To use the term employed by Janis 
(1962) in his discussion of the psychological 
effects of warnings, it appears that more in- 
volved preinformed public commitment sub- 
jects, unlike those who made a later or anony- 
mous commitment, experienced a greater 
“arousal of the need for reassurance,” and
thus anticipatorily retreated to a more mod-
erate statement as a way of achieving a
“safer” or more assured position. 

The position of Jones and Gerard (1967) 
that “Once a person has committed himself 
to a course of action, he . . . engages in cog- 
nitive work that will sustain his decision [p. 
418],” also melds smoothly with our data.
Specifically, their formulation may be seen as
predicting the obtained change data for oppo-
sitional subjects (the change data for con-
gruent subjects are ambiguous, as discussed
above, and seem to defy interpretation), in
that the postinformed public commitment,
uninformed, and explicitly private groups, un-
like the preinformed public commitment
group, were only minimally committed to
their initial opinions, since they did not know
they would be made public at the time they
wrote them. Relative to preinformed public
commitment subjects, other subjects may
have felt less obligation to defend their ini- 
tial position and thus were able to engage in 
substantial attitude change, motivated more 
by a positively oriented desire to get things 
right rather than merely to defend things 
done. 

As will be recalled, the uninformed group 
was included in the present study for the 
purpose of assessing what effects a lack of 
information regarding publicity or anonymity 
might have on subjects’ expressions of opin- 
ion. The findings that the uninformed and 
postinformed public commitment subgroups 
paralleled their explicitly private counterparts 
would suggest that uninformed subjects (cor- 
rectly) assumed their statements to be anony- 
mous. This seems quite reasonable in that 
subjects believed, at the time of the experi- 
ment, that they were participating in a sur- 
vey of student opinion, a process generally 
regarded as a private, ballotlike procedure. 
This would suggest that the uninformed sub- 
groups may be relabeled the implicitly private 
subgroups. 

The support for the anticipatory modera- 
tion before commitment hypothesis tested in 
the present study should not in itself be con- 
sidered indicative of any weaknesses in previ- 
ous studies—except that of Hovland et al. 
(1957)—that posit rehearsal explanations for 
their findings. As noted earlier, depending on 
the situation, an anticipation-moderation, re- 
hearsal, or (and most probable) combination 
moderation-rehearsal mechanism may operate 
to produce resistance. 


REFERENCES 
Deaux, K. K. Variations in warning, information
preference, and anticipatory attitude change.
Journal of Personality and Social Psychology,
1968, 9, 157-161.

Deutsch, M., & Gerard, H. B. A study of norma-
tive and informational social influences upon indi-
vidual judgment. Journal of Abnormal and Social
Psychology, 1955, 51, 629-636.

Deutsch, M., Krauss, R. M., & Rosenau, N. Dis-
sonance or defensiveness? Journal of Personality,
1962, 30, 16-28.

Freedman, J. L., & Sears, D. O. Warning, distrac-
tion, and resistance to influence. Journal of Per-
sonality and Social Psychology, 1965, 1, 262-266.

Gerard, H. B. Conformity and commitment to the
group. Journal of Abnormal and Social Psychol-
ogy, 1964, 68, 209-211.

Hovland, C. I., Campbell, E. H., & Brock, T. The
effects of “commitment” on opinion change fol-
lowing communication. In C. I. Hovland et al.,
The order of presentation in persuasion. New
Haven: Yale University Press, 1957.

Hovland, C. I., & Mandell, W. Is there a “law of
primacy” in persuasion? In C. I. Hovland et al.,
The order of presentation in persuasion. New
Haven: Yale University Press, 1957.

Janis, I. L. Psychological effects of warning. In G.
W. Baker & D. W. Chapman (Eds.), Man and
society in disaster. New York: Basic Books, 1962.

Jones, E. E., & Gerard, H. B. Foundations of social
psychology. New York: Wiley, 1967.

Lund, F. H. The psychology of belief. IV. The law
of primacy in persuasion. Journal of Abnormal
and Social Psychology, 1925, 20, 183-191.

McGuire, W. J. Resistance to persuasion conferred
by active and passive prior refutation of the same
and alternate counterarguments. Journal of Ab-
normal and Social Psychology, 1961, 63, 326-332.

McGuire, W. J. Inducing resistance to persuasion.
In L. Berkowitz (Ed.), Advances in experimental
social psychology. Vol. 1. New York: Academic
Press, 1964.

McGuire, W. J., & Millman, S. Anticipatory be-
lief lowering following forewarning of a persuasive
attack. Journal of Personality and Social Psychol-
ogy, 1965, 2, 471-479.

McGuire, W. J., & Papageorgis, D. Effectiveness of
forewarning in developing resistance to persuasion.
Public Opinion Quarterly, 1962, 26, 24-34.

Papageorgis, D. Anticipation of exposure to per-
suasive messages and belief change. Journal of
Personality and Social Psychology, 1967, 5, 490-
496.

Sears, D. O., Freedman, J. L., & O’Connor, E. F.
The effects of anticipated debate and commitment
on the polarization of audience opinion. Public
Opinion Quarterly, 1964, 28, 617-627.

Sherif, M., & Hovland, C. I. Social judgment. New
Haven: Yale University Press, 1961.


(Received June 12, 1970) 


Journal of Personality and Social Psychology
1972, Vol. 22, No. 1, 8-12


PHYSICAL ATTRACTIVENESS AND MARITAL CHOICE 1


BERNARD I. MURSTEIN 2


Connecticut College 


The stimulus-value-role theory of marital choice holds that individuals tend 
to choose marital partners of comparable physical attractiveness to themselves. 
Physical attractiveness was measured by self-perception, perception of the 
partner, and appearance judged from photos. Actual couples were hypothesized 
to be significantly less discrepant on these variables than a control group 
contrived by randomly pairing the scores of men and women. The two samples 
consisted of 99 and 98 college couples, respectively, who were going together 
or were engaged. Results support the hypothesis for self-percepts and photo 
attractiveness but not for perception of the partner. The results support 
stimulus-value-role theory and the concept of marital choice as an exchange-
market phenomenon.


The relationship of physical attractiveness 
to interpersonal attraction has recently been 
the subject of a considerable number of in- 
vestigations. One focus of attention has been 
on whether, in a heterosexual encounter, indi- 
viduals are attracted to those of a similar 
level of attractiveness to themselves. The re- 
sults have been equivocal. 

In the first of the recent series of studies 
(Walster, Aronson, Abrahams, & Rottman, 
1966), the authors found no evidence of 
similarity of physical attractiveness leading 
to interpersonal attraction. Instead, college 
freshmen, randomly matched at a computer 
dance, tended to be most attracted to physi- 
cally attractive partners regardless of their 
own physical attractiveness. Negative find- 
ings concerning the “similarity” hypothesis 
have also been reported by Brislin and Lewis 
(1968) and Byrne and Ervin (1970). 

On the other hand, Dion and Berscheid 
(1970) and Walster and Walster (1970) 
have reported positive results for the thesis 
of mutual attraction among those of compa- 
rable physical attractiveness. These authors 
explain the contradiction between their recent 
work and earlier studies in terms of different 


1 This study was supported in part by National 
Institute of Mental Health Grant 08-405. 

2 The author wishes to thank Sara B. Kiesler for
her helpful comments regarding the manuscript. Luc 
Jucquois of the Centre de Calculs of the Université 
de Louvain very kindly computed the values re- 
ported in Study II. 

Requests for reprints should be sent to Bernard I.
Murstein, Department of Psychology, Connecticut 
College, New London, Connecticut 06320. 


demands of the situation. In studies where the 
“similarity” hypothesis was unsupported, sub- 
jects were already assured of a date before 
interacting with their partners, but they ex- 
pressed their preference after having dated. 
In these authors’ recent studies, however, sub- 
jects chose their dates before spending any 
time with them.

In short, a “similarity” hypothesis might 
be more effective when one has to attain a 
relationship because the possibility of fear of 
failure and rejection tends to dampen the 
aspirations of the unattractive. Once assured 
of a “date,” however, since the possibility of 
immediate rejection, at least, is negligible, the
individual focuses on what rewards the other
can offer. Because associating with an at-
tractive individual may be presumed to be
more rewarding than associating with a less 
attractive individual, the individual assured 
of a date tends to appreciate his partner as a 
function of her attractiveness regardless of 
his own attractiveness. 

Although this explanation appears logical, 
it should be noted that the aforementioned 
studies have tested interpersonal attraction 
based on periods of time ranging from no 
interaction to a few hours. None of these 
studies have dealt with couples who have 
demonstrated interpersonal attraction to each 
other over a period of months or years. 

A recent theory accounting for interper- 
sonal attraction in marital choice (Murstein, 
1970), however, deals with this kind of situ- 
ation. This theory, called stimulus-value-role, 
holds that, in initial encounters in which in-
dividuals are not forced to interact with each
other, interpersonal attraction is a function of
the individual’s perception of the other’s physi-
cal, social, mental, or reputational attributes,
and his perception of his own qualities as he 
thinks these will be perceived by the other 
person. Because initial movement is influenced 
primarily by perceptual, noninteractional cues, 
the first stage of the theory is called the 
“stimulus” stage. 

Many stimulus variables such as religion, 
race, professional aspiration, status, and edu- 
cation do not directly influence the initial 
encounter. Either they may be unknown at 
first (religion, professional aspiration) or 
they may be selected out in the circum- 
stances that permit the couple to meet; hence, 
level of education and status are not likely to 
be very influential in determining interper- 
sonal attraction on a college campus. One 
potent stimulus variable in our culture that 
is not highly filtered out in interpersonal en- 
counters, however, is physical attraction. 

Most individuals have a common standard 
of physical beauty, at least as far as facial 
features are concerned (Udry, 1965). Further, 
physical beauty is rewarding for both sexes 
(although more so for men in our culture) 
both as an individual experience and for the 
status it brings to oneself in the eyes of 
others. If “A’s” partner is handsome or 
beautiful, “A” is believed to possess certain 
qualities which earned him such a desirable 
partner. Unfortunately, the supply of physi- 
cally attractive partners is limited, whereas 
the desire to marry is experienced by well 
over 90% of the population. The fact that 
we do not have a largely unmarried popula- 
tion, therefore, necessitates that we conceive 
of marital choice as a kind of exchange- 
market phenomenon. 

Several writers have recently utilized some 
elementary economic concepts for explaining 
social behavior (Blau, 1964; Homans, 1961; 
Thibaut & Kelley, 1959). Essentially, these 
approaches maintain that each person tries to 
make social interaction as profitable as possi- 
ble, profit being defined as the rewards he 
gains from the interaction minus the costs he 
must pay. By rewards are meant the pleasures, 
benefits, and gratifications an individual gains 
from a relationship. Costs are factors which
inhibit or deter the performance of more pre-
ferred behaviors. A young man living in the
Bronx, for example, might like a young lady 
from Brooklyn whom he met while both were 
at a resort. Back in the city, however, he 
may doubt that the rewards he might gain 
from the relationship would be worth the 
costs in time and fatigue of 2-hour subway 
rides to Brooklyn. 

Closely allied to rewards and costs are 
assets and liabilities. Assets are the commodi- 
ties (behaviors or qualities) that the indi- 
vidual possesses which are capable of reward- 
ing others and, in return, causing others to 
reciprocate by rewarding the individual. 
Liabilities are behaviors or qualities associ- 
ated with an individual which are costly to 
others and thus, by reciprocity, costly also to 
the self. 

A man who is physically unattractive (lia- 
bility), for example, might desire a woman 
who has the asset of beauty. Assuming, how- 
ever, that his nonphysical qualities are no 
more rewarding than hers, she gains less profit 
than he does from the relationship and, thus, 
his suit is likely to be rejected. Rejection is 
a cost to him because it may lower his self- 
esteem and increase his fear of failure in 
future encounters; hence, he may decide to 
avoid attempting to court women whom he 
perceives as much above him in attractiveness. 

Contrariwise, he is likely to feel highly 
confident of success if he tries to date a
woman even less attractive than himself 
where he risks little chance of rejection (low 
cost). However, the reward value of such a 
conquest is quite low, so that the profitabil- 
ity of such a move is also low. As a conse- 
quence, an experienced individual is likely to 
express a maximum degree of effort and also 
obtain the greatest reward at the least cost 
when he directs his efforts at someone of 
approximately equal physical attraction, as-
suming all other variables are constant.

It is hypothesized, therefore, that premari- 
tal couples will show greater than chance 
similarity with respect to physical attraction. 
Physical attraction in the present study was 
measured in a number of different ways in
two separate studies.


STUDY I

Method


Ninety-nine couples who were either engaged or 
“going steady” volunteered to participate in a study 
on marital choice conducted simultaneously at sev- 
eral northeastern colleges and universities (Connecti- 
cut College, University of Connecticut, Yale). The 
subjects received $15 each for participating, and the 
testing procedure took several hours and involved 
a series of personality and person-perception tests.
Some findings resulting from analyses of these data 
have been published earlier (Murstein 1967a, 1967b, 
1970, 1971). 

Two questions asked of the subjects dealt with 
physical attraction. The first asked how physically 
attractive he found his partner, and the second asked 
for his estimate of his own physical attraction. Each 
question was evaluated on a 5-point scale ranging
from “extremely good looking,” which received a
score of 5, to “considerably below average in looks,”
which received a score of 1.

In addition, during the general testing procedure, 
each couple was called to another room and photo- 
graphed twice by means of a Polaroid camera. One 
photo was taken with the couple in a natural pose
looking directly at the camera, the second with the
couple smiling. These pictures were evaluated by
eight judges (young professors and graduate stu-
dents of whom four were men and four were
women) on a 5-point scale. The judges were in-
structed that, in the general population, a score of
5 (extremely good looking) would be obtained by
about 8% of the population, a score of 4 (better
looking than average) by about 17%, a score of 3
(average) by about 50%, a score of 2 (somewhat
below average) by about 17%, and a score of 1
(considerably below average) by about 8%. Com-
parison of the average of the male and female
judged scores yielded a Pearson r of .80, suggesting
that no pronounced sex differences were present;
hence, to determine interjudge reliability, the scores
of all eight judges were pooled and then separated
into two random halves. The mean scores of each
half were correlated by means of the Spearman-
Brown formula, resulting in an r of .91, a reliability
value sufficiently high for the purposes of the study.


TABLE 1

COMPARISON OF DISCREPANCY SCORES FOR PHOTO
ATTRACTIVENESS BETWEEN ACTUAL AND
RANDOMLY PAIRED COUPLES

Couples            Frequency of         p for median random
                   discrepancy > .5     couples (Trial 4)

Actual                    39
Random, Trial 1           55
Random, Trial 2           47
Random, Trial 3           45
Random, Trial 4           50                  .01
Random, Trial 5           60
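
The Spearman-Brown step used for the interjudge reliability in the Method paragraph can be sketched as follows. The function names are hypothetical, and the inverse function is included only to show what split-half correlation a stepped-up value of .91 would correspond to; the paper does not report the uncorrected half correlation.

```python
def spearman_brown(r_half: float, factor: float = 2.0) -> float:
    """Step a split-half correlation up to the reliability of a measure
    `factor` times as long (factor = 2 for two pooled halves)."""
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


def implied_half_correlation(r_full: float) -> float:
    """Invert the two-half Spearman-Brown formula: the split-half
    correlation that would step up to r_full."""
    return r_full / (2.0 - r_full)
```

For example, `spearman_brown(implied_half_correlation(0.91))` recovers 0.91, and the implied correlation between the two random halves of judges is somewhat lower than the stepped-up figure, as the formula always gives for positive correlations below 1.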


Results 


To test the hypothesis, it was decided to 
create a control group of couples whose dis- 
crepancies in physical attraction could be 
compared with the discrepancies of the actual 
group. Accordingly, a control group was 
formed by randomly pairing the physical at- 
traction scores of the 99 men and women with 
each other. To ensure stability, this randomi- 
zation procedure was repeated for a total of 
five occasions.

The absolute discrepancies between mem-
bers of each of the actual couples were com-
puted, and a cut as close to the median as
possible was made. Using this cut, the dis-
crepancies of each random group were exam-
ined to determine the number of discrepancies
above the cutting point of the real couples.
The median discrepancy (third in size) of
the five random groups was chosen as most 
representative of the “true” random couple 
distribution and served as the control group 
to compare against the discrepancies of the 
actual group. An example will illustrate the 
analysis. 
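
The random-pairing control described above can be sketched as below. This is a minimal illustration under stated assumptions: the function names, the use of Python's `random` module, and the fixed seed are all hypothetical, and the original pairing was of course not done this way.

```python
import random
from statistics import median


def discrepancies(men, women):
    """Absolute attractiveness discrepancy for each (man, woman) pair."""
    return [abs(m - w) for m, w in zip(men, women)]


def count_above(values, cut):
    """Number of discrepancies strictly greater than the cutting point."""
    return sum(1 for v in values if v > cut)


def random_pairing_counts(men, women, cut, trials=5, seed=0):
    """Re-pair the women's scores with the men's at random `trials` times
    and count, for each trial, the discrepancies above the cut."""
    rng = random.Random(seed)
    counts = []
    for _ in range(trials):
        shuffled = list(women)
        rng.shuffle(shuffled)
        counts.append(count_above(discrepancies(men, shuffled), cut))
    return counts


def median_random_count(counts):
    """The middle count (third in size for five trials), used as the control."""
    return median(counts)
```

Applied to the five random-trial frequencies listed in Table 1 (55, 47, 45, 50, 60), `median_random_count` returns 50, the Trial 4 value that the text treats as the control.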

For the photo judgments, the mean of the 
ratings of the eight judges served as the at-
tractiveness score of each subject.3 The cut
closest to the median for the actual subjects 
showed a discrepancy between the partners 
of .5 or less units for 60 cases and a dis- 
crepancy of greater than .5 for the other 39 
cases. The probability of getting a discrep- 
ancy score greater than .5 for the actual group 
is therefore .39 (39/99). What was the 
frequency of cases where the discrepancy was 
greater than .5 for the most typical random 
group? The distribution of the five random 
groups is shown in Table 1. Inspection of this 
table shows that Random Group 4 possessed 
the median value of frequency of discrepan- 
cies greater than .5 (50) among the random 
groups. What then is the chance probability 
of getting a frequency of 50 or more cases 
with discrepancies greater than .5, where 
N = 99 and the probability of a discrepancy 
greater than .5 is .39?


3 Judges did not know which partner went with
whom in making the judgments.

The answer to this question is found in a
table of binomials (Romig, 1953). For the 
present example, this occurrence by chance is 
most unlikely (< .01); hence, it is con- 
cluded that the photo attractiveness of real 
couples was significantly less discrepant than 
those of artificially paired couples. The same 
procedure was used to test the discrepancy 
between each subject’s perception of his part- 
ner and between the self-concepts for physical 
attractiveness of the couple. The former com- 
parison was clearly nonsignificant, whereas 
the latter was of marginal significance in 
accordance with the hypothesis (p = .06). 
The self-concepts of actual couples, in other 
words, were less discrepant than the self- 
concepts of randomly paired men and women 
from the sample, and this difference ap- 
proached significance (p < .06). 
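
The probability that the paper reads from a table of binomials can also be computed directly. The sketch below is a hypothetical helper, not the author's procedure: it sums the upper tail of a binomial distribution, here the chance of 50 or more discrepancies above the cut out of N = 99 when the per-couple probability is .39.

```python
from math import comb


def binomial_upper_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p), summed term by term."""
    return sum(comb(n, i) * p**i * (1.0 - p)**(n - i) for i in range(k, n + 1))


# Photo-attractiveness example from the text: 50 or more of 99 cases
# above the cut when the probability for actual couples is 39/99.
photo_tail = binomial_upper_tail(50, 99, 0.39)
```

No numerical result is asserted here; the value obtained may differ slightly from the tabled figure depending on how the table was entered and rounded.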


STUDY II

Method

Essentially the same study was replicated with 98 
new couples, except that photo attractiveness which 
had been clearly significant in the first study was 
omitted, and only three random replications instead 
of five were employed. 


Results 


Again, in the second study, the discrepancy in per-
ception of the partner’s physical attractiveness was
not significantly smaller for actual couples
than for contrived ones. The discrepancy between
the partners’ self-concepts for physical attractive-
ness, however, was significantly smaller (p < .001), in ac-
cordance with the hypothesis. After the in-
tended analysis had been completed, it
was realized that the best measure of 
equality of physical attractiveness might be 
an intraperceptual one in which the sub- 
ject compared his perception of the partner 
with his perception of himself. In accordance 
with the hypothesis, a significant positive cor- 
relation should be found between each sub- 
ject’s self-concept for attraction and his per- 
ception of his partner. The results bore out 
this expectation, the correlation for men’s 
perceptions being .50 and for women’s per-
ception, .45, both highly significant (p <
.01). The correlation between couples for
objective attractiveness (photo) was .38 (p
< .01) and for self-perceived attractiveness,
.31 (p < .01). Neither of these interpercep-
tual measures was significantly smaller than
the aforementioned intraperceptual one, al- 
though a trend (p < .10) was in evidence for 
the difference in correlation between self- 
perceived attractiveness by both partners com- 
pared to the self-partner correlation by the 
women. 


Discussion 


The results indicate that physical attrac- 
tiveness, both as subjectively experienced 
and objectively measured, operates in accord- 
ance with exchange-market rules. Individuals 
with equal market value for physical attrac- 
tiveness are more likely to associate in an 
intimate relationship such as premarital en- 
gagement than individuals with disparate val- 
ues. 

The fact that the comparison for the cou- 
ple’s respective perceptions of the partner 
failed to follow this rule is probably due to 
the fact, as the data reveal, that most sub- 
jects tend to overvalue the physical attrac- 
tiveness of the partner and give him a higher 
rating than they give themselves; thus, the 
mean self-attractiveness ratings for men and 
women were both 3.4. The men’s average rat- 
ing for women was 4.1, and that of women 
for men, 3.9. This overestimation of the part- 
ner seems to result in a ceiling effect for 
attractiveness which destroys the possibility 
of differences being found between random 
and real couples. It is also noteworthy that 
many individuals’ self-concepts for attractive- 
ness are grounded on objective reality to some 
extent, the correlation between self-concept 
for attractiveness and photo attractiveness 
being .33 (p < .01) for men and .24 (p < 
.01) for women. 

In sum, although the typical subject per- 
ceives his partner as slightly above himself 
in physical attractiveness, he, nevertheless, 
perceives both himself and his partner as 
quite comparable on this characteristic. It is 
readily acknowledged that physical attractive- 
ness is but one of many factors influencing 
marital choice. Stimulus-value-role theory 
(Murstein, 1970) holds that value homo- 
geneity and role compatibility, which repre- 
sent the last two stages of this three-stage 
theory, are also quite important in determin- 
ing marital choice. But, even if one considers 
only the first (stimulus) stage, physical at- 
tractiveness is but one of a multitude of varia- 
bles which have been enumerated earlier. 
However, the fact that equality of physical 
attractiveness tends to influence marital 
choice even when these other variables are not 
controlled testifies to its ubiquitousness dur- 
ing the entire course of marital courtship. 

OF INTENDED BEHAVIOR TOWARD BLACKS 1 

This study investigates the effects of four independent variables on a specific 
instance of intended behavior toward blacks. The intended behavior was de- 
fined as the willingness of subjects to pose with blacks for textbook pictures to 
be used for different purposes. Twenty-two moderately anti-black subjects and 
22 moderately equalitarian subjects were tested both on a behavioral and an 
attitudinal measure. Results indicate that attitude, degree of intimacy of an 
anticipated relationship, and the interaction between degree of exposure and 
degree of intimacy of an anticipated relationship had a significant effect on 
willingness to pose in interracial scenes. The significant main effects confirm 
and extend the findings of DeFleur and Westie and confirm what is generally 
known about social distance. The significant interaction suggests the need for 
and the direction of possible further study. 


A number of laboratory and field studies 
(DeFleur & Westie, 1958; LaPiere, 1934; 
Linn, 1965; Warner & DeFleur, 1969) have 
attempted to investigate the relationship be- 
tween attitudes and behavior; a comparison 
of the results based on the existing studies 
indicates that this relationship is, at best, 
equivocal (Wicker, 1969). For instance, De- 
Fleur and Westie (1958) demonstrated a 
positive relationship between attitudes and 
behavior, whereas LaPiere (1934) showed no 
evidence of a positive relationship. The pur- 
pose of the present study is to specify more 
clearly than has been done in previous studies 
certain specific situational influences on an 
attitudinally relevant task and, thereby, to 
clarify the relationship between attitudes and 
behavior. 

LaPiere (1934) made the first attempt to 
investigate, on an experimental basis, the 
relationship between attitudes and action. He 
toured the United States with a Chinese 
couple, stopping at places where public accom- 


1 This report is based on a dissertation submitted 
to the University of Colorado Psychology Depart- 
ment in partial fulfillment of the requirement for 
a PhD degree. The research was partially supported 
by Grant GS-303 from the National Science Founda- 
tion. The author is indebted to his committee chair- 
man, Stuart W. Cook, for his extremely helpful 
advice and encouragement throughout the course of 
the study. 

2 Requests for reprints should be sent to James A. 
Green, Department of Psychology, University of Cali- 
fornia, Los Angeles, California 90024. 
modations were available. His behavioral mea- 
sure involved acceptance or rejection of the 
trio by the proprietors of these public accom- 
modations. After the trip was completed, 
LaPiere sent a questionnaire to the people 
who had served him and asked them if they 
would serve a Chinese couple (his attitude 
measure). He found a lack of relationship 
between his attitude measure and his behav- 
ioral measure. The relevance of LaPiere’s 
study to the present one is of a histori- 
cal nature; that is, his was the first study 
to point out the apparent inconsistency be- 
tween one’s attitude and one’s behavior. The 
broader interpretation given to LaPiere’s 
study by other social scientists (but not by 
LaPiere) advances the notion that there is 
no relationship between attitudes and behav- 
ior. LaPiere’s (1934) measure of behavior 
was actual behavior, while his attitude mea- 
sure was intended behavior. 

The first study to take attitudinally rele- 
vant behavior into the laboratory was that 
of DeFleur and Westie (1958). The purpose 
of their experiment was to develop an inter- 
view measure that would tap the “salience” 
of a person’s attitudinal orientation. It was 
also designed to examine the relationship be- 
tween attitudes and behavior. The results of 
this study showed that prejudiced subjects 
were less willing to pose with blacks than were 
unprejudiced subjects; there was not, however, 
a clear one-to-one relationship. 

The next study to deal with the relation- 
ship between attitudes and behavior was 
completed by Linn (1965). One of the pur- 
poses of his study was to examine the utility 
of attitude measurements as a means for pre- 
dicting future behavior. In his attempt to 
predict behavior from knowledge of attitudes, 
Linn employed questionnaire items, identical 
except for context, to measure behavior and 
attitudes; he did not find any relationship 
between the two. 

The present study used willingness to pose 
in pictures as its dependent variable. DeFleur 
and Westie (1958) operationalized their de- 
pendent variable in much the same way, using 
a “photograph release statement” as their 
measure of behavior. However, before a sub- 
ject filled out the behavioral intention test, 
he was first asked whether he would be willing 
to be photographed with a black of the op- 
posite sex. Many said yes, but others were 
adamant about their refusal. Those who 
agreed were presented with a series of release 
statements which allowed the photograph to 
be used in different ways. They showed slides 
(instead of photographs) which depicted a 
heterosexual relationship. The present study 
adds the dimension of intimacy of the antici- 
pated relationship, ranging from a fairly im- 
personal relationship to a close personal one. 

The present experiment attempted to test 
the following hypotheses: A subject is less 
willing to pose with blacks for photographs 
(a) the more unfavorable his attitude toward 
blacks, (b) the lower the socioeconomic status 
of an anticipated black associate, and (c) the 
higher the degree of intimacy of the antici- 
pated relationship. It was further hypothe- 
sized that there is a significant interaction 
between attitude and degree of exposure. In 
addition, the study explores the relationships 
among these variables. 


METHOD 
Procedure 


Subjects were recruited by the experimenter from 
a variety of college classes. The experimenter intro- 
duced himself as a college student working part time 
for a publishing company and explained that he was 
interested in locating people who would be included 
in photographs for some textbooks that a publishing 
company was releasing in the near future. He stated 
that a meeting would be held at a specified time 
and place for those who would be participating and 
that subjects who were interested were invited to 
attend. Students were told that all those who 
attended the meeting would be paid. 

The behavioral measure, which was operationalized 
as agreeing or disagreeing to pose in interracial 
photographs, was collected at this series of meetings. 
At the meeting, subjects were given 32 sketches and 
a release statement for each sketch. The release 
statement was a form allowing a subject to state 
his willingness to pose in a photograph under the 
specified conditions. Sketches, mainly done by gradu- 
ate students in art and advertising, were completed 
and lithographed so that each subject had a copy of 
each sketch when he was completing the behavioral 
intention test. To avoid total acceptance or rejection 
of the entire picture set, each subject was asked to 
agree to pose in about half of the photographs. They 
were told that “a wide range of average people makes 
for better illustrations.” The stimuli were composed 
of 16 interracial sketches and 16 all-white sketches. 
The intent of the all-white sketches was to control 
other influences upon posing, that is, an opportunity 
for a modeling career or the need for money. 

Each individual’s score was computed as follows: 
Each of the interracial pictures was paired with an 
all-white picture, matched on all the variables rele- 
vant to the study; thus, the only difference between 
a given pair of sketches was the race of the models. 
If the subject rejected the interracial picture and 
accepted the white picture, he was given a score of 
+1 for that pair; if the subject rejected or accepted 
both members of a pair, he was given a score of 0; 
and if the subject rejected the white picture and 
accepted the interracial picture, he was assigned a 
score of —1 for that pair. Thus, for a subject to 
get a high rejection score, he had to differentially 
reject the interracial sketches and accept the all-white 
sketches. This scoring procedure was devised because 
a preliminary analysis indicated no difference between 
subjects who rejected both members of a pair of 
sketches and those who accepted both members. As 
noted above, the all-white sketches were used as a 
control. Analysis of variance using the number of all- 
white sketches rejected revealed no significant main 
effects or interactions, and a composite score was 
therefore devised. Both the order of degree of expo- 
sure and degree of intimacy of an anticipated rela- 
tionship were randomized among the stimuli. (There 
was only one “form” of the test.) As each subject 
completed this task, he handed in his materials and 
was paid $1.50. Later, the subjects were informed 
by mail that they would not be needed for the 
photographs. 
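
The pair-scoring rule described above can be sketched in a few lines of Python. This is only an illustration of the rule as stated in the text; the function names and the sample responses are hypothetical and are not taken from the study's materials.

```python
from typing import Iterable, Tuple


def pair_score(accept_interracial: bool, accept_white: bool) -> int:
    """Score one matched pair of sketches (+1, 0, or -1)."""
    if accept_white and not accept_interracial:
        return 1   # interracial sketch rejected, all-white sketch accepted
    if accept_interracial and not accept_white:
        return -1  # all-white sketch rejected, interracial sketch accepted
    return 0       # both members accepted or both rejected


def mean_rejection_score(pairs: Iterable[Tuple[bool, bool]]) -> float:
    """Average the pair scores; the result lies between -1 and +1."""
    scores = [pair_score(interracial, white) for interracial, white in pairs]
    return sum(scores) / len(scores)


# Hypothetical responses for four pairs: (accepted interracial, accepted all-white).
example_pairs = [(False, True), (True, True), (False, False), (True, False)]
print(mean_rejection_score(example_pairs))
```

A subject who differentially rejects the interracial member of every pair would average +1, and one who treats both members of every pair alike would average 0.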

About 1-2 weeks following the administration of 
the behavioral measure, another experimenter went 
back into the classes where the subjects were re- 
cruited and administered the Multifactor Racial Atti- 
tude Inventory (MRAI) (Woodmansee & Cook, 
1967). The rationale given to the subjects for the 
attitude testing was that it was part of a larger study 
concerned with public opinions of college students. 
Along with the MRAI, four questionnaires were in- 
cluded, composed of items taken from four liberal- 
ism-conservatism attitude scales. These were given 
so that the subjects would not make any connection 
between the behavioral measure and the MRAI. 

The design of the experiment was a 2 × 4 × 2 × 4 
factorial. The factors were (a) attitude toward blacks 
(moderately unfavorable or moderately favorable) ; 
(b) degree of exposure, four levels from low to high; 
(c) socioeconomic status of an anticipated black 
associate (high or low); and (d) degree of intimacy 
of the anticipated relationship, four levels from least 
to most intimate. 
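
As a rough check on how the four factors fit together, the cells of the design can be enumerated in Python. The sketch assumes, as the 16 interracial sketches and the 11 subjects per cell reported in Table 1 suggest but the text does not state outright, that degree of exposure and degree of intimacy varied within subjects while attitude and confederate status varied between subjects. The level labels are placeholders.

```python
from itertools import product

# Factor levels as listed in the text; the labels themselves are placeholders.
attitude = ["moderately unfavorable", "moderately favorable"]
exposure = [1, 2, 3, 4]
status = ["high", "low"]
intimacy = [1, 2, 3, 4]

all_cells = list(product(attitude, exposure, status, intimacy))

# Assumption: exposure and intimacy are crossed within subjects (one
# interracial sketch per combination); attitude and status define groups.
within_cells = list(product(exposure, intimacy))
between_groups = list(product(attitude, status))

n_subjects = 44
subjects_per_group = n_subjects // len(between_groups)

print(len(all_cells), len(within_cells), len(between_groups), subjects_per_group)
```

Under these assumptions the 16 within-subject combinations match the 16 interracial sketches, and 44 subjects divided among four between-subject groups gives the 11 subjects per cell reported with the analysis of variance.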

Attitude toward blacks is defined as a score on a 
paper-and-pencil measure, the MRAI. Subjects were 
chosen for the moderately favorable or moderately 
unfavorable attitude groups on the basis of their 
total scores from the MRAI. Moderately favorable 
subjects were defined as people who scored from .5 
to 1.5 standard deviations from the mean, and 
moderately unfavorable subjects’ scores ranged from 
−.5 to −1.5 standard deviations from the mean. Of 
the 76 subjects who were tested on the “behavioral 
intention,” 32 were not included as subjects in the 
design because they did not obtain scores in the 
appropriate ranges of the MRAI. The second vari- 
able, degree of exposure, refers to the expected use 
of the photograph and the kind of people who will 
eventually see it. The four levels of degree of 
exposure were a Peace Corps textbook used at the 
college level in one of the underdeveloped coun- 
tries; a textbook for classes in prisons and reforma- 
tories (to be used by inmates who wish to complete 
their high school and college education) ; a college- 
level textbook in the field of sociology; and a Life 
magazine story concerned with the use of Peace 
Corps textbooks. 

The third variable, socioeconomic status of an 
anticipated black associate, was operationalized 
through the use of confederates.3 The high socio- 
economic condition employed two well-dressed, well- 
groomed black confederates—one male and one fe- 
male. In the other condition, the same black con- 
federates presented an unkempt, sloppy appearance. 

The fourth variable was the degree of intimacy 
of the anticipated relationship depicted in the inter- 
racial pictures, defined as the amount of familiarity 
that existed between the subject and the stimulus 
person in the sketch. The least intimate condition 
portrayed equality in a public situation; the next 
level of intimacy depicted two people of equal status 
in a public situation of a type that implied repeated 
association; the next higher level showed two people 
of the same sex in a friendly relationship; and the 
fourth and most intimate level showed a fairly close 
heterosexual relationship. 


RESULTS 


The results of an analysis of variance of the 
rejection scores are presented in Table 1. The 

3 This variable is not discussed in the rest of the 
paper because it did not show any differences on the 
dependent variable. 


DETERMINANTS OF INTENDED BEHAVIOR TOWARD BLACKS 


TABLE 1 

REPEATED-MEASURES ANALYSIS OF VARIANCE OF 
REJECTION SCORES OF PAIRS OF SKETCHES 
WITH 11 SUBJECTS PER CELL 

Source                         df       MS          F 
Attitude (A)                    1   12.023   8.1088** 
Degree of exposure (B)          3     .345    .9556 
Degree of intimacy (C)          3    1.742   3.1762* 
A × B                           3     .557   1.5437 
A × C                           3     .098    .1795 
B × C                           9    1.117   3.1442** 
A × B × C                       9     .481   1.3536 
Subjects within groups (D)     40    1.483 
C × D                         120     .549 
B × D                         120     .361 
B × C × D                     360     .355 

Note.—The variance due to the confederate variable is not 
reported here because of a lack of relationship with other vari- 
ables; N = 44. 

*p < .05. 
**p < .01. 


means are presented in Table 2. Two of the 
main effects were clearly significant. The 
attitude effect provides support (p < .01) for 
the hypothesis that the more unfavorable a 
subject’s attitude toward blacks, the less will- 
ing he will be to pose with blacks in photo- 
graphs. The mean rejection score for the 
moderately anti-black subjects was .25, and 
the mean for the moderately favorable group 
was —.006, with a possible range of scores 
from —1 to +1. The effect of the degree of 
intimacy of the anticipated relationship was 
also significant as predicted (p < .05, see 
Figure 1). More specifically, as the degree of 
intimacy of the anticipated relationship in- 
creases, the number of rejections of inter- 
racial sketches relative to all-white sketches 
increases. Main effects for degree of exposure 


TABLE 2 
MEANS OF BEHAVIORAL INTENTION INDEX 


Degree of exposure 


Degree of 
intimacy 
1 2 3 4 
1 00 -30 .09 —.21 
2 05 14 =.07, ey a 
3 Al .05 -16 .23 
4 14 18 23 50 


Degree of intimacy 


205 14 .26 


Note.—The higher the number, the more rejections of inter- 
racial sketches. 
[Figure 1: mean score (vertical axis) plotted against degree of intimacy of an anticipated relationship (horizontal axis; the higher the number, the more intimate).] 

FIG. 1. Means of behavioral intention index of 
pairs of sketches for all subjects (N = 44). (The pos- 
sible range of scores is from +1 to −1. Each point 
on the line represents the mean of all subjects’ mean 
score across four trials, that is, over four levels of exposure.) 


and socioeconomic status of an anticipated 
black associate did not show stable trends. 
Figure 2 shows a first-order interaction 
effect (p < .01) between degree of exposure 
and degree of intimacy of an anticipated rela- 
tionship. For the highest degree of exposure, 
that is, a photograph in a Life magazine, un- 
willingness to pose in interracial scenes in- 
creases as the degree of intimacy of an antici- 
pated relationship increases; at the lowest 
degree of exposure, there is little difference 
among the mean rejection scores at the dif- 
ferent degrees of intimacy. At the two inter- 
mediate degrees, the relationship is unclear. 
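
The F ratios in Table 1 can be rechecked from the mean squares. The sketch below is an illustration rather than the author's computation: it assumes the usual pairing for a mixed design (the between-subjects effect tested against subjects within groups, each within-subjects effect against its own interaction with subjects), reads the 360-df error term as B x C x D on the strength of its degrees of freedom, and restores decimal points that the scanned table dropped.

```python
# Mean squares as read from Table 1 (decimal points restored; an assumption).
mean_squares = {
    "A": 12.023, "B": 0.345, "C": 1.742,
    "A x B": 0.557, "A x C": 0.098, "B x C": 1.117, "A x B x C": 0.481,
    "D": 1.483, "B x D": 0.361, "C x D": 0.549, "B x C x D": 0.355,
}

# Assumed error term for each effect (standard mixed-design pairing).
error_term = {
    "A": "D",
    "B": "B x D", "A x B": "B x D",
    "C": "C x D", "A x C": "C x D",
    "B x C": "B x C x D", "A x B x C": "B x C x D",
}

# F values as printed in Table 1.
reported_f = {
    "A": 8.1088, "B": 0.9556, "C": 3.1762,
    "A x B": 1.5437, "A x C": 0.1795, "B x C": 3.1442, "A x B x C": 1.3536,
}


def f_ratio(effect: str) -> float:
    """F = mean square of the effect / mean square of its error term."""
    return mean_squares[effect] / mean_squares[error_term[effect]]


for effect, reported in reported_f.items():
    print(f"{effect:10s} computed F = {f_ratio(effect):.4f}  reported F = {reported:.4f}")
```

With this pairing every recomputed ratio agrees with the printed value to within about .01, which is the size of discrepancy one would expect from rounding of the mean squares.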


Discussion 


The first important finding from this study 
is that attitude does significantly affect atti- 
tudinally relevant behavior. This replicates 
the findings of DeFleur and Westie (1958), 
extending their findings from extreme to 
moderate attitude groups. DeFleur and Westie 
(1958) found that extreme racial attitudes 
were positively related to behavior; this study 
adds the information that moderately equali- 
tarian subjects are more willing to pose with 
blacks than are moderately anti-black sub- 
jects. Given these two demonstrations of a 
positive relationship between attitudes and 


behavior, it is important to examine the fact 
that other studies have not shown such a 
relationship. 

Many field studies have found this lack of 
relationship, whereas laboratory studies, such 
as the present one, tend to show a definite 
positive relationship between these two vari- 
ables. There appear to be two possible reasons 
for this discrepancy. One is the lack of reli- 
ability of the attitudinal and behavioral mea- 
sures in the field studies. It is known that 
any measure based on a single response is 
likely to be unreliable. Many of the studies 
that found a lack of relationship between 
attitudes and behavior used single items as 
their measure. 

FIG. 2. Means of behavioral intention index of pairs 
of sketches for all subjects (N = 44). 

Another reason for the discrepancy may be 
that situational influences are more salient in 
field studies than in laboratory studies. The 
influences operating in the field might modify 
the responses to the attitudinal measure in 
a negative direction, and the influences oper- 
ating on the behavioral measure might modify 
it in the positive direction. To illustrate, the 
Saenger and Gilbert (1950) study used re- 
sponses to a telephone call as an indicator of 
attitude. The proprietor of a restaurant was 
asked whether reservations could be made for 
an interracial group. Influences that enter 
into this response to change it in an unfavor- 
able direction might be (a) norms relating 
to operating a restaurant, (b) norms relating 
to cautiousness of the respondent, (c) norms 
relating to concern for feelings of other cus- 
tomers, etc. Situational influences operating 
on the behavioral measure (whether or not 
an interracial group was served at the restau- 
rant) to change it in the positive direction 
might be the following: (a) embarrassment 
of the proprietor if he turned the customers 
away, (b) need of the proprietor for business, 
and (c) concern of the proprietor for the feel- 
ings of the group being hurt if he turned 
them away. 

Turning now to the second finding, that 
is, the degree of intimacy of an anticipated 
relationship, it is important to note the 
implications. The results indicate that as 
anticipated intimacy with the black or white 
person increases, the number of rejections of 
interracial pictures relative to all-white pic- 
tures increases. The unique contribution of 
this study with respect to the degree of inti- 
macy of the anticipated relationship is to 
enlarge on the results of these previous studies 
by showing that subjects regardless of racial 
attitude are less willing to pose with blacks 
as degree of intimacy with the black increases. 
This finding, of course, has been demonstrated 
only for subjects who are either moderately 
equalitarian or moderately anti-black, as 
shown by their responses to an attitude mea- 
sure; subjects having extreme attitudes toward 
blacks have not been examined. 

The significant interaction between degree 
of exposure and degree of intimacy documents 
DeFleur and Westie’s (1958) notion of 
reference-group ties affecting attitudinally 
relevant behavior. That is, as more reference 
groups become relevant with respect to a 
decision about interacting with a black of the 
opposite sex, other factors become less impor- 
tant determinants of attitudinally relevant 
behavior, and norms regulating reference 
group behavior exert more influence. 

Age trends in conformity and independence were investigated in an Asch-type 
conformity situation with 366 children from the first, fourth, seventh, and 
tenth grades. Three types of stimuli were used: visual judgments, opinion 
statements, and delay of gratification preferences. Age trends were observed 
for unanimous peer pressure, adult influence, and peer pressure with social 
support from a partner. Unanimous peer pressure produced, in general, 
decreasing conformity across age, being more pronounced for males than for 
females. Adult influence declined over age for both sexes. Social support from 
a partner who gave correct or modal answers reduced conformity to a constant 
low level across all ages. It was concluded that mechanisms of group influence 
are highly similar across age, and that age trends may reflect variations in 
situational factors associated with age differences. Hence, the variable of age 
may be of limited value, in itself, in studying peer influence. Direct investi- 
gation of situational variables associated with conformity—and only indirectly 
with age—might be a more useful research strategy. 


In a recent exhaustive survey of research 
on peer interaction, Hartup (1970) discussed 
available data dealing with age trends in so- 
cial influence. Concluding the section on peer 
influence, he stated that 


it is remarkable that the research is so thoroughly 
consistent in showing that the functional relation 
between age and peer conformity is curvilinear, and 
that middle childhood is generally the period of 
greatest responsiveness to normative influence from 
peers [p. 411]. 


These data from age-trend studies are in- 
terpreted by Hartup (1970) as supporting 
Piaget’s (1932) conception of social stages. 
Piaget suggested that children pass from a 
presocial or egocentric stage to a stage of 
absolute conformity to social rules lasting 
from age 5 to 8. The second stage gradually 
gives way around age 10 to essentially mature 
conceptions of social norms. From this stage 
conception, one could predict increasing con- 
formity to peer influence in early childhood, 


1 The research reported herein was performed pur- 
suant to a contract with the United States Office of 
Education, Department of Health, Education, and 
Welfare, under the provisions of the Cooperative 
Research Program (Center No. C-03/Contract OE 
5-10-154). We are particularly grateful to Judy 
Wood Turner for her expert help in collecting and 
analyzing data reported in this study. 

2 Requests for reprints should be sent to Vernon L. 
Allen, Department of Psychology, University of 
Wisconsin, Madison, Wisconsin 53706. 
peaking at ages 5-8, then decreasing and re- 
maining at a relatively low level. A closer 
examination will disclose some difficulties in 
the application of Piagetian theory to the area 
of conformity. 

By adapting Piaget’s (1932) theory of 
moral development to conformity, Costanzo 
and Shaw (1966) offered an explanation for 
their curvilinear findings. The period of 
greatest conformity in Costanzo and Shaw’s 
study occurred at ages 11-13, yet they used 
Piaget’s description of the stage occurring at 
ages 5-8 for an explanation of this high con- 
formity. Piaget’s stage explanation was ap- 
plied to the wrong age. Furthermore, a care- 
ful reading of Piaget indicates that implica- 
tions of his ideas for conformity are quite 
equivocal. According to Piaget (1932), in the 
second stage of social development (ages 5-8), 
the child 


begins to want to play in conformity with certain 
rules received from the outside. . . . From the mo- 
ment that the child begins to imitate the rules of 
others, no matter how egocentric in practice his play 
may be, he regards the rules of the game as sacred 
and untouchable; he refuses to alter those rules and 
claims that any modification, even if accepted by 


general opinion, would be wrong [p. 54, italics 
added]. 


Contrary to the interpretations of Costanzo 
and Shaw (1966) and of Hartup (1970), 
then, Piaget’s theory does not imply that 
susceptibility to influence will be high, but 
that children at this stage (ages 5-8) will be 
extremely rigid in following rules—even to 
the point of being independent of peer influ- 
ence attempting to change the rules. Hence, 
whether one can conclude, on the basis of 
Piaget’s theory, that conformity should be 
high or low at this stage (ages 5-8) remains 
somewhat unclear. Norms and rules seem to 
be very explicit at this early age, which makes 
conformity pressure easier to exert. Greater 
adherence to norms at this time might occur, 
then, without necessarily being the result of 
greater “influenceability” at this age. Piaget’s 
description of the nature of norms at this 
stage also clearly suggests that under certain 
conditions the child will be independent of the 
group. 

In Hartup’s (1970) review, some inconsis- 
tencies in the existing data concerning the 
curvilinear relation predicted (perhaps incor- 
rectly) from Piagetian theory are noted: peak 
conformity occurring as early as 9 years and 
as late as 15 years has been reported. Hartup 
used three studies (Costanzo & Shaw, 1966; 
Iscoe, Williams, & Harvey, 1963, 1964) as a 
basis for concluding that age and conformity 
are related in a curvilinear fashion. These 
three studies deserve further critical discus- 
sion. 

Iscoe et al. (1963), in a normative study of 
the age-conformity relation, found that con- 
formity on a click-counting task increased 
from ages 7-9 to 11-13 for both males and 
females. Males showed increasing conformity 
up to age 15, and conformity decreased for 
females from the 11-13 peak to age 15. A 
second study by the same authors failed, 
however, to replicate this sex difference (Iscoe 
et al., 1964). 

Both Negro and white children partici- 
pated in the second study. Combining scores 
across all subjects, mean conformity over age 
was curvilinear, peaking at age 12. But an 
analysis of variance yielded a significant in- 
teraction between age and race. Examination 
of the age-conformity relation separately by 
race showed the expected curvilinearity only 
for Negro subjects (Iscoe et al., 1964, p. 457, 
Figure 3). Conformity for Negro children in- 
creased from age 7 to 9, and then decreased. 
For white children, replication of the 1963 


study was, however, questionable; conformity 
decreased from age 7 to 9, increased to age 12, 
and then decreased to age 15. These differ- 
ences appear to be quite small, but mean tests 
were not reported. It is possible that the sta- 
tistical significance of these findings (only F 
tests were reported) was due entirely to age 
changes occurring in the Negro sample. Both 
the 1963 and 1964 studies reported higher 
conformity for white females than for white 
males, but the second study found a reversal 
of this sex difference for Negroes. 

In the third study used by Hartup (1970) 
in his review, Costanzo and Shaw (1966) re- 
ported a strong curvilinear relation between 
age and conformity. Conformity increased 
from ages 7-9 to 11-13, with a continuous 
decrease thereafter through ages 15-17 and 
19-21. In addition, questionnaire data were 
obtained on the subjects’ perception of cause 
of disagreement between themselves and other 
group members. Frequency of reasons at- 
tributed to self for disagreement with the 
group closely paralleled the conformity-age 
relation, yielding a point-biserial correlation 
of .87 with conformity. Costanzo and Shaw 
interpret their findings as being consistent 
with Piaget’s analysis. They suggest that the 
data reflect an increasing awareness of social 
pressure up to the age of puberty, with a 
corresponding decrease in self-confidence. De- 
crease in conformity in postadolescence and 
early adulthood results, in their view, from 
increased self-confidence as one learns to cope 
effectively with his social environment. 

The Costanzo and Shaw (1966) results rep- 
resent the strongest and clearest evidence in 
the literature for a curvilinear relation be- 
tween age and conformity: no significant sex 
differences appeared, and results were nicely 
buttressed with plausible questionnaire data. 
Unfortunately, there is a serious methodo- 
logical problem in the study. Using a line- 
judgment task, Costanzo and Shaw reported 
only two errors in their entire sample on a 
total of 480 preexperimental, nonpressure 
trials. They assert that “This result indicates 
that the errors made under pressure conditions 
cannot be attributed to the difficulty of the 
task [pp. 970-971].” 

A careful examination of Costanzo and 
Shaw’s (1966) procedure shows that the con- 
clusion is unjustified. In groups of four, sub- 
jects received general instructions on using 
a modified Crutchfield conformity apparatus. 
Following these instructions, “each person was 
assigned the number ‘4’. When the Experi- 
menter was sure that everyone understood the 
instructions, five practice trials were adminis- 
tered ... [p. 970].” All trials were appar- 
ently presented with the subject answering 
fourth, preceded by three simulated subjects 
giving the correct answers, since five “non- 
pressure” trials for each of the 96 subjects 
yields the 480 judgments reported. Task 
difficulty was therefore not controlled: con- 
formity to the simulated group’s correct an- 
swers on these trials could have masked age 
differences in task difficulty or in attention. 
Difference in the length of lines was quite 
small (4 inch)—which suggests that the dis- 
crimination task may have been somewhat 
difficult. Information on task difficulty is cru- 
cial for interpretation of the results, since the 
conformity measure was simply the average 
number of errors in discrimination at each 
age level, rather than the difference score be- 
tween responses given alone and under group 
pressure. 

Subsequent to Hartup’s review of the 
literature, two other pertinent experiments 
have appeared by Hoving and his co-workers 
(Hamm & Hoving, 1969; Hoving, Hamm, & 
Galvin, 1969). Results of these studies seri- 
ously question Hartup’s conclusions regarding 
curvilinearity of the age-conformity relation. 
Hoving and his colleagues noted inconsisten-
cies in age-conformity results across studies;
for example, both Berenda (1950) and Hov- 
ing (1964) have reported a linear decrease 
in conformity across limited age ranges, in 
contrast to results of studies cited earlier. 
Comparison of tasks employed in the several 
studies led Hoving et al. (1969) to suggest a 
means of resolving these conflicting results. 
They noticed that one factor has apparently 
varied considerably across studies—degree of 
ambiguity of the task. Ambiguity was defined 
as the clarity with which the pressure group’s 
position differed from the objectively correct 
alternative. Hoving et al. designed a study
using a visual judgment task and varied dis- 
criminability of the objectively correct alter- 
native. Results disclosed that (a) on an un- 


VERNON L. ALLEN AND DARREN NEWTSON


ambiguous task, conformity decreased through 
Grades 2, 5, and 8; (b) on an ambiguous 
task, conformity was, however, positively re-
lated to age; and (c) on a task of intermedi-
ate ambiguity, the conformity-age relation
was curvilinear, similar to that reported by
Costanzo and Shaw (1966) and Iscoe et al.
(1963). 

The measure of conformity in the Hoving 
et al. (1969) study was mean judgment error 
under group pressure corrected by mean error 
on a nonpressure control series, where each 
subject answered alone. For the completely 
ambiguous condition, “error” correction was 
therefore somewhat arbitrary. Because of this 
problem, Hamm and Hoving (1969) repli- 
cated the completely ambiguous condition us- 
ing autokinetic judgments as the task. Sub- 
jects were 7-, 10-, and 13-year-olds. As in the 
first study, results showed that conformity 
increased with age. 

Hoving et al. (1969) interpreted their re- 
sults in terms of “normative” and “informa- 
tional” motivational systems. They posited a 
“need to be correct” and a “need for peer ap-
proval,” both of which are assumed to increase
with age. These needs presumably reflect the 
accumulation of experience gained through
social learning. The observed decrease in con- 
formity over age on unambiguous items is due 
to the increasing need to be correct; the in- 
crease in conformity over age on ambiguous 
items is a function of increasing need for peer
approval. Degree of ambiguity of the task is
important in understanding the age-conform-
ity relation, according to Hoving et al., be-
cause older subjects view the two motiva- 
tional systems as being interrelated. That is, 
being correct is increasingly linked to peer 
approval. In the absence of an obviously cor- 
rect answer, older subjects can serve both 
needs by conforming; when a clearly correct 
answer is available, older subjects can serve 
both needs by remaining independent. 

These investigators stop short, however, of
offering a clear interpretation for the inter- 
mediate ambiguity condition. They suggest 
that a partially ambiguous judgment reduces 
the need to be correct, and at the same time 
is compatible with the need for approval. It is 
not clear how this explanation could account 
for a curvilinear relation between conformity
and age without additional assumptions about
rate of increase in approval and accuracy
needs or consequences of conflict between the
two needs.

Several important issues concerning the 
age-conformity relation are still unresolved.
Conformity to peer influence does not appear 
to be a simple, curvilinear function of age as 
concluded by Hartup (1970); the shape of 
the relation changes quite dramatically as a 
function of the nature of the task, as Hov-
ing’s research nicely demonstrates. A close
correspondence between peer influence and
developmental stages, which could be pre-
dicted from Piaget’s (1932) theory, has not
yet been convincingly demonstrated. Another
problem concerns the psychological processes
underlying social influence at different stages
of development. As Hartup (1970) pointed
out: “We do not yet know whether there are
unique forms of social influence occurring in
the interaction of children with age-mates [p.
404].” The present study was designed to
further our understanding of these issues by
extending and improving previous research.
Four issues of theoretical importance were 
investigated. 

First, the relation between age and con- 
formity to peer-group norms was scrutinized 
once again, but methodological improvements 
were introduced that should increase confi- 
dence in results. In earlier studies the group 
typically gave the same absolute answers 
across all age levels. Difference in subjects’
accuracy or understanding of the task at dif-
ferent ages would result in variation across 
age levels of distance between the pressure 
group and the subject. Several studies have 
shown that distance of the group norm from 
the individual’s own position strongly affects 
degree of conformity (Allen, 1965). In previ- 
ous developmental studies of conformity, the 
position of the group norm has not been di- 
rectly controlled across age. Hoving et al. 
(1969) attempted to control this factor by 
pretesting and by using items with similar er-
ror rates across age. In the present experiment
this variable was held constant by maintain- 
ing, at each age level, a constant relative psy- 
chological distance between the position of 
the group and the modal score obtained from 
subjects answering alone (privately).

Second, effectiveness of social support in
reducing conformity was investigated across 
age. Social support, that is, presence of a 
partner agreeing with the subject in opposi- 
tion to the pressure group, dramatically re- 
duces conformity for adults, but has not been 
investigated with children. Independence pro- 
duced by social support is one of the most 
stable findings in the conformity literature, 
and has cast considerable light on the mech- 
anisms underlying group influence (Allen & 
Levine, 1968, 1969, 1971a; Asch, 1951). 
Marked differences in effectiveness of social
support in inducing independence across age
levels would suggest that different psycho-
logical processes underlie conformity at dif- 
ferent ages. 

Third, opinion statements and delay of 
gratification preferences were included as 
stimuli, as well as visual judgments. Previous 
studies have employed only objective, skill- 
related tasks such as visual judgments. It is 
unclear how such items relate to the dimen- 
sion of task ambiguity. Hoving et al.’s (1969) 
definition of ambiguity is itself rather ambig- 
uous in failing explicitly to distinguish be- 
tween two aspects of ambiguity. The critical 
aspect of task ambiguity that interacts with 
age could be either (a) degree of objective 
correctness (and, hence, empirical verifiabil-
ity) of responses or (b) clarity with which the
group’s position differs from the subject’s own 
position. That is, a task may be ambiguous 
with respect to physical or social referents. In
using items of low empirical verifiability, such 
as opinion and delay of gratification items, 
and by placing the group norm at a position 
clearly discrepant from the subject’s position, 
the present study can help resolve this issue. 

Fourth, the present study compares the 
relative effect of peer influence and adult or 
authority influence. Age-trend data on adult 
influence that can be directly compared to 
peer influence have not been obtained on 
value-free judgments (Hartup, 1970). A com- 
parison of trends in peer and adult influence 
may help explain patterns of social influence 
across age. Similar patterns over age for con- 
formity to peers and adults would suggest 
the operation of a general “influenceability” 
factor. Lack of correspondence over age be-
tween conformity to peer and adult influence
sources would suggest that a differential
change across age in the social relationship 
between the individual and the two influence 
sources might account for age trends. 


METHOD 
Subjects 


Subjects were 366 children enrolled in the first, 
fourth, seventh, and tenth grades of the Oconomowoc, 
Wisconsin, school system.3 The sample included 58 
first graders (29 males and 29 females), 110 fourth 
graders (55 males and 55 females), 114 seventh 
graders (59 males and 55 females), and 84 tenth 
graders (44 males and 40 females). 


Apparatus


A standard Crutchfield (1955) apparatus was used 
to measure conformity. The experimenter controlled 
the signal lights (five rows and nine columns) ap- 
pearing in the subjects’ booths. Each of the five sub- 
jects believed that he alone answered last in turn 
in the group; actually, the experimenter simulated 
the first four responses in a prearranged manner.


Design 


A three-factor analysis of variance design was 
used. The three factors were (a) grade (first, fourth,
seventh, and tenth), (b) sex of subject (male, fe- 
male), and (c) experimental condition (unanimous 
group, social support, adult pressure). In addition, 
three types of stimuli were used (visual, opinion, 
delay of gratification). Each subject participated in 
only one of the three experimental conditions de- 
scribed below. 

In the unanimous group condition, the group of 
four (simulated) persons gave extremely incorrect 
or unpopular answers on half of the trials (pressure 
trials) prior to the subject’s turn to answer. On the 
other half of the trials, answers of the group were 
correct or popular. 

In the social support (partner) condition, the first 
three persons in the group gave the same extremely 
incorrect (or unpopular) answers on pressure trials 
as in the unanimous condition, but the simulated 
subject in Position 4 gave the correct or popular 
answer, in disagreement with the other three sub-
jects. On neutral trials, all four simulated subjects
gave the correct or popular answer. 

In the adult pressure condition, the following 
instructions were given to the subjects: 


To make this more interesting for you, before you 
give your answers I will tell you how three ____
grade teachers [same grade as subject] answered 
the same questions. We gave them the same
questions you will answer. After you hear me give
your teacher’s answer, please turn on the switch
which indicates the answer you think is correct.


3 The authors would like to thank the adminis-
tration and teachers of the Oconomowoc, Wisconsin,
school system for their aid and cooperation in con-
ducting this study, and especially Don Kramer, Di-
rector of District Secondary Education.


After reading the question for each slide, the experi- 
menter called aloud the teachers’ alleged answer. On 
pressure trials the experimenter announced the same 
extremely incorrect or unpopular answers given by 
the group in the other two conditions. For the other 
half of the trials, the correct or popular answer was 
announced.


Stimuli 


A series of 24 items was used: 12 visual, 6 opin- 
ion, and 6 delay of gratification. Half of the items 
of each type received group pressure, and half were 
neutral. Thus, 12 neutral and 12 pressure items were
distributed evenly across the stimulus series. 

Visual items required the subjects to match a 
standard visual stimulus with one of nine compari- 
son stimuli. For example, a vertical line was matched 
in length with one of nine similar comparison lines. 
Visual items were answered by choosing one of the 
nine switches corresponding to the nine comparison 
stimuli. Opinion items consisted of statements such
as “Kittens make good pets” or “On weekends, stu-
dents my age should be allowed to stay up later
than on school days.” An example of a delay of
gratification item is, “I would rather have 50¢ today
than $1 tomorrow.” Opinion and delay of gratifica-
tion items were answered by pressing one of five 
switches labeled from “strongly agree” to “strongly 
disagree.” To prevent confusion, the five answers 
also appeared on each of the opinion and delay of 
gratification slides. 

In order to construct group pressure responses of 
constant psychological distance across age groups, 
standardization data were collected prior to the ex- 
periment. Visual, opinion, and delay of gratification 
items were administered to subjects at all grade
levels. After obtaining the response distribution at 
each grade level, items having a low variance were 
selected as group pressure items. For visual items, 
the simulated group’s answer on pressure trials was 
located 1 scale point (on the 9-point scale) beyond 
the 99th percentile of the standardization distribu- 
tions, calculated separately by grade. For opinion 
and delay of gratification items, the pressure group’s 
answer was placed at a point beyond the 99th 
percentile of the distribution whenever possible, but 
due to greater variability on these items and the 
truncated (5-point) scale, the group’s response some- 
times had to be located at the 95th percentile of 
the class distribution. For neutral items, the group’s 
answer was located at the modal position of answers 
given by the standardization group. 
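
The placement rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the nearest-rank percentile definition, the choice of the upper tail, the cap at the end of the scale, and all function names and sample data are hypothetical and are not taken from the study.

```python
from collections import Counter


def nearest_rank_percentile(values, pct):
    """Smallest observed value with at least pct percent of responses at or below it."""
    ordered = sorted(values)
    rank = max(1, -(-pct * len(ordered) // 100))  # integer ceiling of pct * n / 100
    return ordered[rank - 1]


def pressure_position(responses, pct=99, step=1, scale_max=9):
    """Group answer placed `step` points beyond the percentile, capped at the scale end.

    Assumption: pressure is applied toward the upper end of the scale.
    """
    return min(nearest_rank_percentile(responses, pct) + step, scale_max)


def neutral_position(responses):
    """Group answer on neutral trials: the modal standardization response."""
    return Counter(responses).most_common(1)[0][0]


# Hypothetical standardization data for one visual item at one grade level.
grade_responses = [4] * 60 + [5] * 35 + [6] * 5
print(pressure_position(grade_responses))  # 99th percentile is 6, so the group answers 7
print(neutral_position(grade_responses))   # modal answer is 4
```

Because the percentile is computed separately for each grade, the same item can receive a different absolute group answer at each age level while keeping the same relative distance from the private responses.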

Order of items was identical for the three experi- 
mental conditions, but the exact location of group 
pressure differed somewhat from grade to grade, 
due to application of the criterion described above. 
Methodological superiority of this technique of 
locating the group response, as contrasted to pro- 
cedures used in other studies, should be noted. 


Standardizing the location of the group pressure
response according to private answers at each age
level assures that psychological distance and varia-
bility are controlled, a particularly important con-
sideration if one wishes to compare data across 
different age groups. 


Procedure 


All subjects participated in two sessions approxi-
mately 1 week apart. In the first session the experi-
menter was male; in the second, female. During the 
first session, stimuli were presented in class, and the 
subjects recorded their responses privately in a 
booklet. The experimenter was introduced as making
a statewide school survey. It was emphasized that
the procedure was not a test, and that teachers
would not know the answers given.

Approximately 1 week after the first session, stu-
dents were summoned from the classroom in same-
sex groups of five. Subjects were randomly selected 
from the classroom, and each group was randomly 
assigned to one of three experimental conditions. 
Subjects in the unanimous group and social support 
conditions were seated at the five booths of the con- 
formity apparatus, and they then received instruc-
tions and four practice trials. Particular care was
taken to ensure that the subjects in lower grades
clearly understood the task and operation of the
apparatus. In the unanimous group and social sup-
port conditions, the subjects were placed in the fifth
(last) answering position throughout the experi-
mental session, and responses shown on all subjects’
panels were henceforth supplied by the experimenter.
As each slide was presented, the experimenter read
the appropriate question or statement. In the adult
pressure condition, the subjects were simply given
instructions on using the apparatus; they did not see
responses of other group members.

Following presentation of the 24 items, a question-
naire was administered that assessed (in appropriate
conditions) perception of the social supporter, reac-
tions to other group members, and knowledge of
deception. The tenth, seventh, and fourth graders 
answered the questionnaire themselves; for the first- 
grade subjects, two female interviewers were used. 
All subjects were thoroughly debriefed. 


Method of Analysis


Data consisted of a mean conformity score for 
each subject, calculated separately for each of the 
three types of items. This score was based on six 
critical items for visual stimuli and on three for 
opinion and delay of gratification stimuli. Scores 
were obtained by taking the difference in scale points 
between the subject’s public answer to an item under 
group pressure, and his private answer on the same 
item given during the earlier standardization test.
If the subject shifted his answer toward the group
pressure position, it was scored as positive; if he
moved away from the group, it was scored as nega- 
tive. The algebraic mean of item scores was then 
computed by type of item. 

When a subject’s answer on the standardization 
session fell on or beyond the pressure point on an
item, his score for that item only was excluded from 
the analysis, since he was not subjected to group 
pressure. This occurred 10 times in the first grade, 
11 times in the fourth, 8 in the seventh, and once in 
the tenth grade, and involved 16 females and 14 
males. No subject was dropped completely from the 
analysis.
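
The scoring and exclusion rules above can be illustrated with a short Python sketch. It is a minimal reading of the procedure, not the authors' computation: the `mode` argument (used here to decide on which side of the scale the group position lies), the tuple layout, and the sample values are all hypothetical assumptions.

```python
from statistics import mean


def item_score(private, public, pressure, mode):
    """Signed shift in scale points for one pressure item.

    Positive means movement toward the group position, negative away from it.
    Returns None when the private answer already lies on or beyond the pressure
    point, in which case the item is excluded for that subject.
    Assumption: `mode` (the modal private answer) fixes the direction of pressure.
    """
    direction = 1 if pressure > mode else -1  # side of the scale on which the group sits
    if (private - pressure) * direction >= 0:
        return None
    return (public - private) * direction


def mean_conformity(items):
    """Algebraic mean of the retained item scores for one subject and item type."""
    scores = [s for s in (item_score(*item) for item in items) if s is not None]
    return mean(scores) if scores else None


# Hypothetical subject: (private, public, pressure, mode) on a 9-point visual scale.
visual_items = [(4, 6, 9, 4), (5, 4, 9, 5), (9, 9, 9, 4)]
print(mean_conformity(visual_items))  # mean of +2 and -1; the third item is excluded
```

Excluding only the affected item, rather than the subject, mirrors the statement that no subject was dropped completely from the analysis.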

A harmonic-means analysis of variance was per- 
formed on mean conformity scores for each type of 
item, and a trend analysis (Grant, 1956) was con- 
ducted across the four grades within each condition. 
Questionnaire data were coded by judges, and fre-
quency counts were made for the four grades.
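
A trend analysis over four equally spaced grades can be sketched with orthogonal polynomial contrasts. The Python below is an assumption-laden illustration: it uses the standard coefficients for four levels and substitutes the harmonic mean of the group sizes for a common n, which is one conventional way to handle unequal groups; it is not claimed to reproduce the Grant (1956) procedure step by step, and the means, group sizes, and error term shown are hypothetical.

```python
from statistics import harmonic_mean

# Orthogonal polynomial coefficients for four equally spaced levels.
COEFFS = {
    "linear": (-3, -1, 1, 3),
    "quadratic": (1, -1, -1, 1),
    "cubic": (-1, 3, -3, 1),
}


def trend_f_ratios(grade_means, grade_ns, ms_error):
    """F ratio (1 df each) for the linear, quadratic, and cubic components."""
    n_h = harmonic_mean(grade_ns)  # stands in for a common group size
    ratios = {}
    for name, coeffs in COEFFS.items():
        contrast = sum(c * m for c, m in zip(coeffs, grade_means))
        ss = n_h * contrast ** 2 / sum(c * c for c in coeffs)
        ratios[name] = ss / ms_error  # each component has 1 df, so MS equals SS
    return ratios


# Hypothetical means rising by one point per grade, 10 subjects per grade.
print(trend_f_ratios([1.0, 2.0, 3.0, 4.0], [10, 10, 10, 10], 2.0))
# linear F is 25.0; quadratic and cubic are 0.0
```

A U-shaped set of means would instead load on the quadratic coefficients, which is the pattern at issue in the curvilinearity debate.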


RESULTS 


Summary tables for analyses of variance are
reported in Table 1. Highly significant grade
and conditions effects were obtained with all
three types of stimuli. In addition, visual
items yielded a Condition × Sex interaction,
and delay of gratification items yielded a Sex
× Grade interaction. The three-way interac-
tion for delay of gratification preferences,
Conditions × Grade × Sex, reached signifi-
cance. Analysis of variance on opinion items
yielded a significant Conditions × Grade in-
teraction.


TABLE 1

ANALYSIS OF VARIANCE OF MEAN CONFORMITY SCORES

                        Visual items          Opinion items         Delay of gratification items
Source           df     MS           F        MS           F        MS        F
Grade (A)         3     4.30         10.55    12.75        18.90    4.78      3.35
Condition (B)     2     3.47         8.50***  10.42        15.44    10.57     [illegible]
Sex (C)           1     .17          -        .36          -        .60       -
A × B             6     .76          1.86     2.95         4.37     1.21      -
A × C             3     [illegible]  1.90     [illegible]  -        6.52      [illegible]
B × C             2     1.75         [illegible]  .64      -        .32       -
A × B × C         6     .16          -        [illegible]  1.67     3.35      2.35*
MSe             343     .41                   .67                   1.42

* p < .05.
** p < .025.


Conformity to Peers 


The unanimous group condition contains 
data relevant to the controversy about change 
in conformity over age. Due to the presence 
of sex interactions with grade on visual and 
delay of gratification items, trend analyses 
were performed separately for each sex. Fig- 
ures 1 and 2 present mean conformity over 
age for males and females, respectively. Table 
2 summarizes results of trend analyses on 
these means. It should be noted that wide 
differences in variance were observed for the 
three types of items in the unanimous group 
condition (.54 for visual items; 1.08 for opin- 
ion items; and 1.59 for delay of gratification 
items). 

As shown in Figure 1, conformity on visual
and opinion items decreased for males from
the first to the seventh grade, and then in-
creased slightly at the tenth grade. Trend
analysis (Table 2) yielded significant quad-
ratic effects for male conformity on visual
(p < .001) and opinion items (p < .025).
Conformity on visual judgments also yielded
a linear component of marginal statistical sig-
nificance (p < .10). Trend analysis of con-
formity on delay of gratification preferences
failed to detect any significant components.

FIG. 1. Male conformity to peers by age.

FIG. 2. Female conformity to peers by age.

Mean differences in conformity across age 
for males were tested by the Newman-Keuls 
technique of multiple comparisons (Winer, 
1962). Statistically significant differences in
conformity were found on visual judgments
between the first and fourth grades (p < .01),
between the first and seventh grades (p <
.01), and between the first and tenth grades
(p < .05). On opinion statements, conformity
for first-grade males was significantly differ-
ent from fourth-grade (p < .05) and seventh-
grade males (p < .05). Differences between
mean conformity in the tenth grade and other
grades were not significant.

For females, conformity showed slight de- 
creases over age on visual and opinion items 
(Figure 2). Trend analysis failed to yield any 
significant components for visual, opinion, or
delay of gratification items (Table 2). 


Social Support


Table 3 presents mean conformity reduction
produced by the presence of social support
at each of the four ages studied. Data in
Table 3 represent the differences in means 
between conformity in the unanimous condi- 
tion and in the social support condition. 
Larger numbers indicate more independence, 
that is, greater reduction in conformity. So-
cial support significantly reduced conformity
on opinion items at all age levels. Results for 
visual and delay of gratification items were 
less consistent. Conformity reduction on visual 
items failed to reach acceptable levels of sta- 
tistical significance for fourth-, seventh-, and 
tenth-grade males, and for seventh- and 
tenth-grade females. On delay of gratification 
items, social support failed to yield significant 
conformity reduction for fourth- and tenth- 
grade males, and for first- and tenth-grade 
females. 

Two lines of evidence suggest, however, 
that inconsistencies in these results reflect a 
“bottoming” effect of conformity in the 
unanimous group condition. First, examination
of the unanimous group curves (Figures 1
and 2) confirms that without exception, social
support failed to reduce conformity signifi-
cantly only at the lowest points on the curves.
If social support fails to yield significant con-
formity reduction at a given level of conform-
ity, it also fails at all other points below that
level. Second, means tests within the social
support condition failed to yield a single sig-
nificant, or near significant, difference across
age. Therefore, the low level of conformity
produced by social support does not change
significantly across age. This evidence sup-
ports the assertion that failure of social sup-
port to reduce conformity significantly in the
present study simply reflects presence of lev-
els of conformity so low that further reduc-
tion is not detectable.


TABLE 2

TREND ANALYSIS OF UNANIMOUS GROUP INFLUENCE
BY AGE FOR MALES AND FEMALES

                       Visual items   Opinion items   Delay of gratification items
Group          df      F              F               F
Males
  Linear       1/68    3.15*          1.05            1.98
  Quadratic    1/68    12.50***       6.01            2.59
  Cubic        1/68    .98            [illegible]     1.60
  MSe                  [illegible]    .99             1.74
Females
  Linear       1/63    1.66           .18             1.53
  Quadratic    1/63    .43            1.17            2.65
  Cubic        1/63    .13            .06             .55
  MSe                  .82            1.13            1.46

* p < .10.
*** p < .001.


TABLE 3

CONFORMITY REDUCTION BY SOCIAL
SUPPORT ACROSS AGE

                       Type of item
Grade      Sex         Visual         Opinion a       Delay of gratification
First      Males       .45*           [illegible]     1.62
           Females     .80***                         .01
Fourth     Males       .24            [illegible]     .31
           Females     .40**                          .84**
Seventh    Males       [illegible]    .30*            .69*
           Females     .26                            .68*
Tenth      Males       .24            .64             .22
           Females     .20                            .48

Note. Data represent the difference between the unani-
mous group and social support condition means. Fisher’s least
significant difference technique was used to test differences. All
tests are one-tailed.

a Data were combined across sex due to lack of significant
sex interactions in the analysis of variance on opinion items.


Adult Influence


Table 4 presents results of trend analyses
performed for each sex on adult influence
scores. Means for males and females are pre-
sented graphically in Figures 3 and 4, respec-
tively. Visual items yielded significant quad-
ratic trends for males and for females (p <
.05 for both). Adult influence decreased
sharply from the first to the fourth grade, and
then increased slightly in the tenth grade.
Tests of means indicated that both quadratic
trends reflected the steep decline of influence
from first to fourth grades; the first grade in
both cases differed significantly from other
grades, which did not differ significantly from
each other.


TABLE 4

TREND ANALYSIS OF ADULT INFLUENCE BY AGE FOR
MALES AND FEMALES

                       Visual items   Opinion items   Delay of gratification items
Group          df      F              F               F
Males
  Linear       1/52    2.66           [illegible]     1.90
  Quadratic    1/52    6.14**         8.92            .96
  Cubic        1/52    .06            1.28            .61
  MSe                  .50            [illegible]     1.60
Females
  Linear       1/49    2.81*          3.68*           .12
  Quadratic    1/49    5.97**         12.78***        1.39
  Cubic        1/49    1.54           .68             1.17
  MSe                  .33            .43             1.42

* p < .10.
** p < .05.
*** p < .001.

A similar pattern of adult influence ap- 
peared for opinion items, but even more 
strongly. Female data showed a strong quad- 
ratic component (p < .001), with nonsignifi- 
cant linear and cubic components. Adult 
influence declined to a low level from the first 
to the fourth grade, remained low through 
the seventh grade, and then rose slightly at 
the tenth grade. Male data showed significant 
linear and quadratic components (p < .01
for both). Effects of adult influence on male 
opinion statements declined from a higher 
level in the first grade than for females, and 
failed to show the slight rise in the tenth 
grade that appeared in the female data. The 
significant quadratic component appears to 
reflect the precipitous drop in influence from 
the first to the fourth grade; significance of
the linear component is apparently due to the
continuing decrease.

As in the unanimous group condition, large 
variability occurred with delay of gratifica- 
tion items. Despite large mean differences on 
these items, especially for males, no signifi- 
cant trends were detected. 


Postexperimental Questionnaire 


Costanzo and Shaw (1966) reported a close 
relation between amount of conformity and 
self- or other-attribution of reason for dis- 
agreement with the pressure group; a strong
positive correlation existed between conform-
ity and blame of self for disagreement. In
the present study, reasons given for disagree-
ment with the majority were coded into four
categories: (a) self-blame; for example, “I
was wrong” and “I couldn’t see well.” (b)
Other-blame; for example, “They were crazy”
and “They were goofing off.” (c) People dis-
agree; for example, “Different people see
things differently” and “They have their opin-
ions, I have mine.” (d) Don’t know and un-
classifiable. In contrast to Costanzo and
Shaw’s finding, self-blame responses were so
few as to require combining with the miscel-
laneous category for analysis.

In the unanimous group condition, re- 
sponses increased with age for the category 
people disagree: 35% of first-grade and 73% 
of tenth-grade responses in this category 
(χ² = 13.22, df = 6, p < .05). In the adult
influence condition, responses increased with
age for the other-blame condition. Only 3%
at the first grade said their teacher was wrong,
but 37% at the fourth grade, 60% at the
seventh grade, and 63% at tenth grade
(χ² = 17.2, df = 6, p < .01) did so.

Another question asked subjects how they 
thought the group had reacted to them. Older 
subjects decreasingly gave a simple “They 
thought I was wrong” attribution in both the 
unanimous group and social support condi-
tions. Frequency of this category declined
from about 50% in the first grade to 15%
in the tenth grade (χ² = 16.31, df = 6,
p < .02, for the unanimous group; χ² = 12.01,
df = 6, p < .10, for social support).

Subjects in the social support condition 
were asked (a) if they perceived the part- 
ner’s agreement; (b) if so, to identify him by 
number; and (c) to give a reason for that 
agreement. Results indicated that subjects
were aware of agreeing more often with one
person uniformly across age (χ² = 14.10,
df = 6, p < .05), although younger subjects
had difficulty remembering who he was.
Reasons given for the partner’s agreement 
suggested that older subjects were more aware 
of the implications of pressure on opinion and 
delay of gratification items than younger sub-
jects. Frequency of responses in the category
“We Think the Same, or We are Similar”
increased constantly across grades, from
29% in the first grade to 76% in the tenth
(χ² = 20.15, df = 6, p < .01).

A final question assessed suspicion of the 
experimental deception. A lenient criterion of 
suspicion was adopted—responses identifying 
the purpose of the experiment as being, “To
test for the influence of others,” for example,
were classified as suspicious. Only eight some-
what suspicious subjects were identified in
the unanimous group and social support con-
ditions combined, and seven of these were in 
the tenth grade. The adult influence condition 
fared much worse in the tenth grade, with 
43% of the subjects classified as suspicious. 


DISCUSSION 


For visual and opinion items, results of
the present experiment were quite clear con-
cerning trends across age for peer-group
pressure, social support, and adult influence.
Conformity to unanimous peer-group influ- 
ence decreased over age, though reaching 
significance only for males. This finding 
strongly supports data reported by Hoving 
et al. (1969). The continuous decrease 
through Grades 1, 4, and 7 is entirely con-
sistent with their data for Grades 2, 5, and 8
on unambiguous tasks. Present results showed
further that social support from a partner
produced nonconformity at all age levels.
Data also revealed clearly that effect of adult
influence decreased sharply over age for both
sexes. Results indicating the weak impact of
adults on older children in our sample
(Grades 7 and 10) are consistent with find-
ings of other studies (Bowerman & Kinch,
1959; Coleman, 1961). Finally, our data
showed no significant age trends as a func-
tion of peer or adult pressure on delay of
gratification items, although social support
did reduce conformity on these items.4


Stage-Trait Hypotheses 


The generally accepted explanations of age 
changes in conformity have been based on 
stage conceptions: degree of influenceability 
is supposedly determined by the stage the 
child has reached. Costanzo and Shaw (1966) — 
and Hartup (1970) have interpreted Piaget’s — 
(1932) stage theory of moral development as 
accounting for the observed curvilinear rela- 
tion between age and conformity, They posit 
that children pass from an egocentric stage 
to a stage of absolute conformity to social 


4 It should be noted that the large variability of
opinion and delay of gratification items, as com-
pared to visual items, is in part artifactual. Six
critical (pressure) visual items were used, while only
three each of the other types of items were used for
pressure trials. Conformity on visual items should
have been, therefore, more stable. Even so, this does
not account for the large variability on delay of
gratification items. One possible reason for this
extreme variability may be that preferences are un-
stable as objects of social influence, due to the ab-
sence of physical or social referents. Crutchfield
(1955) failed to find conformity on personal pref-
erences and attributed this failure to the extreme
subjectivity and isolation from group standards of
these judgments. Allen and Levine (1971b) did ob-
tain conformity on preferences, but they found
widely differing variability for different types of
preference stimuli.
norms, and finally to a more mature and rela-
tivistic conception of norms. Another stage- 
trait conception has been advanced by Hoving 
et al. (1969) to explain developmental 
changes in conformity. They assume that two 
motives (“need to be correct” and “need for 
peer approval”) both increase with age. 
Data from the present study have relevance 
to these stage-trait theories of conformity 
development. 

Results of the present study showed the 
same pattern of conformity to peers across 
age for types of items differing greatly 
in stimulus ambiguity—objectively verifiable 
visual items and subjective opinion items. An
implication of parallel results for visual and 
opinion items clarifies Hoving et al.’s (1969) 
definition of ambiguity and poses some dif-
ficulty for their theory. Decrease in conform-
ity over age appears not to be a function of
stimulus ambiguity (i.e., clarity with which 
the group’s position differs from the objec- 
tively correct alternative), but rather to be 
a function of “social ambiguity”—the clarity 
with which the group’s position is different 
from the subject’s own position. Age trends 
may therefore reflect differences in norm 
structure (i.e., social ambiguity), rather than 
developmental changes in the “need to be 
correct” and the “need for approval.” When 
norms are clearly defined and generally agreed 
upon, deviation is more readily recognizable, 
and conformity pressure is more easily 
exerted. Thus, given equivalent ambiguity of 
social norms across age, level of conformity 
might not change. The present observed de- 
crease in conformity over age may reflect the 
existence of more ambiguous norms, that is, 
norms having a less universal and less 
absolute quality. 

Increase in conformity to peers in tenth- 
grade males is also relevant to developmental 
or stage theories. Despite failure to reach 
significance, this finding is consistent with 
research on adults. Adults conform consider- 
ably more than did eighth graders in this 
study. An increase toward adult levels of 
conformity in the oldest group, therefore, is 
reasonable in view of other data. Hoving et al. 
(1969) did not include subjects older than 
Grade 8, so our data are not inconsistent with 
their results. It is, however, inconsistent with
their explanation in terms of the cumulative
development through reinforcement of a “need 
for accuracy.” Iscoe et al. (1963) reported 
an increase in male conformity similar to that 
reported here at the same age levels, although 
their results at younger ages were inconsistent 
with the present findings. 

The preceding comments about an increase
in conformity to peers at older ages apply
only to male findings. Female conformity de-
creased slightly over age for both opinion
and visual items, but the difference did not 
approach statistical significance. Failure to 
find age trends for females does some 
damage to a maturational or stage explanation 
of conformity development. Developmental 
stages are usually conceived as being inde- 
pendent of sex typing. That sex typing may 
have important consequences for group func- 
tioning at these ages is suggested by Vinacke 
and Gullickson’s (1964) study. They demon-
strated significant age trends in frequency of
coalition formation for males across ages 7-8,
14-16, and 18-22, but failed to find an effect
of age for females. 

Results from the present experiment for the 
unanimous group condition, then, support 
neither a reinforcement-nurtured need expla-
nation, as Hoving et al. (1969) proposed,
nor a Piagetian stage explanation as put forth 
by Costanzo and Shaw (1966) and espoused 
by Hartup (1970). Results clearly did not 
show the slightest hint of the type of curvi- 
linear relation between age and conformity 
(inverted-U curve) that would be predicted 
by these stage theories. 


A Situational Theory 


As an alternative to stage conceptions for 
explaining these results, we hypothesize that 
age changes in conformity are due to direct 
action of situational factors. Several aspects 
of the present results are consistent with such 
an analysis. Results for the social support 
condition are compatible with an explanation 
of social influence in terms of situational 
factors. The uniform effectiveness of social 
support (partner) in reducing conformity 
indicates that reaction to group pressure is 
quite similar across age levels. Explanations
invoking greater or lesser awareness of the
peer group as a reason for different levels of 
conformity are not consistent with these re- 
sults (Costanzo & Shaw, 1966; Hamm & 
Hoving, 1969). Such explanations would have 
difficulty accounting for both high conformity 
at lower ages and reduction of conformity to 
equivalent levels at all ages by social support. 
Results for adult influence in the present 
study also indicate the importance of situa- 
tional factors. The sex difference found in 
conformity to peers did not appear in the 
adult pressure condition. Moreover, age-trend 
curves for adult and peer pressure were not 
identical, contrary to what might be expected 
from a general influenceability theory. Age 
trends in female data are particularly inter- 
esting in the absence of strong trends for 
females in the peer-influence condition. What- 
ever the qualitative difference in peer rela- 
tions that accounts for sex differences in peer 
influence, such sex differences apparently do 
not exist in relationship to adult authority.

Two kinds of situational changes related 
to age can be suggested to account for age 
differences in conformity—(a) changes in so- 
cial organization and (b) changes in flexibil- 
ity and generality of norms. We do not intend 
to imply that these two factors encompass all 
situational variables that may be differentially 
associated with age. But these two broad 
factors do seem to include most of the im- 
portant variables affecting conformity that 
change systematically across age. 

Evidence for change in social organization 
across age comes from diverse sources.
Hartup (1970) cited evidence indicating 
several changes that occur through middle 
childhood: boys increasingly form cliques, 
cohesiveness of informal groups increases, and 
rejection based in part on norm sharing by 
ingroup members “appears to function as 
a more powerful determinant of cleavages in 
older children’s groups than in younger peer 
groups [p. 375].” Vinacke and Gullickson 
(1964) found that boys were increasingly 
willing to form coalitions as they got older, 
but that girls maintained an accommodative 
or noncoalition strategy. Changes such as 
these in social organization could account for 
decline in conformity through middle child-
hood, and for increase in conformity at the
high school level as older adolescents begin 
to be socialized into the broader society. 

The second suggested situational factor 
accounting for conformity trends over age is 
change in flexibility and generality of social 
norms. Piaget’s (1932) theory implies two
basic, related changes in children’s under- 
standing of social norms. One change is from 
a rigid, literal, and highly explicit conception 
to a more relativistic and flexible conception 
of norms. A second and closely related change 
is in learning that norms are essentially con- 
tractual, mutually held agreements applying 
only to the persons involved, instead of being 
absolute and universally applicable rules. 
These changes bear directly upon the “social 
ambiguity” notion advanced earlier. The ef- 
fect of both of these changes is to increase 
social ambiguity through middle childhood. 
With increasing flexibility of norm conception, 
recognition of deviation becomes less clear 
and certain for both actor and audience; with 
decreasing generality of norm conception, the 
relationship between the audience and the 
specific norm may become a more important 
determinant of the effectiveness of conformity
pressure.

Our postexperimental data suggest that 
changes occur in the child’s conception of 
the meaning of consensus and dissent in 
groups. Older subjects seemed to tolerate dif- 
ferences with peers and expected this toler- 
ance to be reciprocated. Reasons that subjects 
gave for disagreement with peer pressure be- 
came increasingly “realistic” with increase in 
age—for example, “Different people can be 
expected to disagree.” (Interestingly, this type 
of attribution was not given as a reason for 
disagreement with adult authorities.) 

The situational theory presented above, 
which proposes that age differences in con- 
formity are directly mediated by social or- 
ganization and by norm structure, has impor- 
tant implications for research on conformity 
development. For example, classrooms used 
for research may be characterized by increas- 
ingly fragmented social organization at older 
ages. Pressure-group members are thus more 
likely to be members of an outgroup; or, 
at least, they are less likely to be members
of an important reference group. Therefore,
obtained age trends in conformity may not 
reflect age differences in influenceability. 
Instead, results might be due to the inadver-
tent failure to hold constant differences in the
nature of pressure groups across age.
Although social organization and norm 
structure may be affected by maturational 
factors, of more direct impact on social or- 
ganization and norm structure are envi- 
ronmental factors only fortuitously or co- 
incidentally connected with age. For example, 
children from several neighborhood elemen- 
tary schools usually all attend the same large 
central school for the last years of high school. 
This change would very likely affect both so- 
cial organization and norm structure. That 
such a change from one school to another 
occurs at a particular age is not, of course, 
integrally related to the appearance of any 
critical maturational stage. Other situational 
factors will likewise be tied to age merely as 
the result of cultural tradition or conveni- 
ence. We suggest, however, that situational 
factors are the critical determinants of level 
of conformity, instead of age or developmen- 
tal stage per se. Since situational variables 
are intricately embedded in the cultural and 
social environment, direct investigation of 
the changing social ecology across age levels 
will be necessary to clarify the nature of 
conformity development. 
STEVEN G. COLE


Texas Christian University 


A truel—a three-person duel—was employed to examine strategy preferences, 
the effect of trust, and a parity versus equal division of a payoff on alliance 
partner preference in potentially intense conflict situations. Six representative 
situations from the disparity-of-relative-strengths continuum are examined. 
The observed behavior in the potentially intense conflict situations suggested 
that merely providing a means for terminating an intense conflict situation 
by cooperative behavior is not sufficient to reduce the level of conflict. Given 
incompatible goals, limited communication, the need for complete trust in 
your opponent, and a means by which a situation can be terminated by 
conflict, the conflict strategy will be chosen. The same incompatible goals, an
improved channel of communication, and a negation of the need to trust 
your opponent, will result in a cooperative strategy. Keeping the same incom- 
patible goals and improved channel of communication without a negation of 
the need to trust your opponent will result in the selection of a quasi-coop- 
erative strategy that deteriorates into a conflict strategy at the first chance. 


In the past 20 years, there has been a 
prodigious amount of scholarly effort di- 
rected toward understanding conflict. One 
phase of the examination of conflict has fo- 
cused on controlled laboratory experiments 
and has studied the behavior of individuals 
in games designed to include the possibility 
of conflict behavior. Prominent among schol-
ars who have chosen to use games to examine
conflict in the laboratory is Anatol Rapoport, 
who proposed that a “degree of realism” is an 
essential feature of game research designed to 
examine conflict (Rapoport, 1967). Such 
realism may range from complex facsimiles of 
an imagined situation in intricate detail to 
those games that remove all but the essential 
features that have been marked for special 
attention. Rapoport suggested that the intri- 
cate detail found in the complex experiment 
results in difficulty in both interpretation and 
replication. Thus, he chose the Prisoner’s Di-
lemma paradigm as a means for studying con-
flict. 

One of the attractions of the Prisoner’s 
Dilemma as an experimental paradigm is the 
fact that it is a well-controlled situation in 
which the dependent variable—the number 
of cooperative responses by each subject—is 
easily quantifiable. In addition, it has been 
considered to be a semirealistic situation in 
which the players make decisions and bar- 
gain as they would in real life. There are 
situations, however, in which the Prisoner’s 
Dilemma encounters difficulties. Foremost 
among them is the difficulty that arises when
an attempt is made to design replicable and 
interpretable experiments using the Prisoner’s 
Dilemma paradigm in conflict situations in-
volving more than two participants.

The n-uel (an n-person duel; Cole & Phil-
lips, 1969; Phillips, Cole, & Hartman, 1970)
overcomes the dyadic restriction placed on
the Prisoner’s Dilemma, while it retains those
aspects that are considered to be desirable.
Any conflict situation in which n participants
contend for incompatible positive payoffs, in
which it is not necessary that any of the par-
ticipants receive a positive payoff, and in
which, at most, one of the participants may
receive a positive payoff is defined as an n-uel.
The contention between participants in an
attempt to obtain a positive payoff is defined
as “uelative conflict” (Cole & Phillips, 1969).
A simple example of the pure uelative con-
flict situation is a duel to the death, which has
two possible outcomes: either one participant
wins and the other loses or both participants
lose. A somewhat less extreme form of uela-
tive conflict is exemplified by a duel in which
the negative payoff involved is not necessarily
death. For example, consider a duel in which
a wound or a unilateral refusal to participate
is sufficient to discontinue the conflict. In
such a situation the goals of the participants
are still incompatible. However, the availabil-
ity of a divisible payoff structure associated
with the situation increases the number of
possible outcomes (Nitz & Phillips, 1969;
Phillips & Nitz, 1968), thus increasing the
number of possible strategies which may be
employed. The defining factor, that at most
one participant can receive a positive payoff,
remains. Moreover, it remains possible for
all of the players to lose.


Game Design 


The present study utilized a three-person 
n-uel, termed a truel by Shubik (1964), to 
examine potential uelative conflict and some 
possible means for transforming a potential 
uelative conflict situation into a cooperative 
situation. The game used in the present study 
was a modification of the game used by Cole 
and Phillips (1967), Cole (1968, 1969), 
Phillips et al. (1970), and Hartman (1970). 
One player was strong, one player was of me- 
dium strength, and one player was weak 
(A > B > C). Each player began each game 
with 20 points and a given number of points 
that he had to remove from one of the other 
players on each move of the game. A move 
consisted of all three players simultaneously 
choosing the opponent from whom they 
wanted to remove points. When a player 
chose to remove points from one of his op- 
ponents, those points were subtracted from 
the opponent’s point total. The game ended 
when, at most, one player had points remain- 
ing. 
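The move rule just described can be illustrated with a short sketch. It is not the apparatus logic used in the study; the function names are hypothetical, the removal powers (8, 6, 4) are taken from one column of Table 1, and flooring a player's total at zero is an assumption made here for convenience.

```python
from typing import Dict


def play_move(points: Dict[str, int], power: Dict[str, int],
              targets: Dict[str, str]) -> Dict[str, int]:
    """Apply one simultaneous move and return the new point totals.

    points  -- points each player currently holds
    power   -- points each player must remove on every move
    targets -- opponent chosen by each surviving player
    """
    new_points = dict(points)
    for attacker, target in targets.items():
        if points[attacker] <= 0:
            continue  # eliminated players no longer attack
        if target == attacker or points[target] <= 0:
            raise ValueError("target must be a surviving opponent")
        # All choices are simultaneous, so every attack made on this
        # move counts, even if the attacker is eliminated by it.
        # Assumption: totals are floored at zero.
        new_points[target] = max(0, new_points[target] - power[attacker])
    return new_points


def game_over(points: Dict[str, int]) -> bool:
    """The game ends when, at most, one player has points remaining."""
    return sum(1 for p in points.values() if p > 0) <= 1


start = {"A": 20, "B": 20, "C": 20}
power = {"A": 8, "B": 6, "C": 4}
after = play_move(start, power, {"A": "B", "B": "A", "C": "A"})
print(after)             # A loses 6 + 4, B loses 8, C is untouched
print(game_over(after))
```

Because the new totals are computed from the totals held before the move, two players can eliminate each other on the same move, which is how a game can end with no survivor.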

The number of points that each player 
could take away was chosen such that six 
different point distributions from the all-equal 
point to the dictator region on the disparity-
of-relative-strengths continuum were exam-
ined. Since it was considered to be an im-
portant case, the situation in which one
player had veto power was included. Table 1 
reports the number of points that each player 
could remove as a function of game type. 

In the pure truel games, the sole survivor 
was the winner, and the payoff for winning 
the game was 100 points. As in previous re- 
search, if there was no sole survivor, there 
was no winner, and none of the participants 
received a payoff for that game. 

The problems of involvement and under-
standing were reduced by hiring the subjects
at student labor wages ($1.50 per hour) plus 
bonuses. The subjects participated in the 
experiment for 3 hours each day for 14 days. 
At the end of each 6 hours of game playing,
the subject in each triad who had accumulated 
the most points received double salary—he 
was paid for 6 hours of work at $3 per hour. 
The other two players in each triad were paid 
for 6 hours of work at $1.50 per hour. 


Potential Uelative Conflict 


To examine the behavior in potential uela- 
tive conflict situations, the game was modified 
to allow it to end by cooperative behavior. 
Each of the participants was given an option 
to pass rather than attack one of the other 
players on each trial. Thus, the game could 
end with three, two, one, or no survivors, 
depending on the participants’ choice of 
strategies. If at any time all surviving partici- 
pants passed for three simultaneous consecu- 
tive trials, the game ended. The payoff was 
structured such that all survivors divided a 
100-point payoff in direct proportion to the 
ratio of the number of points that they had 
remaining when the game ended to the total 
points remaining. The monetary reward that 
was paid to the player who had accumulated 
the most points at the end of each 6-hour 
period remained indivisible. Thus, the choice 
of a cooperative response (to pass) could re- 
sult in raising one’s own point total at the 
expense of raising the other survivor’s (or 
survivors’) point total(s). Moreover, since it 
required cooperation from all surviving play- 
ers to end a game by passing, it was necessary 
to trust that the other player or players
would also pass. The confounding of immedi- 
ate and long-range goals was designed to make 
the participants in the game carefully con- 
sider the conditions under which they would 
prefer to pass, thus sharing the 100-point 
payoff.
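The termination and payoff rules for the potential uelative conflict games can be sketched as follows. This is an illustration under stated assumptions rather than the procedure programmed into the apparatus: the names `split_payoff`, `ended_by_passing`, and the `PASS` marker are hypothetical, and the point totals in the example are invented for the arithmetic.

```python
from typing import Dict, List

PASS = "pass"  # hypothetical marker for the cooperative (pass) choice


def split_payoff(points: Dict[str, int], payoff: float = 100.0) -> Dict[str, float]:
    """Divide the payoff among survivors in proportion to points remaining."""
    total = sum(p for p in points.values() if p > 0)
    shares = {name: 0.0 for name in points}
    if total == 0:
        return shares  # no survivors, so nobody is paid
    for name, p in points.items():
        if p > 0:
            shares[name] = payoff * p / total
    return shares


def ended_by_passing(trials: List[Dict[str, str]], required: int = 3) -> bool:
    """True if every surviving player passed on each of the last `required` trials.

    Each trial maps a surviving player to PASS or to the name of the
    opponent attacked on that trial.
    """
    if len(trials) < required:
        return False
    return all(choice == PASS
               for trial in trials[-required:]
               for choice in trial.values())


# C has been eliminated; A and B then pass three times in a row.
history = [{"A": "C", "B": "C"}] + [{"A": PASS, "B": PASS}] * 3
print(ended_by_passing(history))
print(split_payoff({"A": 12, "B": 8, "C": 0}))  # A holds 12 of the 20 points left
```

With 12 and 8 points remaining, the survivors would receive 60 and 40 of the 100 points, which shows why passing raises an opponent's total as well as one's own.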

It was predicted that the propensity to 
pass would be an inverse function of the dis- 
parity of relative strengths for two reasons. 
First, as the strengths become less disparate,
it becomes more difficult for the participants
to predict who will survive in a pure uelative 
conflict situation. Second, as the strengths of 
the participants become less disparate, the 
possibility of sharing the 100 points equally 
becomes more acceptable to all concerned. 

Hypothesis 1: The propensity to pass will 
increase as the disparity of relative strengths 
decreases.


Alliances in the Potential Uelative Conflict 
Situation 


Consider a potential uelative conflict situ- 
ation in which one participant is eliminated 
and the remaining participants choose to pass 
for three simultaneous consecutive times, thus 
sharing the 100-point payoff for that game. 
It could be said that the survivors had 
formed an implicit alliance. To form such an 
alliance, the participants must have overcome 
the fact that communication prior to the 
commitment to a strategy was unavailable. 
There is also some amount of trust necessary 
to continue such an alliance strategy once it 
has been reciprocated by the other partici- 
pant. Willis and Long (1967) found that once 
all participants had passed for two consecu- 
tive moves, in virtually all instances they 
continued to pass until the game ended. This 
would seem to imply that some degree of 
trust was present. However, Willis and Long’s 
paradigm was unique in that the participants 
had a retaliatory capability. Moreover, each 
participant had the power to eliminate any 
other participant with one attack. The combi- 
nation of the total destruction and retaliation 
capabilities controlled by each participant 
made passing a desirable strategy. It was 
fairly simple to coordinate, since the likeli-
hood of surviving once one had made an at-
tack was low. Willis and Long found that in
almost all cases, once an attack was made, all
participants were eliminated. The retaliatory
capability virtually eliminated the need for
trust. It was not necessary to trust that a
participant would not attack because retalia-
tion simply canceled any advantage that he
might gain.


TABLE 1

POINTS EACH PLAYER COULD REMOVE
IN EACH GAME TYPE

Player position      Game type
A could remove        6    7    8    9   10   11
B could remove        6    6    6    6    6    6
C could remove        6    5    4    3    2    1

Note. The number of points that each player could remove
was chosen such that the disparity-of-relative-strengths con-
tinuum was examined from the all-equal point to the dictator
region.

A glance at the paradigm for the present 
experiment reveals that such was not the case. 
In fact, in many instances in which only two 
players are remaining, it is the case that 
should one pass and the other attack, the 
attacker gains enough to obtain complete con- 
trol over the outcome of the game. Thus, he 
can obtain the entire 100-point payoff rather 
than share it. 

The effect of trust on alliance formation 
was examined by allowing explicit alliances 
to form prior to the attack periods. In one 
condition (the nonaggression condition), the 
alliance partners were not allowed to attack 
each other at any time. If the isolate (the 
participant not in the alliance) was elimi- 
nated, the game ended, and the alliance 
partners divided the 100-point payoff in 
direct proportion to the ratio of the 
points they had remaining to the total 
points remaining. In another condition (the 
mutual defense condition), the alliance 
partners were not allowed to attack each 
other while the isolate had points re- 
maining; however, once the isolate was 
eliminated, the alliance partners had the op- 
tion to attack each other. If they chose not 
to attack for three simultaneous consecutive 
moves, the game ended, and the 100-point 
payoff was divided as it was in the nonaggres-
sion games. Thus, both the nonaggression
and mutual defense games could end in an
identical manner. In the nonaggression games 
the subjects were forced to honor their alli- 
ance, while in the mutual defense games each 
alliance member had to trust his partner not 
to attack once the isolate was eliminated. 
Moreover, in many cases, should one alliance 
member pass and the other attack, the at-
tacker would gain complete control of the
outcome. This latter situation has been clas- 
sified as a terminal coalition-formation situa- 
tion by Caplow (1959). Caplow predicted 
that in such a situation no alliance would 
form. It seems that such a prediction is un- 
necessarily pessimistic and that trust might 
very well be triumphant. Thus, it was pre- 
dicted that subjects would initially trust each 
other and that alliances would be formed.
Whether alliances continued to be formed
would be a function of the propensity for the
alliance members to refrain from attacking
each other once the isolate was eliminated. If
the alliance members revealed a propensity 
to attack, the formation of alliances would 
cease. Otherwise, alliances would continue to 
be formed. 

Since in the mutual defense games the alli- 
ance members had the option to attack each 
other once the isolate was eliminated, the 
formation of an alliance between the two 
weaker players would insure each of the 
alliance members that their partner could do 
the minimum amount of damage once the iso- 
late was eliminated. For example, consider a 
game in which one player is strong (A), one 
is medium strength (B), and one is weak 
(C), and in which any alliance could win. 
Should B and C form an alliance and A be 
eliminated, both B and C would be in the 
alliance in which the number of points that 
their partner could remove was minimized. 
Should B or C form an alliance with A, how- 
ever, and should the isolate be eliminated, 
both B and C would be in the alliance in 
which the number of points their partner 
could remove was maximized. As a result, in 
the mutual defense games, it was predicted 
that there would be a tendency to form alli- 
ances between the two weaker players given 
that such an alliance would win. 

On the other hand, in the nonaggression 
games, once the isolate was eliminated, the 
game was over, and the amount of the payoff
that each alliance member received was a 
function of the total points that he had re- 
maining. If the weakest participant were the 
isolate, the least damage could be done to 
each alliance member, and each would expect 
to receive a greater share of the payoff than 
if the strongest participant were the isolate. 
Thus, in the nonaggression games, it was pre- 
dicted that a propensity to form strong alli- 
ances would follow from the desire to mini- 
mize losses suffered from attacks by the iso- 
late. 
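The two lines of reasoning above reduce to a simple rule for each condition, which can be written out as a sketch. The function name and condition labels are hypothetical, and the removal powers (8, 6, 4) are one column of Table 1 that is assumed here to lie between the all-equal point and the veto point, where any alliance could win.

```python
from typing import Dict


def preferred_partner(player: str, power: Dict[str, int], condition: str) -> str:
    """Predicted alliance partner for `player` under the reasoning in the text.

    mutual_defense -- choose the opponent who could remove the fewest
                      points, since the partner may attack once the
                      isolate is eliminated
    nonaggression  -- choose the opponent who could remove the most
                      points, so that the isolate is the opponent who
                      can do the least damage
    """
    opponents = [name for name in power if name != player]
    if condition == "mutual_defense":
        return min(opponents, key=lambda name: power[name])
    if condition == "nonaggression":
        return max(opponents, key=lambda name: power[name])
    raise ValueError("unknown condition: " + condition)


power = {"A": 8, "B": 6, "C": 4}
for condition in ("mutual_defense", "nonaggression"):
    choices = {p: preferred_partner(p, power, condition) for p in power}
    print(condition, choices)
```

In the mutual defense case B and C choose each other, giving the weak alliance; in the nonaggression case A and B choose each other, giving the strong alliance. When two opponents are equally strong the sketch simply returns the first one listed, whereas the text predicts a random preference at the all-equal point.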

The immediately preceding predictions only 
fit those games in which the distribution of 
relative strengths is located between the all- 
equal point and the veto point on the dis- 
parity-of-relative-strengths continuum. At the 
all-equal point a random propensity for alli- 
ances to form was predicted. Since at the veto 
point and above, any winning alliance must 
include the strongest participant, the strong- 
est participant was predicted to be the pre- 
ferred alliance partner. At the veto point the 
strongest participant was predicted to prefer 
the weakest participant, and alliances com- 
posed of the strongest and weakest partici- 
pants were predicted in the mutual defense 
games. In the nonaggression games, alliances 
were predicted between the strongest and 
medium-strength participants in an attempt 
to minimize possible losses from attacks by 
the isolate. In the dictator situation it was 
predicted that regardless of the fact that the 
strongest participant was the preferred alliance 
partner, few alliances would form because the
strongest participant had no need to form an 
alliance. 

Hypotheses 2 and 3 apply only to those 
situations between the all-equal point and 
the veto point on the disparity-of-relative- 
strengths continuum. 

Hypothesis 2: In the nonaggression games, 
the stronger of each player’s opponents will be 
the preferred alliance partner. 

Hypothesis 3: In the mutual defense games, 
the weaker of the two opponents will be the 
preferred alliance partner. 

Hypotheses 4, 5, and 6 apply only to those 
situations at the veto point and above on the 
disparity-of-relative-strengths continuum. 

Hypothesis 4: In both the nonaggression
and mutual defense games, both of the weaker
players will prefer the strongest player as an 
alliance partner. 

Hypothesis 5: At the veto point the strong- 
est player will prefer to form an alliance with 
the weakest player in the mutual defense 
games and with the medium-strength player 
in the nonaggression games. 

Hypothesis 6: In the dictator situation the 
strongest player will choose not to form an 
alliance; thus, there will be a propensity not 
to form alliances in the dictator situation. 

Gamson (1961a) proposed that each par- 
ticipant in a coalition-formation situation 
would attempt to maximize the proportion of 
the resources that he contributed to a coali- 
tion while at the same time entering into a 
winning coalition (minimum resource theory). 
Gamson (1961a) suggested that such behavior 
resulted from the parity norm—“any partici- 
pant will expect others to demand from a 
coalition a share of the payoff proportional to 
the amount of resources which they contribute 
to the coalition [p. 376].” By maximizing 
one’s share of the resources contributed to a 
coalition, one would be in a better position to 
bargain for a larger share of the payoff. In 
general, the research on coalition formation 
has tended to support minimum resource 
theory (e.g., Gamson, 1961b; Phillips & Nitz, 
1968; Vinacke & Arkoff, 1957). 

The present experiment made a direct test 
of minimum resource theory. In one condition 
(the coalition 50/50 condition), the coali- 
tion’s share of the payoff was divided equally 
by both members of the coalition. Thus, no 
advantage in bargaining position could be 
gained by forming the coalition in which one 
contributed the most resources. Another con- 
dition (the coalition parity condition) was 
designed such that each coalition member’s 
share of the payoff was in direct proportion 
to the ratio of the number of points that he 
could remove to the number of points that 
the coalition could remove. In effect, the 
members of the coalition were forced to di- 
vide the payoff according to the parity 
principle, and an advantage was gained by 
forming that coalition to which one contrib- 
uted the most resources. 
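The difference between the two division rules can be made concrete with a small calculation. The sketch below is illustrative only: the function names are hypothetical, the coalition is assumed to receive the full 100-point payoff, and the removal powers (8, 6, 4) are one column of Table 1.

```python
from typing import Dict, Tuple


def coalition_shares(members: Tuple[str, str], power: Dict[str, int],
                     rule: str, payoff: float = 100.0) -> Dict[str, float]:
    """Split the coalition's payoff between its two members.

    "50/50"  -- equal division
    "parity" -- each member's share is proportional to the points he
                could remove relative to the points the coalition
                could remove
    """
    first, second = members
    if rule == "50/50":
        return {first: payoff / 2, second: payoff / 2}
    if rule == "parity":
        total = power[first] + power[second]
        return {first: payoff * power[first] / total,
                second: payoff * power[second] / total}
    raise ValueError("unknown rule: " + rule)


def best_partner(player: str, power: Dict[str, int], rule: str) -> str:
    """Opponent whose coalition gives `player` the largest share."""
    opponents = [name for name in power if name != player]
    return max(opponents,
               key=lambda o: coalition_shares((player, o), power, rule)[player])


power = {"A": 8, "B": 6, "C": 4}
print(coalition_shares(("B", "C"), power, "parity"))  # B contributes 6 of 10
print(coalition_shares(("A", "B"), power, "parity"))  # B contributes 6 of 14
print({p: best_partner(p, power, "parity") for p in power})
```

Under parity, B's share is larger with C than with A, and C's share is larger with B than with A, so the weak coalition is the one both prefer. Under the 50/50 rule every coalition pays each member the same amount, so `best_partner` merely returns the first opponent listed; nothing in the payoff favors one coalition over another.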

If minimum resource theory is correct, the 
participants in the coalition 50/50 games 
would have no basis for determining which
coalition to form, and a random preference 
for coalitions would be predicted. Moreover, 
in the coalition parity games, a propensity 
for weak coalitions would be predicted. Since 
coalition research to date has uncovered no 
reasons to believe minimum resource theory 
to be grossly inaccurate, the present author 
predicted results consistent with minimum re- 
source theory. 

Hypotheses 7 and 8 apply only to those 
situations between the all-equal point and the 
veto point, since they are the only games that 
are included within the boundary conditions 
of minimum resource theory. 

Hypothesis 7: A propensity to form weak 
coalitions will be manifest in the coalition 
parity games.

Hypothesis 8: Random formation of coali- 
tions will be manifest in the coalition 50/50 
games.

Hypotheses 9, 10, and 11 apply only to 
those situations at the veto point and above 
on the disparity-of-relative-strengths contin- 
uum. Since such situations fall outside of the 
boundary conditions of minimum resource 
theory, they cannot be considered a test of 
the theory itself. On the other hand, an ap-
plication of the parity principle at the veto
point and above does result in some rational 
predictions. 

Hypothesis 9: In both the coalition 50/50 
and coalition parity games, the strongest 
player will be the preferred alliance partner of 
both of the weaker players. 

Hypothesis 10: At the veto point the 
strongest player will prefer to form an alli- 
ance with the weakest player in the coalition 
parity games and will have no preference 
in the coalition 50/50 games. 

Hypothesis 11: In the dictator situation the 
strongest player will choose not to form an 
alliance; thus, there will be a propensity not 
to form alliances in the dictator situation. 


METHOD 
Subjects 


The subjects in the experiment were 24 male stu- 
dents at Michigan State University. They were re- 
cruited by an advertisement in the school newspaper 
which offered work between the fall and winter 
terms at a rate of $1.50 per hour plus bonuses.


Apparatus


The subjects played the games in an 84 foot X 
16 foot room which was surrounded on three sides 
by a U-shaped viewing room. One-way mirrors and 
an intercom allowed the experimenter to keep a 
constant surveillance on all of the activity in the ex- 
perimental room. A table divider which was designed 
to divide a 24 foot X 64 foot table into three sec- 
tions so that the subjects could not see each other 
was used to minimize variance as a result of face-to-
face interaction. Each subject had a communication
terminal located directly in front of him in his sec- 
tion of the table divider. Three lights on the com- 
munication terminal allowed the experimenter to 
communicate information to the subjects indicating 
which position he had during any game. Three push- 
button switches on the communication terminal al- 
lowed each subject to indicate his attack choice and, 
when applicable, his alliance choice. 

A scoreboard which contained information con-
cerning the number of points that each player had
remaining and the number he could remove was 
located 8 feet in front of the subjects. Three rows 
of 20 lights indicated the number of points that 
each player had remaining for the respective player 
positions. A sequence of four white lights and one 
blue light was used to facilitate the computation of 
the number of points that each player had remain- 
ing by the subjects during a game. The number of 
points that each player could remove was displayed 
by a rear screen projector immediately to the left 
of each row of lights. Each digit projected by the 
rear screen projector was approximately 4 inches X 
22 inches. The entire experiment was controlled by a
programable solid-state logic device. The experi- 
menter had access to a master control panel which 
allowed him to monitor the attack choices of the 
subjects as well as giving him the means for setting 
up the proper experimental manipulations prior to 
each game. 


Procedure 


Eight triads participated in the experiment. Each 
subject was assigned to a triad such that he worked 
during a time period that he preferred. In addition, 
care was taken to insure that none of the subjects 
within a given triad knew each other. Each triad 
remained intact for fourteen 3-hour game-playing 
sessions. Six 3-hour sessions were played each day 
between 6:00 a.m. and 12:00 midnight. None of the 
groups played more than one session during any 
day, and 3 days between game-playing sessions was 
the maximum for any group. The labels VAF, ZEJ, 
and YOV were counterbalanced over the number of 
points to be removed and randomly assigned to the 
subjects for each game. 

Due to the time required to run the experiment, 
five trained experimenters were employed; however, 
all instructions were read to the subjects by the 
same experimenter. The method of presenting the 
instructions combined with the distance imposed by 
the viewing room arrangement and the control of the
experiment by the programable solid-state logic
device resulted in a minimal amount of interaction
between the experimenters and the subjects. Thus,
no attempt was made to counterbalance or random- 
ize for experimenter effect. Instead, both the sub- 
jects and the experimenters were assigned to time 
slots that they preferred. 

When the subjects applied for the job as adver-
tised in the newspaper, they were given a mimeo-
graphed job description. Those subjects who agreed
to participate in the experiment were asked to indi-
cate the hours that they preferred to work during
the week. Sixty-four subjects agreed to participate
at the time of the initial interview. The subjects were 
later contacted according to the time slots that they 
preferred and the order in which they had answered 
the advertisement. At this time the subjects who 
agreed to participate were told that they would not 
be paid until their part of the experiment was 
finished and that if they did not finish the experi- 
ment, they would not receive any payment for the 
time that they had spent. This was tempered some- 
what by an agreement on the part of the experi- 
menter to consider special cases that might be 
caused by emergencies. In reality, no emergencies 
occurred, and all of the subjects who started the 
experiment finished the experiment. Once the 24 
subjects needed for the experiment had been se- 
lected, the others who had answered the advertise- 
ment were contacted and informed that they would 
not be needed. 

When the subjects first entered the laboratory they 
were given a quick tour of the experimental rooms 
to reduce any effect associated with the remote sur- 
veillance procedure. They were then seated at the 
table divider and read instructions that described 
the use of the communication terminals and the 
scoreboard as well as the rules of the game and how 
the bonuses would be determined. 

The subjects were informed about whether they 
had won the bonus for any particular 6-hour period 
immediately preceding the beginning of the next 
6-hour period. The information concerning who had 
won the bonus was known only by the subject that 
won. The other subjects only knew that they did not 
win.

To familiarize the subjects with the rules of the
game and the relative power of the player positions,
all of the triads played 60 of each of the six game
types as pure truel games. That is, they were re-
quired to attack one of the opponents on each move.
After the 360 pure truel games were played, the pass 
option games began. 

The pass option games were different from the 
pure truel games in one major respect. A decision 
period, 30 seconds in length, was incorporated into 
the game. There were seven decision periods per 
game. During each decision period, each player was 
allowed to attack or not to attack. No player could 
attack more than once in any given decision period.
The attack choices were recorded on the scoreboard
at the end of each 30-second decision period. If no
attack was made by any player for three decision
periods in succession, the game ended. Thus, there
could be three, two, one, or no survivors depending 
on the play of the game. In the pass option games, 
the 100-point payoff was divided in direct proportion 
to the number of points that each survivor had 
remaining to the total points remaining in that 
game. After 30 of each of the six game types had 
been played with the pass option, the alliance games 
began. 
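
The proportional division of the 100-point payoff described above can be illustrated with a short sketch. The function name `divide_payoff` and the example point totals are hypothetical and are not taken from the experiment; only the proportional rule itself comes from the procedure.

```python
def divide_payoff(points_remaining, payoff=100.0):
    """Split the payoff among survivors in proportion to their remaining points.

    points_remaining maps a player label to the points that player still has.
    A player with 0 points is treated as eliminated (an assumption of this sketch).
    """
    total = sum(points_remaining.values())
    if total == 0:
        # No survivors: nobody receives a share.
        return {player: 0.0 for player in points_remaining}
    return {player: payoff * pts / total
            for player, pts in points_remaining.items()}


# Hypothetical end-of-game state: two survivors holding 12 and 6 points.
shares = divide_payoff({"A": 12, "B": 6, "C": 0})
```

With these illustrative totals the first survivor receives two thirds of the payoff and the second one third, while the eliminated player receives nothing.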

The alliance games were identical in procedure to 
the pass option games with the insertion of three 
30-second contact periods which preceded the seven 
decision periods allowed for attacks. During these 
three 30-second contact periods, alliances could be 
formed. Contact choices were made by using the 
same button switches that were used to indicate 
attack choices. Each player was allowed only one 
contact choice during each contact period, and
reciprocal choices were a prerequisite for an alliance 
in all of the alliance games. As soon as a reciprocal 
contact occurred, the alliance process ended. If there 
were no reciprocal contacts during the three contact 
periods, the game was played as a pass option game. 

Triads 1 and 2 played 30 of each of the six game
types as nonaggression alliance games. A nonag- 
gression alliance dictated that the members of the 
alliance could not attack each other. It did not dic- 
tate that they had to attack the third player. If the 
third player was eliminated, the game was over, and 
the payoff was divided in direct proportion to the 
points each of the surviving players had remaining 
relative to the total points remaining. It was possible 
for one member of the nonaggression alliance to be 
eliminated and for the other member to receive all 
of the points. Since the pass option was in effect, it 
was also possible for the members of the nonaggres- 
sion alliance to pass, resulting in three survivors. 
When that happened, the payoff was distributed as 
it was in the same situation in the pass option games. 

Triads 3 and 4 played 30 of each of the six game 
types as mutual-defense alliance games. One require- 
ment of a mutual-defense alliance was that should 
the third player attack one of the members of the 
alliance, his alliance partner had to attack the third 
player on the following move. The members of a 
mutual-defense alliance were allowed to attack each 
other once the third player was eliminated; how- 
ever, they could not attack each other prior to the 
elimination of the third player. The procedure for the 
distribution of the payoff was identical to the pro- 
cedure in the nonaggression alliance games. 

Triads 5-8 played 30 of each of the six game types 
as coalition games. If a coalition was formed, the 
game was played exactly as it was in the pass option 
games. The only change resulting from a coalition 
was the rule for distributing the payoff. Since a 
coalition was a common-fate alliance, if at least one 
member of the coalition survived, they both received 
a payoff which was distributed according to a pre-
arranged decision. If neither of the members of the
coalition survived, then the coalition’s share of the 
payoff was zero. For Triads 5 and 6, the coalition’s 
share of the payoff was divided equally between the
members (50/50). For Triads 7 and 8, the coalition’s
share of the payoff was divided directly propor- 
tional to the number of points that each of the coali- 
tion members could remove relative to the total 
points the coalition could remove (parity). Since 
the pass option was in effect, the members of a coali- 
tion could choose not to eliminate the third player. 
In such a case, the share of the payoff given to the 
coalition was directly proportional to the number of 
points that the members of the coalition had remain- 
ing relative to the total number of points remaining. 
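
The two rules for dividing a coalition's share can be contrasted in a brief sketch. The function name `split_coalition_share` and the example values are hypothetical. The sketch also assumes that a game-type label such as 8/6/4 lists the points that Players A, B, and C can each remove, so that a B-C coalition in that game has removal powers of 6 and 4.

```python
def split_coalition_share(share, removal_power, rule):
    """Divide a coalition's share of the payoff between its two members.

    removal_power maps each member to the points he can remove;
    rule is "50/50" (equal split) or "parity" (split proportional to
    removal power).
    """
    if rule == "50/50":
        return {member: share / 2 for member in removal_power}
    if rule == "parity":
        total = sum(removal_power.values())
        return {member: share * power / total
                for member, power in removal_power.items()}
    raise ValueError("rule must be '50/50' or 'parity'")


# Hypothetical B-C coalition that wins the full 100 points in an 8/6/4 game.
equal_split = split_coalition_share(100.0, {"B": 6, "C": 4}, "50/50")
parity_split = split_coalition_share(100.0, {"B": 6, "C": 4}, "parity")
```

Under the equal rule each member receives 50 points; under the parity rule the same coalition divides the payoff 60 to 40 in favor of the member with the greater removal power.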

The last 108 games (18 of each of the six game 
types) played by each triad were pure truel games. 


RESULTS 


To simplify the presentation of the data,
the players are labeled A, B, and C. The rela-
tionship between players was considered to be
such that A > B > C (see Table 1). Player
A’s stronger opponent was Player B, and
Player B and C’s stronger opponent was
Player A. For consistency, in the all-equal
(6/6/6) game, the relationship between play-
ers was also defined to be A > B > C. Thus,
even though Player A and Player C could
each remove 6 points, Player A is referred to
as Player B’s stronger opponent. Player A is
also referred to as Player C’s stronger op-
ponent, and Player B is referred to as Player
A’s stronger opponent.


Pass Option Results 


Hypothesis 1, that the propensity to pass 
increases as the disparity of relative strengths 
decreases, was the only hypothesis relevant to
the pass option games. To test that hypothe- 
sis, a probability-of-passing score was calcu- 
lated for each power position in each game 
type for each of the eight experimental 
groups. The probability-of-passing score was 
considered to be the ratio of the total number 
of passes for each player position in each 
game type relative to the total opportunities 
to pass. Table 2 reports the probability-of- 
passing scores. 
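
The score defined above is a simple ratio, and a minimal sketch of how it might be tallied is given here. The record layout (one tuple per opportunity to pass) and the function name `passing_scores` are assumptions made for illustration; the original data were not necessarily stored this way.

```python
from collections import defaultdict


def passing_scores(records):
    """Return passes / opportunities for each (group, game type, position).

    records is an iterable of (group, game_type, position, passed) tuples,
    one tuple per opportunity to pass (a hypothetical layout).
    """
    passes = defaultdict(int)
    opportunities = defaultdict(int)
    for group, game_type, position, passed in records:
        key = (group, game_type, position)
        opportunities[key] += 1
        passes[key] += int(passed)
    return {key: passes[key] / opportunities[key] for key in opportunities}


# Hypothetical records: Player A in Group 1 passes once in four opportunities.
records = [
    (1, "6/6/6", "A", True),
    (1, "6/6/6", "A", False),
    (1, "6/6/6", "A", False),
    (1, "6/6/6", "A", False),
]
scores = passing_scores(records)
```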

A 3 X 6 analysis of variance with repeated 
measures on both factors was computed on the 
probability-of-passing scores. Table 3 reports 
a summary of that analysis. Since groups were 
the unit of analysis, the between-groups effect 
was not examined; however, the between- 
groups variance was partialed out of the 
within-groups analysis. The results of that


TABLE 2
PROBABILITY-OF-PASSING SCORE FOR PLAYER POSITION (A, B, OR C) AND GAME TYPE

                                          Game type
Group |     6/6/6     |     7/6/5     |     8/6/4     |     9/6/3     |    10/6/2     |    11/6/1
      |  A    B    C  |  A    B    C  |  A    B    C  |  A    B    C  |  A    B    C  |  A    B    C
  1   | .27  .28  .35 | .07  .18  .12 | .12  .07  .08 | .06  .04  .16 | .05  .03  .02 | .02  .43  .54
  2   | .42  .30  .34 | .28  .26  .35 | .42  .22  .33 | .23  .22  .21 | .05  .05  .07 | .03  .27  .32
  3   | .05  .08  .10 | .03  .03  .04 | .02  .05  .06 | .01  .01  .01 | .00  .00  .00 | .00  .07  .17
  4   | .08  .04  .07 | .13  .07  .08 | .09  .03  .05 | .02  .08  .07 | .00  .00  .02 | .00  .00  .07
  5   | .15  .28  .19 | .02  .03  .02 | .08  .00  .01 | .00  .00  .01 | .00  .00  .00 | .00  .05  .03
  6   | .03  .03  .04 | .00  .02  .01 | .03  .00  .01 | .03  .00  .03 | .00  .00  .00 | .00  .03  .00
  7   | .33  .26  .30 | .21  .11  .18 | .19  .03  .15 | .19  .07  .12 | .00  .00  .00 | .00  .00  .07
  8   | .01  .06  .04 | .00  .04  .11 | .01  .01  .26 | .01  .02  .24 | .00  .02  .03 | .05  .21  .50


analysis revealed a significant effect for game 
type (F = 4.57, p < .01) and a significant 
interaction between game type and player 
position (F = 4.67, p < .01). 
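
The F ratios cited here can be checked against the mean squares reported in Table 3. The sketch below assumes that each within-group effect is tested against its own interaction with the between-Ss factor, which is how the rows of Table 3 are paired; that pairing is a reading of the table layout rather than something the text states explicitly. The names `f_ratio` and `table3_included` are hypothetical.

```python
def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by its error mean square."""
    return ms_effect / ms_error


# (MS effect, MS error) pairs from the "11/6/1 included" half of Table 3.
table3_included = {
    "game type": (0.064, 0.014),
    "player position": (0.030, 0.009),
    "game type x player position": (0.014, 0.003),
}

f_values = {name: round(f_ratio(ms_effect, ms_error), 2)
            for name, (ms_effect, ms_error) in table3_included.items()}
```

Rounded to two decimals, the three ratios come to 4.57, 3.33, and 4.67, matching the tabled F values.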

Scheffé’s multiple-comparison t tests of the
means relevant to the interaction suggested
that the behavior in the 11/6/1 game was
responsible for the significant interaction. In
the 11/6/1 game, the mean probability of
passing for Player A was significantly dif-
ferent from the mean probability of passing
for Player C (t_AC = 7.143, t' = 5.089, α <
.01) and the mean probability of passing for
Player B (t_AB = 4.643, t' = 4.439, α = .05).
There were no other significant differences 
between players within any of the six game 
types. 

To further test Hypothesis 1, a revised 
analysis of variance identical to the previous
analysis except for the exclusion of the 11/6/1
data was computed on the probability-of-
passing scores. A summary of the 3 X 5
analysis of variance with repeated measures 
on both factors is reported in Table 3. The 
results of the analysis of variance revealed a 
significant effect for game type (F = 8.33, p 
< .001) and no other significant effects. 
The significant effect for game type did not 
necessarily support Hypothesis 1; therefore, a 
Duncan multiple-range test with a signifi-
cance level of α = .01 was computed on the
means of the probability-of-passing scores 
across player positions within the 6/6/6 
through 10/6/2 games. The results of the 
Duncan multiple-range test indicated sup- 
port for Hypothesis 1. There was a signifi- 
cant difference between the mean probability 
of passing in the 6/6/6 and the 10/6/2 


TABLE 3
SUMMARY OF ANALYSIS OF VARIANCE ON PROBABILITY-OF-PASSING SCORE

                     |     11/6/1 included      |     11/6/1 excluded
Source               |  SS    df    MS     F    |  SS    df    MS     F
Game type (A)        |  .32     5  .064  4.57*  |  .30     4  .075  8.33**
A X between Ss       |  .48    35  .014         |  .25    28  .009
Player position (B)  |  .06     2  .030  3.33   |  .02     2  .010  2.00
B X between Ss       |  .13    14  .009         |  .07    14  .005
A X B                |  .14    10  .014  4.67*  |  .02     8  .003  1.50
A X B X between Ss   |  .20    70  .003         |  .09    56  .002
Total                | 1.33   136               |  .75   112

* p < .01.
** p < .001.


TABLE 4
CONTACTS IN THE INITIAL CONTACT PERIOD IN THE 7/6/5, 8/6/4, AND 9/6/3 GAMES

                 |     7/6/5     |     8/6/4     |     9/6/3     |    Combined
Game type        | Strong   Weak | Strong   Weak | Strong   Weak | Strong   Weak
Nonaggression    |   99      72  |   94      80  |  103      71  |  296     223
  Z a            |    1.99*      |     .99       |    2.35*      |    3.16**
Mutual defense   |   28      78  |   47      57  |   25      75  |  100     210
  Z a            |    4.71***    |     .88       |    4.90***    |    6.19***
Coalition parity |   22     141  |   20     145  |   22     144  |   64     430
  Z a            |    9.24***    |    9.65***    |    9.39***    |   16.14***
Coalition 50/50  |   74     105  |   84      94  |   81      98  |  239     297
  Z a            |    2.32*      |     .67       |    1.20       |    2.46*

a For random preference of alliance partner.
* p < .05.
** p < .01.
*** p < .001.


games; however, the mean probability of 
passing was not significantly different between 
the 6/6/6 through 9/6/3 games or between 
the 7/6/5 through 10/6/2 games. For all five 
game types, the mean probabilities of passing 
were ordered such that the probability of 
passing increased as the disparity of relative 
strengths decreased. 


Alliance Game Results 


To test Hypotheses 2, 3, 7, and 8, two sets 
of data were examined: (a) the number of 
times that the stronger opponent was con- 
tacted in the initial contact period (see Table 
4) and (b) the alliances resulting from re- 
ciprocal contacts in the initial contact period 
(see Table 5). Only data obtained in the ini-
tial contact period were examined because it
was assumed that these data reflected pref-
erences for alliance partners that might be
modified on subsequent trials. One-tailed Z
tests were computed on the contact data for 
each game type within each alliance condi- 
tion reported in Table 4. To permit the use of 
the Z tests, the probability of contacting 
either opponent was assumed to be .5. 
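
A sketch of such a one-tailed Z test follows. It uses the normal approximation to the binomial with a chance probability of .5. The continuity correction of .5 is an assumption of this sketch: the text does not say whether one was applied, although with the correction the 7/6/5 nonaggression counts in Table 4 (99 strong, 72 weak) give the tabled value of 1.99. The function name `one_tailed_z` is hypothetical.

```python
import math


def one_tailed_z(strong, weak, p=0.5, continuity=True):
    """Z test of contact counts against a chance probability p.

    Returns (z, one-tailed p value) under the normal approximation
    to the binomial distribution.
    """
    n = strong + weak
    expected = n * p
    sd = math.sqrt(n * p * (1 - p))
    deviation = abs(strong - expected)
    if continuity:
        # Continuity correction (assumed here, not stated in the text).
        deviation = max(deviation - 0.5, 0.0)
    z = deviation / sd
    p_value = 0.5 * math.erfc(z / math.sqrt(2))  # upper-tail normal area
    return z, p_value


# Contacts of the stronger vs. weaker opponent, 7/6/5 nonaggression games.
z, p_value = one_tailed_z(99, 72)
```
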
Hypothesis 2 predicted that in the non- 
aggression alliance games, the stronger of the 
other two participants would be the preferred 
alliance partner in the 7/6/5, 8/6/4, and 
9/6/3 games. In the 7/6/5 and 9/6/3 non-
aggression alliance games, Hypothesis 2 was
supported: the stronger opponent was con-
tacted significantly more often than the


TABLE 5 
ALLIANCES RESULTING FROM INITIAL CONTACTS IN THE 7/6/5, 8/6/4, AND 9/6/3 GAMES


7/6/5 8/6/4 9/6/3 Combined 
Game type 
AB AC BC AB AC BC AB AC BC AB AC BC 
Nonaggression 27 10 8 23 20 15 27 24 8 77 54 31 
Qa 14.59*** 1.69 10.61** 19.59***
Mutual defense Sis [n 8a EEO: GOS eee 1 1 5 10 S| 11-9. 
Qs 6.13* 82 4.57 3.65 
Coalition parity | 1 | 1 { 5 o | 1 | 48 P EREA nul 46 3° ERT 
Not computed 
Coalition 50/50 | 16 | 9 | 29 Vid Resa Jen pe 4 14 | 10 | 24 42 28 75 
Qs 11.44** 5.37 6.50* 24.10*** 


Note.—df = 2 for all Q tests. 
a For random alliance formation. 
* p < .05.


weaker opponent (z = 1.99, p < .05, and z
= 2.35, p < .05, respectively); however, it 
was not supported in the 8/6/4 game (z = 
.99). There was a significant propensity to 
contact the stronger opponent for the com-
bined 7/6/5, 8/6/4, and 9/6/3 data (z =
3.16, p < .01).

To further test Hypothesis 2, three separate 
Q tests (McNemar, 1957, p. 232), one each 
on the 7/6/5, 8/6/4, and 9/6/3 game types, 
were computed on the nonaggression games 
data to test the hypothesis that the number 
of AB, AC, and BC alliances resulting from 
initial contacts was not different from zero. 
That hypothesis was rejected in the 7/6/5 
(Q = 14.59, df = 2, p < .001) and 9/6/3
(Q = 10.61, df = 2, p < .01) games; how-
ever, it was not rejected in the 8/6/4 game
(Q = 1.69, df = 2). The rejection of the hy- 
pothesis indicated that the stronger opponent 
was the preferred alliance partner. Thus, Hy- 
pothesis 2 was supported by both the initial 
contact and initial alliance data in the 7/6/5 
and 9/6/3 games. It was not supported in the 
8/6/4 game. A Q test on the combined 
7/6/5, 8/6/4, and 9/6/3 initial alliance data 
supported Hypothesis 2 (Q = 19.59, df = 2,
p < .001).

Hypothesis 3 predicted that in the mutual 
defense games the weaker of the other two 
participants would be the preferred alliance 
partner in the 7/6/5, 8/6/4, and 9/6/3 
games. The initial contact data reported in 
Table 4 supported Hypothesis 3 in the 7/6/5 
and 9/6/3 games (z = 4.71, p < .001, and
z = 4.90, p < .001, respectively). Hypothe-
sis 3 was not supported in the 8/6/4 game
(z = .88); however, the combined 7/6/5,
8/6/4, and 9/6/3 initial contact data revealed
a tendency to prefer the weaker opponent as
an alliance partner in the mutual defense
games (z = 6.19, p < .001). The alliances re-
sulting from initial contacts in the mutual
defense games as reported in Table 5 sup-
ported Hypothesis 3 in the 7/6/5 game
(Q = 6.13, df = 2, p < .05), but they did not
support Hypothesis 3 in the 8/6/4 or 9/6/3
games (Q = .82, df = 2, and Q = 4.57, df =
2, respectively). No tendency to prefer the
weaker opponent as an alliance partner was
noted in the combined 7/6/5, 8/6/4, and
9/6/3 initial alliance data (Q = 3.65, df = 2).
Thus, Hypothesis 3 was supported by both
the initial contact and initial alliance data in 
the 7/6/5 game, was only partially supported 
in the 9/6/3 game, and was not supported 
in the 8/6/4 game. 

Hypothesis 7 predicted that a propensity 
to form weak coalitions would be manifest in 
the coalition parity games. The initial con- 
tact data reported in Table 4 supported 
Hypothesis 7 in the 7/6/5, 8/6/4, and 9/6/3 
game types (z = 9.24, 9.65, and 9.39, respec- 
tively, p < .001). Further support for Hy- 
pothesis 7 is indicated by the combined initial 
contact data (z = 16.14, p < .001). The alli- 
ances resulting from initial contacts reported 
in Table 5 were such that statistical analyses 
were not necessary. In the 7/6/5 game, 53 of 
55 alliances were weak; in the 8/6/4 game, 
48 of 49 alliances were weak; and in the 
9/6/3 game, 46 of 49 alliances were weak. 
For the combined initial alliance data, 147 
of 153 alliances were weak. It was obvious 
that a propensity to form weak (BC) alli- 
ances was manifest in all three game types 
and that there was strong support for Hy- 
pothesis 7 by both the initial contact and 
initial alliance data. 

Hypothesis 8 predicted that a random 
formation of coalitions would be manifest in 
the coalition 50/50 games. A random pref- 
erence for alliance partners was manifest in 
the 8/6/4 and 9/6/3 games as reported in 
Table 4. Moreover, in the 8/6/4 game, a 
random propensity to form alliances was 
manifest as reported in Table 5. Hypothe-
sis 8 was not supported, however, by the
preference for alliance partners manifest in
the 7/6/5 game (z = 2.32, p < .05) or the
propensity to form alliances in the 7/6/5 and
9/6/3 games (Q = 11.44, df = 2, p < .01,
and Q = 6.50, df = 2, p < .05, respectively).
Moreover, both the initial contact data and
the initial alliance data indicated that there
was a nonrandom preference for alliances in
the coalition 50/50 game (z = 2.46, p < .05,
and Q = 24.10, df = 2, p < .001 for the ini-
tial contact data and initial alliance data,
respectively). Thus, Hypothesis 8 was only 
partially supported by the data. 

Hypothesis 4 predicted that the strongest 
player would be the two weaker players’
preferred alliance partner in the 10/6/2 and


TABLE 6
INITIAL CONTACTS FOR EACH PLAYER POSITION IN THE 10/6/2 AND 11/6/1 GAME TYPES 
10/6/2 11/6/1 
Player position 
Game type 
A B € A B | ¢ 
Strong | Weak Strong | Weak | Strong | Weak | Strong | Weak | Strong | Weak | Strong | Weak 

Nonaggression | 29 | 31 | 59 | 1 1 0 10 | 33 9 

2% A3 7.36** 5.78** 3.90* 3.75* 
Mutual defense 18 | 26 ZASE 58 2 | 50 16 7 Pete: TEUA 

Z» 1.06 6.67** 6.62** 3.73* 4.27* 
Coalition parity AEE ale diet EA eos 0 2 R TA EE 36 |16 

Ze 7.03** .95 2.24* 2:12% 2302F 
Coalition 50/50 aN Peay 54 | 5 ) 4 2 2 pi bie a) 49 | 9 

Zs 13 672555 6.51** 5.29** 6.30** 


a For random preference of alliance partner.
„01


11/6/1 games in both the mutual defense and
nonaggression alliance games. The initial con-
tact data from the mutual defense games
(see Table 6) not only failed to support Hy-
pothesis 4, but it indicated a significant pro-
pensity for the weaker player to be the
preferred alliance partner (z = 6.67, p < .001,
and z = 6.62, p < .001 for Players B and C,
respectively, in the 10/6/2 game; and z = 
3.73, p < .01, and z = 4.27, p < .01 for Play- 
ers B and C, respectively, in the 11/6/1 
game). On the other hand, Hypothesis 4 was 
supported by the initial contact data (see 
Table 6) in the nonaggression games (z = 
7.36, p< .001, and z= 5.73, p< .001 for 
Players B and C, respectively, in the 10/6/2 
game; and z = 3.90, p < .01, and z = 3.75, 
p < .01 for Players B and C, respectively, in 
the 11/6/1 game). 

Hypothesis 5, that at the veto point the 
strongest player prefers to form an alliance 
with the weakest player in the mutual defense 
games and with the medium-strength player 
in the nonaggression games, was not supported
by the initial contact data. One-tailed Z tests
computed on the initial contact data in
both the nonaggression and mutual defense
games suggested that the behavior of the 
strongest player was random (z=.13 and 
1.06, respectively). 

Hypothesis 9 predicted that the two weaker
players would prefer the strongest player as
their alliance partner in both the 11/6/1 and
10/6/2 games. The initial contact data
reported in Table 6 supported Hypothesis 9
except in the 10/6/2 coalition (parity) games,
in which Player B displayed random behavior
(z = .95) and Player C preferred his weaker
opponent as an alliance partner (z = 2.24,
p < .05). For Players B and C (z = 2.12,
p < .05, and z = 2.63, p < .05, respectively),
there was
a significant propensity to contact the 
strongest player in the 11/6/1 coalition 
(parity) games. In the coalition (50/50) 
10/6/2 games, Players B and C revealed a 
significant propensity to contact the strongest 
player (z = 6.25 and 6.51, respectively, p < 
.001). For the 11/6/1 coalition (50/50) 
games, a similar propensity was manifest 
(z = 5.29, p < .001 for Player B; and z= 
6.30, p < .001 for Player C). 

Hypothesis 10, that at the veto point the 


TABLE 7
ALLIANCES RESULTING FROM INITIAL CONTACTS
IN THE 10/6/2 AND 11/6/1 GAME TYPES

                 |     10/6/2     |     11/6/1
Game type        |  AB   AC   BC  |  AB   AC   BC
Nonaggression    |  29   26    0  |   1    0    1
Mutual defense   |   2    1   42  |   0    1   20
Coalition parity |   1   17   18  |   1    1    8
Coalition 50/50  |  25   30    0  |   2    0    3


strongest player prefers to form an alliance
with the weakest player in the coalition 
parity game and has no preference in the 
coalition 50/50 games, was strongly sup- 
ported by the initial contact data (see Table 
6). In the coalition 50/50 games, the 
strongest player contacted his stronger op- 
ponent 27 times and his weaker opponent 31 
times, which did not reject the hypothesis of 
random behavior (z = .13). In the coalition 
parity games, the assumption of random be- 
havior for Player A’s contact propensities was 
rejected (z = 7.03, p < .001).

Hypothesis 6 predicted that in the mutual 
defense and nonaggression games within the 
dictator region, the strongest player would 
choose not to form an alliance, resulting in 
fewer alliances being formed in the 11/6/1 
game than in any of the other games. Hy- 
pothesis 11 made a similar prediction for the 
coalition parity and coalition 50/50 games. 
Table 8 presents the number of contact 
attempts made by Player A in all four 
of the alliance conditions for Game Types 
7/6/5 through 11/6/1. A 4 x 5 analysis of 
variance with repeated measures on one factor 
was computed on these data. Table 9 reports 
a summary of that analysis. A significant main 
effect was indicated for alliance type (F = 
51.57, p < .01), game type (F = 529.81, p <
.001), and the Game Type x Alliance Type
interaction (F = 28.23, p < .001). Since Hy- 
potheses 6 and 11 are not concerned with 
effects across alliance types or effects result- 
ing from an interaction between alliance types 
and game types, a Scheffé multiple-compari-
son t test was computed which compared the


TABLE 8
NUMBER OF CONTACTS MADE BY STRONGEST PLAYER

                 |               Game type
Alliance type    | 7/6/5   8/6/4   9/6/3   10/6/2   11/6/1
Nonaggression    |   29      28      29      30        0
                 |   30      29      30      30        0
Mutual defense   |   18      19      17      19       11
                 |   18      16      19      25       12
Coalition parity |   28      27      28      29        1
                 |   29      28      30      30        1
Coalition 50/50  |   30      29      30      29        3
                 |   30      30      29      29        1


TABLE 9


SUMMARY OF ANALYSIS OF VARIANCE COMPUTED ON
NUMBER OF CONTACTS MADE BY THE
STRONGEST PLAYER

Source                              |    SS    | df |   MS   |    F
Between groups                      |  293.60  |  7 |        |
  Alliance types (A)                |  286.20  |  3 |  95.40 |  51.57*
  Groups within alliance types      |    7.40  |  4 |   1.85 |
Within groups                       | 3958.40  | 32 |        |
  Game type (B)                     | 3390.75  |  4 | 847.69 | 529.81**
  A X B                             |  542.05  | 12 |  45.17 |  28.23*
  B X Groups within alliance types  |   25.60  | 16 |   1.60 |
Total                               | 4252.00  | 39 |        |

* p < .01.
** p < .001.

mean propensity for the strongest player to 
make a contact between game types within 
each alliance condition. The results of that 
multiple-comparison t test indicated that
there was a significant difference in the mean
propensity for the strongest player to make
a contact between the 11/6/1 and 10/6/2
games in each alliance condition (nonaggres-
sion, t = 23.81; mutual defense, t = 8.33;
coalition parity, t = 22.62; coalition 50/50,
t = 21.23; t' = 6.96, α = .01). No sig-
nificant differences in the propensity for the
strongest player to attempt to form an alli- 
ance were noted between the 10/6/2 games 
and any of the other game types in any of the 
alliance conditions. 

In the mutual defense games, the number 
of alliances resulting from the initial contacts 
further supported Hypothesis 6 with respect 
to strong alliances; however, the 20 weak al-
liances that were formed in the 11/6/1 mutual
defense games questioned that support. A 
search for the reason that 20 weak alliances 
were formed in the mutual defense games 
suggested that they were a reflection of the 
unexpected propensity for the two weaker 
players to prefer their weaker opponent as an 
alliance partner. That phenomenon was previ- 
ously acknowledged as a rejection of Hy- 
pothesis 4 and is not considered a rejection of 
Hypothesis 6. The lack of alliances formed 
in the 11/6/1 nonaggression games coupled 
with the propensity for the two weaker play- 
ers to prefer the stronger opponent as an 
alliance partner in the 11/6/1 nonaggression 
games tends to support such an interpreta-
tion. Hypothesis 11 was given further support
by the fact that only 5 of the 60 possible alli-
ances were formed in the 11/6/1 coalition
(50/50) games, and 10 of the possible alli-
ances were formed in the 11/6/1 coalition
(parity) games.


Results Relevant to a Reduction of Conflict 
Behavior 


One of the stated purposes of the present 
experiment was to determine whether any of 
the manipulations examined did, in fact, re- 
sult in a reduction of conflict behavior. Three 
aspects of the data—the observed probability 
of attacking given the opportunity to attack, 
the mean number of attacks per game, and 
the probability of survival—were considered 
to provide some measure of the level of con- 
flict behavior. Each of those three aspects of 
the data were examined separately and are 
presented below. 

Table 10 presents the probability of attack- 
ing given an opportunity to attack for each 
player position by game type, alliance condi-
tion, and conflict type (the pure truel games,
the pass option games, and the alliance 
games). A 4 X 6 X 3 X 3 analysis of variance 
with repeated measures on three factors was 
computed on these data so that the simple ef- 
fects could be thoroughly analyzed by means 
of Scheffé’s multiple-comparison t tests. The
data were ordered such that there were no 
significant differences in the probability of 
attacking given the opportunity to attack as a 
function of alliance type or player position. 
There were significant effects for game type 
(F = 3.107, p ≈ .031) and conflict type (F
= 8.642, p ≈ .010) as well as a significant
Game Type X Conflict Type interaction (F =
3.283, p ≈ .003). The Game Type X Player
Position interaction and the Game Type X 
Conflict Type X Player Position interaction 
were also significant (F = 6.098 and 3.774, 
respectively, p < .0005). 

Scheffé’s multiple-comparison t tests sug-
gested that the significant difference for game
type was a function of the difference between 


TABLE 10 
MEAN PROBABILITY OF ATTACKING GIVEN AN OPPORTUNITY TO ATTACK
Player position 
ane Conflict type Mutual defense Nonaggression Coalition parity Coalition 50/50 
A B Cc A B Cc A B Le A B Cc 
6 Pure truel 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 
6 Pass option 82 87 84 -86 -87 -86 87 86 82 -90 87 -90 
6 Alliance 78 85 85 96 97 97 94 85 95 99 97 99 
7 Pure truel 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 
6 Pass option .88 89 86 <92: .95 .89 -96 .92 .94 -95 .97 .97 
5 Alliance .89 .89 .89 .97 .98 .97 .89 .97 -96 -98 97 99 
8 Pure truel 1,00 | 1.00 | 1,00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1,00 | 1.00 
6 Pass option 82 91 87 93 98 85 94 96 96 94 -99 .97 
4 Alliance 91 .89 81 97 95 .97 -86 98 97 | 1.00 | 1.00 98 
9 Pure truel 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1,00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 
6 Pass option .90 1 .90 .92 97 -86 9O7 98 94 99 97 97 
3 Alliance 83 94 94 203 96 98 91 .98 .97 | 1.00 | 1.00 -99 
10 Pure truel 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 
6 Pass option 98 -98 98 99 98 -98 .98 -99 | 1.00 -99 | 1.00 -98 
2 Alliance 98 96 97 99 .95 .91 97 94 95 | 1.00 | 1.00 98 
11 Pure truel 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 
6 Pass option 98 .90 .89 .98 93 -81 99 83 .76 | 1.00 .98 97 
1 Alliance 96 82 as 99 85 -79 .96 -90 82 | 1.00 99 .99 
the 6/6/6 games and the 10/6/2 games in
the pass option condition (t = 6.94; t' =
5.89, α = .01). There were no other signifi-
cant differences in the probability of attack-
ing given the opportunity to attack as a
function of game type in the pure truel, pass
option, or alliance games. The significant dif-
ference between conflict types was a function
of the difference between the pure truel and
pass option 6/6/6 games (t = 7.67, t' = 5.89,
α = .01). There were no other significant
effects as a function of conflict type.

Table 11 reports the mean number of at- 
tacks per game for each player position, in 
each game type within each of the four alli- 
ance conditions, for the three conflict types. 
A 4 × 6 × 3 × 3 analysis of variance, simi-
lar to that computed on the probability of
attacking given the opportunity to attack,
was computed on the mean number of attacks
per game. A significant main effect for game
type (F = 132.198, p < .0005) and for con-
flict type (F = 492.938, p < .0005) as well as


STEVEN G. COLE


a significant interaction between game type
and conflict type (F = 40.428, p < .0005)
were indicated. In addition, there was a Game
Type × Player Position and Game Type ×
Conflict Type × Player Position interaction
(F = 5.024 and 3.144, respectively, p <
.0005).

Scheffé’s multiple-comparison t tests were
again used to examine the relevant simple
effects. In the pure truel and pass option situ-
ations, there was no difference in the
mean number of attacks per game between
the 6/6/6 and 7/6/5 games, the 7/6/5 and
9/6/3 games, the 9/6/3 and 8/6/4 games,
and the 10/6/2 and 11/6/1 games. In both
the pure truel and pass option situations, the
10/6/2 and 11/6/1 games manifested a sig-
nificantly smaller mean number of attacks
per game than did any of the other game
types (t = 9.19 and 10.00, respectively; t' =
5.89, α = .01). A significant propensity for
more attacks per game in the 9/6/3 games
than in any of the other game types was noted


TABLE 11
MEAN NUMBER OF ATTACKS PER GAME

                            Player position
                            Mutual defense     Nonaggression      Coalition parity   Coalition 50/50
Game type   Conflict type   A     B     C      A     B     C      A     B     C      A     B     C
6/6/6       Pure truel      4.57  4.57  4.57   4.58  4.58  4.58   4.36  4.36  4.36   4.47  4.47  4.27
6/6/6       Pass option     4.25  4.55  4.40   4.21  4.27  4.23   4.43  4.38  4.18   4.23  4.10  4.43
6/6/6       Alliance        3.37  3.68  3.68   2.93  2.98  2.97   2.95  2.66  2.97   3.08  3.02  3.10
7/6/5       Pure truel      4.11  4.11  4.11   4.12  4.12  4.12   4.15  4.15  4.15   4.24  4.24  4.24
7/6/5       Pass option     3.97  4.05  3.87   4.03  4.20  3.93   4.15  3.98  4.07   4.20  4.27  4.27
7/6/5       Alliance        3.42  3.40  3.32   3.00  3.02  3.00   2.80  3.07  3.03   3.02  2.97  3.03
8/6/4       Pure truel      3.64  3.64  3.64   3.67  3.67  3.67   3.72  3.72  3.72   3.67  3.67  3.67
8/6/4       Pass option     3.42  3.83  3.62   3.77  4.00  3.45   3.62  3.67  3.67   3.62  3.68  3.77
8/6/4       Alliance        4.67  3.37  3.07   2.93  2.87  2.95   2.68  3.07  3.03   2.95  2.25  2.88
9/6/3       Pure truel      3.87  3.87  3.87   3.91  3.91  3.91   3.92  3.92  3.92   3.92  3.92  3.92
9/6/3       Pass option     3.95  4.00  3.97   4.05  4.25  3.78   3.92  3.95  3.78   4.05  3.95  3.95
9/6/3       Alliance        3.43  3.88  3.90   2.93  3.03  3.08   3.58  3.87  3.85   3.43  3.43  3.40
10/6/2      Pure truel      3.08  3.08  3.08   3.06  3.06  3.06   3.18  3.18  3.18   3.13  3.13  3.13
10/6/2      Pass option     2.97  2.97  2.95   3.03  3.02  3.00   3.10  3.12  3.15   3.05  3.07  3.02
10/6/2      Alliance        3.00  2.93  2.95   2.93  2.82  2.78   3.00  2.90  2.95   2.95  2.95  2.90
11/6/1      Pure truel      3.06  3.06  3.06   3.04  3.04  3.04   3.04  3.04  3.04   3.05  3.05  3.05
11/6/1      Pass option     2.97  2.72  2.68   3.07  2.88  2.50   3.03  2.53  2.32   3.03  2.98  2.93
11/6/1      Alliance        3.03  2.56  2.33   3.03  2.58  2.40   3.02  2.85  2.57   3.18  3.17  3.45
TABLE 12
PROBABILITY OF SURVIVAL

Player position
Game type Conflict type Mutual defense Nonaggression Coalition parity Coalition 50/50
A B C A B C A B C A B C
6 Pure truel .25 .25 .20 A8 13 .20 18 25 .26 .28 323: 25, 
6 Pass option 2I .22 18 19 12 22 19 14 222 29. .22 AT 
6 Alliance AS A8 12 WP 38 AT 64 34 .70 37 a5 48 
7 Pure truel 14 .20 .22 14 21 .20 19 13 37 AS 22 28 
6 Pass option 1S AS 32 25 10 22 19 Aly? cdi, 17 R25) 19, 
5 Alliance 12 -15 ayi -10 .59 37 .05 -90 -90 42 -77 .64 
8 Pure truel hg 22 ial 2, ee Ae A NS Ca) <22; .14 .20 | .28 .16 ll .20 
6 Pass option 22 19 tyi ily 10 34 -20 .27 27 as 32 -20 
4 Alliance .17 19 34 A3 -67 .58 10 -80 .82 45 -65 52 
9 Pure truel 14 05 A7 16 14 oS 13 05 06 Al -06 07 
6 Pass option SIDE CY RES E en cone dere eto aS oy SO Ee Sh IS POZE te Sh 
3 Alliance =22 04 15 85 59 40 05 82 -19 -40 .63 Ri 
10 Pure truel .07 .02 .05 18 02 03 -07 .02 .06 05 .02 .07 
6 Pass option .08 02 -00 05 .00 04 07 02 .03 05 .00 07 
2 Alliance 14 .02 00 97 .50 AT -99 we} 35 97 47 .56 
1i Pure truel .96 .02 .03 .94 .01 04 95 01 03 -96 01 02 
6 Pass option 99 -02 -05 -99 -00 -02 | 1.00 -00 .02 -99 -00 -02 
1 Alliance :99 -04 .02 .95 .03 02 sOy .04 .03 | 1.00 02 02 


in the alliance games (t = 5.968; t' = 5.89,
α = .01).

In the 6/6/6, 7/6/5, 8/6/4, and 9/6/3
games, a significant propensity for more at-
tacks per game in the pass option (PO) and
pure truel (PT) games than in the alliance
(A) games was noted. In the 6/6/6 games
t_PT-A = 22.10 and t_PO-A = 18.87; in the
7/6/5 games t_PT-A = 17.10 and t_PO-A =
15.81; in the 8/6/4 games t_PT-A = 10.65 and
t_PO-A = 10.87; and in the 9/6/3 games t_PT-A
= 6.77 and t_PO-A = 7.74 (t' = 5.89, α = .01).
There were no significant differences between
the mean number of attacks per game in the
pure truel and pass option situations in any
game type, and there were no significant dif-
ferences between the mean number of at-
tacks per game within each conflict type in
the 10/6/2 and 11/6/1 games.
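
The comparisons just reported can be laid out as a simple check of each statistic against the critical value. The sketch below only restates the numbers given in the text; the dictionary layout and the function name are illustrative assumptions, not part of the original analysis.

```python
# Reported Scheffe t statistics for mean attacks per game, comparing
# pure truel (PT) and pass option (PO) games with alliance (A) games.
# Values are copied from the text; the data structure is hypothetical.
T_CRITICAL = 5.89  # critical value t' reported for alpha = .01

reported_t = {
    "6/6/6": {"PT-A": 22.10, "PO-A": 18.87},
    "7/6/5": {"PT-A": 17.10, "PO-A": 15.81},
    "8/6/4": {"PT-A": 10.65, "PO-A": 10.87},
    "9/6/3": {"PT-A": 6.77, "PO-A": 7.74},
}


def exceeds_critical(t_value, critical=T_CRITICAL):
    """Return True when the statistic is larger than the critical value."""
    return abs(t_value) > critical


results = {
    game: {comparison: exceeds_critical(t) for comparison, t in comparisons.items()}
    for game, comparisons in reported_t.items()
}

for game, comparisons in results.items():
    for comparison, is_significant in comparisons.items():
        print(game, comparison, "significant" if is_significant else "not significant")
```

Every comparison in these four game types exceeds the critical value, with the margin shrinking as the disparity of strengths grows.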

Table 12 presents the probability of sur-
vival for Players A, B, and C in each game
type, within each of the four alliance condi-
tions, for the three conflict types. A 4 × 6 ×
3 × 3 analysis of variance with repeated
measures on three of the factors was com-
puted on the probability of survival data.
There was a significant main effect for game
type (F = 50.469, p < .0005), conflict type
(F = 460.199, p < .0005), player position (F
= 18.474, p ≈ .001), and alliance type (F =
25.174, p ≈ .005). The Game Type × Alli-
ance Type (F = 6.839, p < .0005), Conflict
Type × Alliance Type (F = 60.565, p <
.0005), Game Type × Conflict Type × Alli-
ance Type (F = 6.528, p < .0005), Game
Type × Player Position (F = 80.842, p <
.0005), and Game Type × Conflict Type ×
Player Position (F = 2.455, p ≈ .002) inter-
actions were also significant.

The simple effects for the Game Type ×
Conflict Type × Player Position interaction
and for the Game Type × Conflict Type ×
Alliance Type interaction were examined by
means of Scheffé’s multiple-comparison t tests.
Player A had a significantly greater probabil-
ity of survival in the 11/6/1 game than any
other game type in the mutual defense (MD),
coalition parity (CP), and coalition 50/50
(C½) conditions (t_MD = 10.56, t_CP = 10.99;
t' = 10.56, α = .01; t_C½ = 10.00; t' = 9.90,
α = .05). Player A also had a significantly
greater probability of survival than Players B
or C in the 11/6/1 game for all alliance types
(t = 1,338.03; t' = 10.56, α = .01). There
were no other significant differences in the
probability of Players A, B, or C surviving.
There was a significantly greater probability
of at least one of the players surviving in the
alliance games for the nonaggression (NA),
coalition parity, and coalition 50/50 condi-
tions in the 6/6/6 through 10/6/2 games
(t_NA = 11.92, t_C½ = 10.62; t' = 10.56, α =
.01; t_CP = 10.41; t' = 9.90, α = .05). No
significant differences for alliance type or con-
flict type were indicated in the 11/6/1 games,
and no significant differences for conflict type
were indicated in the mutual defense condi-
tion.


DISCUSSION 


The data supported the hypothesis that the 
subjects would exhibit an inverse relationship 
between the propensity to pass and disparity 
of relative strengths in the pass option games. 
In general, the mean number of attacks per 
game decreased as the disparity of relative 
strengths increased in all game types. How- 
ever, a close examination of the data sug- 
gested that there was no real reduction in 
conflict behavior as a function of the pass op- 
tion. There was no significant difference be- 
tween the mean number of attacks per game 
in the pure truel and pass option games, and, 
except for the 6/6/6 game type, there was no 
significant difference in the propensity to at- 
tack given the opportunity to attack as a 
function of the disparity of relative strengths. 
Moreover, the lowest mean probability of 
attacking (one minus the highest mean proba- 
bility of passing) was .83. Thus, even though 
the probability of passing increased as the 
disparity of relative strengths decreased from 
the veto point to the all-equal point, there 
was a propensity to attack across that same 
range on the disparity-of-relative-strengths 
continuum. The failure to obtain a reduction 
in conflict behavior in the pass option games 
suggests that given limited communication
and a history of conflict, the strategy of re- 
solving potential conflict situations through 
cooperative behavior is not prominent. 

Although there was no significant reduction 
of conflict behavior in the pass option games, 
there was a significant reduction of conflict 
behavior in the alliance games. Such a result 
suggests that with the addition of a means of 
communication, the level of conflict behavior 
may be lowered. However, the data indicated 
that there was no significant reduction in the 
probability of attacking given the opportunity 
to attack in the alliance games. Thus, it is 
conceivable that the observed reduction in 
conflict behavior in the alliance games was a 
function of the fact that the coordination of 
attacks by the alliance partners required fewer 
attacks to end a game. The propensity for 
conflict behavior may not have been reduced 
in the alliance games; it may merely have
been limited by the rules governing the situ-
ation. Such an assumption was supported by
the data from the mutual defense games, which
showed a propensity to attack one’s alliance
partner once the isolate was eliminated. Thus,
it is likely that the reduction in conflict be- 
havior in the alliance conditions that re- 
quired one to honor an alliance after the iso- 
late was eliminated was a function of the 
lack of an opportunity to resort to conflict 
behavior. In essence, it appears that given
incompatible payoffs and the possibility of
obtaining one’s payoff through conflict,
conflict behavior will result. Such
an assumption gains further support from the 
fact that an increase in the probability of at 
least one player surviving was evident in those 
alliance games in which the alliance partners 
were forced to honor the alliance after the 
isolate was eliminated. No such increase in 
the probability of at least one player surviving 
was noted in the alliance games in which the 
alliance partners could attack each other once 
the isolate was eliminated. 

Another interesting note concerning the 
probability of survival is that, with one ex- 
ception, none of the player positions had a 
significantly greater probability of surviving 
than any of the other player positions. The 
one exception was that the dictator in the 
dictator situation will survive in almost every
case. This would suggest that the notion of 
power inversion is not supported in the truel. 
Rather than supporting Shubik’s (1954) hy- 
pothesis that in the truel it does not pay to 
be slightly stronger than the other partici- 
pants, it suggests that it does not pay to enter 
into a truel unless one is strong enough to 
guarantee that one cannot lose. 

The mutual defense and nonaggression alli- 
ance games were designed to examine the 
effect of trust on the preferred alliance part- 
ners. It was predicted that below the veto 
point in the mutual defense games, the most 
preferred partner would be the opponent who 
could do the least damage once the isolate was 
eliminated. In the nonaggression games, on 
the other hand, it was predicted that the pre- 
ferred alliance partner would be the opponent 
who could do the most damage if he remained 
the isolate. Both of these predictions were 
supported in the 7/6/5 and 9/6/3 games; 
however, both of them failed to gain support 
in the 8/6/4 game. In both the mutual de-
fense and nonaggression 8/6/4 games, the ini-
tial contact data were ordered in the predicted
direction, but the disparity of contact choices 
was attenuated such that the probability of 
the distribution being obtained by chance was 
between .10 and .20 for both conditions. The 
question that cannot be answered by the data 
from the present study is, What caused the 
attenuation of the effect in the 8/6/4 game? 
To answer that question will require further 
research aimed first at discovering if the effect 
is replicable. If the effect proves to be replica- 
ble, it will then be reasonable to focus re-
search on its cause or causes.

At the veto point and above, it was pre- 
dicted that the effect of trust would be over- 
come by the desire to win, and that the 
strongest opponent would be the preferred 
alliance partner of both of the weaker op- 
ponents in the nonaggression and mutual de- 
fense games. The data strongly supported
that prediction in the nonaggression games
but equally strongly rejected it in the
mutual defense games. In fact, in the mutual
defense games, there was a significant pref- 
erence for the weaker opponent which ap- 
peared to be a function of a belief that the
stronger opponent would not honor an alli-
ance once the isolate was eliminated. Con-
sideration of that contradictory result indi-
cates that trust or the lack of trust is still
the major variable operating at the veto
point and above. If one of the weaker play-
ers forms an alliance with the strongest
player, he must trust the strongest player
not to eliminate him once the isolate is
eliminated. That is, if an alliance is formed
with the stronger opponent at the veto point, 
once the isolate is eliminated, the stronger 
player would have complete control over 
the allocation of the 100-point payoff. Data 
from the present experiment indicate that it 
is probable that the strongest player would 
take advantage of such a situation to add 
the 100 points to his point total. Thus, with 
long-range goals in mind, alliances are 
formed between the two weaker opponents, 
which insures an outcome no worse than the 
status quo. 

The data from the dictator situation re- 
flect the same line of reasoning. In the non- 
aggression games the weaker players both 
preferred the strongest opponent since the 
only way to insure that some portion of the 
100-point payoff was obtained was to form 
an alliance with the dictator. If the dictator 
chose not to form an alliance, the most that 
could be lost was the effort required to push 
the contact button. On the other hand, if 
the dictator chose to form an alliance, some 
portion of the 100-point payoff would be 
gained. In the mutual defense game, the 
probability of either of the weaker players 
gaining some portion of the 100-point payoff 
was low. If they contacted the dictator and 
he reciprocated, they still had to complete 
the game hoping that once the isolate was 
eliminated, the dictator would not choose to 
eliminate his alliance partner. Since the 
strongest player often chose to contact the 
stronger of his two opponents when he did 
make a contact, it was reasonable to assume 
that he wanted to negate the power of his 
strongest opponent during the alliance phase 
of the experiment and eliminate him once 
the isolate had no points remaining. Thus, 
unless the weaker players trusted the 
strongest player, it was reasonable to either 
choose not to form an alliance or to form
one with the weaker opponent. Since the 
contact phase of the game lasted 14 minutes 
if the players chose not to form an alliance 
and only the length of time it took to make 
a reciprocal contact if the two weaker play- 
ers formed an alliance, the time spent in the 
situation was shortened considerably by form- 
ing an alliance. Moreover, the outcome was 
probably not changed. 

With the exception of Caplow (1959), the 
concept of trust has been virtually ignored in 
alliance-formation research. It has been com- 
monly accepted that given the chance to form 
alliances, alliances will inevitably be formed. 
Caplow’s hypothesis that in the terminal situ- 
ation alliances will not be formed was sup- 
ported by the difference between the number 
of alliances formed in the nonaggression and 
mutual defense games. In the 7/6/5 games, 
there were 45 nonaggression and 16 mutual 
defense alliances formed; in the 8/6/4 games, 
there were 58 nonaggression and 17 mutual 
defense alliances formed; and in the 9/6/3 
games, there were 59 nonaggression and 7
mutual defense alliances formed. A chi-square
which tested the hypothesis that there was
no difference in the number of alliances
formed in the nonaggression and mutual de-
fense games rejected that hypothesis (χ² =
77.17, df = 2, p < .001). Thus, the data indi-
cated a significant propensity to form fewer 
alliances in the mutual defense games (the 
terminal situation) than in the nonaggression 
games. The combined data examining the 
mutual defense and nonaggression games were 
considered as support for the hypothesis that 
trust is an important variable in the consid- 
eration of alliance partners as well as the deci- 
sion to form alliances. Although the present 
study did not pretend to explore fully all of 
the possible ramifications of trust, the results 
did point out that trust is an important varia- 
ble in alliance formation. 
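
The text does not say how the chi-square statistic was computed. One reading that happens to reproduce the reported value of 77.17 is to test, within each game type, the observed nonaggression and mutual defense counts against an equal split and then sum the three statistics. The sketch below follows that reading; it is an assumption about the procedure, and the function and variable names are illustrative only.

```python
# Alliance counts copied from the text: (nonaggression, mutual defense).
alliances_formed = {
    "7/6/5": (45, 16),
    "8/6/4": (58, 17),
    "9/6/3": (59, 7),
}


def equal_split_chi_square(observed):
    """Chi-square of observed counts against equal expected counts."""
    expected = sum(observed) / len(observed)
    return sum((count - expected) ** 2 / expected for count in observed)


# ASSUMPTION: the reported statistic is the sum over game types of the
# per-game-type statistics; the source does not state this.
per_game_type = {
    game: equal_split_chi_square(counts)
    for game, counts in alliances_formed.items()
}
total_chi_square = sum(per_game_type.values())

for game, value in per_game_type.items():
    print(game, round(value, 2))
print("total", round(total_chi_square, 2))
```

Under this reading the 9/6/3 games contribute most to the total, which is consistent with the very small number of mutual defense alliances (7) formed there.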

One of the major focuses of the present 
experiment was a test of the parity principle. 
The parity norm as postulated by Gamson 
(1961a) states that the participants in an 
alliance situation would expect the other par- 
ticipants to demand a share of the payoff that 
is proportionate to the share of the resources 
they contributed to the coalition. Subsequent
research (e.g., Chertkoff, 1966; Gamson,
1961b; Phillips & Cole, 1970) has tended to 
support the theory that the parity norm has 
a major role in determining preferred alliance 
partners; however, there seems to be a com- 
plete lack of published research that has di- 
rectly tested the parity norm. The present ex- 
periment did not test the parity norm; how- 
ever, it did directly test the parity principle 
by forcing a parity split of the payoff in one 
condition and forcing a 50/50 split of the 
payoff in another condition. 
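
The difference between the two forced splits can be made concrete with a short calculation. The sketch below assumes that the numbers naming a game type (e.g., 7/6/5) are the resources each player brings to an alliance and that the payoff is the 100 points mentioned in the text; the function names are hypothetical.

```python
def parity_split(resources_self, resources_partner, payoff=100.0):
    """Divide the payoff in proportion to the resources each partner contributes."""
    total = resources_self + resources_partner
    return payoff * resources_self / total, payoff * resources_partner / total


def equal_split(payoff=100.0):
    """Divide the payoff evenly (the 50/50 condition)."""
    return payoff / 2, payoff / 2


# Player with 6 units in a 7/6/5 game, considering each possible partner.
share_with_weaker, _ = parity_split(6, 5)    # 600/11, about 54.5 points
share_with_stronger, _ = parity_split(6, 7)  # 600/13, about 46.2 points
share_equal, _ = equal_split()               # 50 points with either partner

print(round(share_with_weaker, 1), round(share_with_stronger, 1), share_equal)
```

Under a parity split the middle player earns more by allying with the weaker opponent, whereas under a 50/50 split the choice of partner does not change the share.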

The data from the present experiment 
showed strong support for the proposition 
that the weaker opponent would be the pre- 
ferred alliance partner in the coalition par-
ity condition. However, the viability of the
hypothesis that the parity norm is the basis 
for the formation of weak alliances was weak- 
ened by a preference for weak alliances in the 
coalition 50/50 condition. There was a 
nonsignificant preference for the weaker op- 
ponent in the 8/6/4 and 9/6/3 games and a 
significant preference for the weaker opponent 
in the 7/6/5 games. In the 7/6/5 and 9/6/3 
games, the distribution of alliances formed 
was not random, and a similar but nonsignifi- 
cant result was observed in the 8/6/4 game. 
Weak alliances were the most preferred, fol- 
lowed by strong alliances and medium 
strength alliances, in that order. 

An explanation of the results that might
account for the unexpected nonrandom pref-
erence for alliances in the coalition 50/50
condition has been proposed by Caplow 
(1968). Caplow noted an unusual result in 
the Vinacke and Arkoff (1957) all-equal 
game. Vinacke and Arkoff labeled their play- 
ers A, B, and C and reported that in the all- 
equal game there was a nonsignificant pref- 
erence for BC, AB, and AC alliances, in that 
order. Caplow suggested that the propinquity 
of the letters was used as a cue to determine 
which alliances would form. It is possible 
that the same effect might have occurred in 
the present experiment by using the propin- 
quity of the numbers as cues for forming alli-
ances. For example, in the 7/6/5 game, 7/6
or 6/5 alliances would be preferred. Since any 
alliance that formed would win and the 100- 
point payoff was to be evenly divided, it is
not unlikely that such was the case. 

If we consider Caplow’s interpretation of 
Vinacke and Arkoff’s results to be valid and 
apply it to the present study, it would ac- 
count for the nonrandom preference for alli- 
ances in the coalition 50/50 games; how- 
ever, it would not account for the high proba-
bility of forming weak alliances in the coali-
tion parity games. It is conceivable that
the strong preference for weak alliances in 
the coalition parity games was a function 
of the fact that the forced parity split of the 
payoff was a much stronger manipulation 
than the effect of the parity norm, which is 
based on the subjective interpretation of the 
situation by each subject. The present study 
has examined the effect of a strong parity 
principle on the preference for alliance part- 
ners as well as the effect of the absence of a 
parity norm. To test the parity norm as pro-
posed by Gamson, it will be necessary to find
a means of measuring and manipulating the 
subjective estimation of the strength of the 
parity norm by the participants in an alli- 
ance situation. 


CONCLUSION 


The purpose of the present study was two- 
fold. First, it examined the strategies em- 
ployed in potential uelative conflict situa- 
tions; and second, it examined potential uela- 
tive conflict situations in which alliances are 
allowed. An additional reason for the study 
was to determine whether providing a means 
for terminating a situation through coopera- 
tive behavior would lower the level of conflict 
in a situation that would, otherwise, have been 
characterized by intense conflict. 

The observed behavior in the potential uela- 
tive conflict situations suggested that merely 
providing a means for terminating an intense 
conflict situation by cooperative behavior is 
not sufficient to reduce the level of conflict. 
No difference was noted in the propensity to 
attack in the pure truel, pass option, or alli- 
ance games. There was a significant propen- 
sity for at least one of the players to survive 
in the alliance games in which the alliance 
partners were forced to honor the alliance un- 
til the game ended. The increase in the proba-
bility of at least one player surviving in those
alliance conditions was probably a function of 
the fact that it was easier to arrange for two 
players to survive in the alliance games than 
in the pass option games. 

It was possible for two or three players to 
survive in both the pass option and alliance 
games; however, there were virtually no cases 
in which all three players survived and very 
few cases in which two players survived when
an explicit alliance was not allowed. Thus, it 
appears that given incompatible goals, limited 
communication, the need for complete trust 
in your opponent, and a means by which a 
situation can be terminated by conflict, the 
conflict strategy will be chosen. Given the 
same incompatible goals, an improved channel 
of communication, and a negation of the 
need to trust your opponent, a cooperative 
strategy is likely. The same incompatible 
goals and an improved channel of communica- 
tion without a negation of the need to trust 
your opponent result in the selection of a 
quasi-cooperative strategy that deteriorates 
into a conflict strategy at the first chance. 
This theoretical paper treats motive as a cognitive representation of a goal
with no necessary relation to either action or affect. It is suggested that
resolution of uncertainty, hostility, and mastery are primary motives that can
lead to similar classes of secondary motives and phenotypically identical
goal-related behaviors. A developmental scheme for motive organization is 
discussed, based on the view that changing nodes of uncertainty control the 
hierarchical position of secondary motives. 


The generic concept of motive is one of 
man’s oldest psychological ideas because, be- 
lieving in causality, he requires a word that 
summarizes the forces that activate and main- 
tain actions, images, and thoughts. Concepts 
like motive, need, drive, want, and wish have 
been invented to supply the missing term. 
However, the definitions proposed and the 
supporting evidence cited have differed de- 
pending on the psychological phenomena 
deemed most important to explain and the 
implicit assumptions about what organisms 
wanted most to attain. James (1890), who 
wanted to understand human cognitive and 
affective states, and Troland (1928), who 
inquired into sensory preferences, were unique, 
for most psychologists during the past half- 
century agreed that behavior, or more re- 
cently, change in behavior (Atkinson, 1969), 
was the dependent variable most important 
to predict and interpret. Despite some dis- 
agreement on details, Atkinson (1958, 1964), 
Berlyne (1960, 1967), Festinger (1957), 
Hull (1951), Hunt (1965), McClelland 
(1961), Tolman (1932), Young (1936), and 
others, took it as an unquestioned assumption 
that motives had to do with analyses of fac- 
tors that incited and directed action. 

The disagreements centered on the nature 
of the goal sought, whether it was pleasant 
sensations, optimal physiological arousal, re- 
duction of stimulation, optimal stimulus com- 
plexity, or reduction of dissonance. They did
not disagree on the strong link between mo- 
tive and behavior. This prejudice attenuated 
the significance of cognitive representations of 
goals that were not associated with changes 
in action. It seems intuitively true that a per- 
son can sit perfectly still for 10 minutes while 
his mind entertains, in succession, intensely 
pleasant ideas of love, power, and fame. More- 
over, as Hinde (1970) noted, the play be- 
haviors of young animals—fighting, hunting, 
and nesting—are largely independent of the 
drive or motivational factors that generate 
these behaviors in the adult animal. That is, 
goal-related actions can appear in the young 
animal at a time when the physiological state 
defining the adult drive is not operative. 
Hence, an action, no matter how microscopic 
its description, can be misleading as to its in- 
tention or the cognitive structures that define 
potential intentions. Moreover, cognitive rep- 
resentations of goals can change without 
changes in action. It seems useful, therefore, 
to examine critically the relation of behavior 
to motive and, pari passu, the status of the 
construct “cognitive representation of a goal.”

During the past half-century, American 
psychologists have not wanted to acknowledge 
the division between motive, on the one hand, 
and action on the other. Motivation was 
viewed as a close kin of drive and related to 
an energy construct and, necessarily, to ac- 
tion. Young (1936) postulated that motives 
release energy and as a result initiate activ- 
ity. Hull (1951) and Miller (1951) allowed 
drives to be the source of energy and made 
motives continuous with and derivative from 
them, a notion that was complementary to 
Freud’s (1930) suggestion that man requires 
a biological force to push him into action. The
conception of motivation as the energy base 
for behavior had to be accommodated to the 
zealous operationalism of the period following 
the 1930s. The new and tougher empiricism 
rejected a term like energy, for it did not 
permit a satisfactory measurement program. 
The conflict became painfully clear. Psychol- 
ogy wished to retain the concept of motive, 
while it moved closer to operationalism. An 
easy redefinition of motive was gradually ac- 
cepted by many psychologists. Motivation was 
to be defined as a disposition to action rather 
than as a nebulous energy source. Even Tol- 
man (1932), who was the most mentalistic of 
the behaviorists, defined a sign gestalt as a 
construct that informed the animal of how it 
might satisfy its needs. The representation 
was not of a goal qua goal, but of a way to 
attain the goal. This view implies that if 
there were a sign gestalt for power, it would 
contain the instrumental means of gaining 
access to that resource. This view, coinciden- 
tally, is similar to Piaget’s (1967) conception 
of the infant’s sensory-motor schemes during 
the first 18 months, for the scheme contains 
both the representation of the object to be 
grasped as well as the means of grasping it. 
This position, which is demanded neither 
by logic nor data, allowed scientists to inves- 
tigate drives and motives by studying be-
havior. It provided a rationale for what psy-
chologists wanted to do, namely, quantify ob- 
servable responses, and led to a dictionary of 
basic motives tied to behavior and social 
interaction (Atkinson, 1958; McClelland,
Atkinson, Clark, & Lowell, 1953; Murray, 
1938). Achievement, affiliation, power, suc- 
corance, and dependency, which were defined 
in terms of specific responses one could see 
and count, became the primary human mo-
tives.³


3 This psychological strategy illustrates a general
human tendency to invent a rationale for an action
one has decided to initiate or has already initiated.
A recent example of this human fallibility derives
from the realization that hospitals are neither bene-
ficial to most mental patients, nor economically
feasible to build or staff. Hence, pragmatic considera-
tions dictate the invention of different techniques of
treatment. An obvious alternative is to work with
the patient in the community in which he lives, and
the community psychiatry movement is born. Courses
and texts in community psychiatry are executed
while small seminars in closed rooms try to decide
what community psychiatry means. The decision to
move hospital patients into the community required
a theoretical statement that would protect the doctor
from the unpleasant recognition that the actions
might not be rational.

Although the disposition to action or change
in action is still the popular definition of mo-
tive (Atkinson, 1958), the history of re-
search issuing from this stance has led inves-
tigators to make a sharper conceptual differ-
entiation between a goal-related action (e.g.,
a behavior in a natural context, a score on a
test, or a story told to a picture) and a mo-
tive. Twenty years of research suggests that
a representation of a goal and an action aimed
at attaining it are different constructs. More-
over, the suspicion that any action is am-
biguous with respect to its intended goal has
continued to gnaw at the confidence of those
trying to quantify motives by examining goal-
directed responses. Contemporary psycholo-
gists remain only a little less frustrated than
their predecessors, committed to the necessity
of the motive concept but lacking satisfactory
strategies of measurement. There is, how-
ever, one salutary change in the theoretical
climate. Psychologists have become more re-
ceptive to conceptualizing motives as inti-
mately continuous with other cognitive proc-
esses (Festinger, 1957; Janis, 1959). Many
are ready to view a motive simply as a spe-
cial class of idea. Indeed, McClelland (1961)
now defines the achievement motive as an
ideational phenomenon reflected in fantasy
and associations rather than action. This is
not the first time that philosophers or psy-
chologists have focused on a mentalistic con-
ception of goals, for Bentham’s (1914) list
of 15 primary pleasures contains many cog-
nitive states, including the belief that one is
pious, nurturant, or creative.

The primary purpose of this paper is to
explore the usefulness of assuming that human
motives be viewed as cognitive representa-
tions of goals that have no necessary link to
action. The second aim is to suggest that the
cognitive bases of motivation are not the
same for all classes of human motives, and,
therefore, no simple mechanism will account
for all of human motivation.

Let us begin by considering three problems
that have always surrounded the relation be- 
tween human motivation and behavior. The 
first concerns the primary sources of physio- 
logical activation (or arousal) that push the 
organism into action and determine when he 
is gratified (Berlyne, 1967). There are two 
views on this matter, the extremes of each 
being too simple to be correct. One view as- 
sumes undifferentiated arousal systems lo- 
cated in the midreticular areas, hypothalamus, 
limbic lobe, and lemniscular sensory path- 
ways. Discharge of neurons in these systems 
can help activate specific actions or cognitive 
structures, but the specific psychological struc- 
ture activated is always a serious function of 
situational and cognitive constraints. This 
view finds its most loyal advocates among 
Schachter and his colleagues (Schachter, 
1964), who view the physiological system as 
simply providing initial, undifferentiated vis- 
ceral information for the mind to mold into 
motive, belief, and conclusion, much like a 
potter shapes an unformed mass of clay. 

Experimental studies of animals do not sup- 
port such a simplistic conception, for brain 
stimulation studies indicate preferential links 
between specific nuclear masses and particu- 
lar classes of behavior (Grossman, 1960; 
Teitelbaum, 1961). The alternative view holds 
that every major class of goal-directed be- 
havior is yoked to a locus in the central ner- 
vous system. This hypothesis is also too ex- 
treme, for Valenstein, Cox, and Kakolewski 
(1970) have shown that an animal’s behavior 
following stimulation of some hypothalamic 
site depends on the immediate context. If 
food is available, he eats; if water is available, 
he drinks; if wood chips are available, he 
gnaws. Hence, the most reasonable conclu- 
sion at the moment is that discharge of retic- 
ular, hypothalamic, or limbic structures can 
have both general and particular functions. 
Future research must specify the functions, 
for the meaning of drive hangs on this infor- 
mation. It seems clear that deprivation of life- 
sustaining resources like food, water, or sleep 
leads to discharge of specific sites in the cen- 
tral nervous system. The pattern of discharge 
in any of these loci is usually regarded as 
synonymous with a drive state. Deprivation
of food alters the blood sugar level which is
followed, in turn, by chemical changes that 
eventually activate specific cells in the hypo- 
thalamus. Discharge of these cells is as- 
sumed to change response thresholds for be- 
haviors serving the seeking and eating of 
food. Thus, drive states differ along two di- 
mensions: the specificity versus generality of 
their attachment to specific actions and the 
quality and intensity of visceral afferent 
sensory feedback, which provides information 
to the organism in the form of a change in 
perceived feeling. Drive states vary in the 
quality of the feedback, and motivation is 
intimately linked to this information. 

A second problem is the nature of the 
relation of behavior to physiological arousal, 
for each may occur without the other. The 
various forms of physiological arousal do not 
always lead to changes in behavior, for elec- 
trocortical or autonomic indexes of arousal 
can be observed in human subjects who are 
sitting completely still. Moreover, a specific 
state of arousal can lead to different behaviors 
depending on the context (Schachter, 1964; 
Valins, 1966). 

The third problem is contained in the ar- 
gument over whether the concept of motive, 
separate from drive and instrumental condi- 
tioning, is necessary to explain behavior. 
Some behaviors can be elicited and main- 
tained by either continued states of arousal 
(intracranial brain stimulation) or a particu- 
lar context (standing in a crowded subway), 
and it is not necessary to invent any new 
constructs to explain or predict these re- 
sponses. However, the need for an additional 
construct becomes apparent when a change 
in behavior occurs that is not accompanied 
or preceded by a change in environmental 
context or arousal. For example, 300 students 
file from a lecture hall at the end of an hour 
while 1 student, who rose at 5:30 that morn- 
ing, remains writing in the auditorium. We 
require a concept to explain these actions for 
they cannot be understood by a reference to 
context or quality of internal arousal. These 
actions seem directed at future goals, and 
most psychologists use the word motive to 
represent this theoretical idea. 


DEFINITION OF MOTIVE 


Since we feel it is useful to separate the 
cognitive representation of a goal from action, 
we define a motive as a cognitive representa- 
tion of a future goal state that is desired. 
Although a motive can be activated and in- 
fluenced by specific classes of arousal or drive, 
and is often associated with particular be- 
haviors, it need not be. Motive is a category 
of idea and can occur without a change in 
arousal or action. The difference between a 
motive and any other cognitive representation 
(e.g., the representation of one’s closet, the 
product of 11 x 12, the concept of gene) 
rests with the meaning of the word goal. A 
goal is a cognitive representation of an event 
in the future that the person believes will
permit him to feel better. Although this re- 
statement of the hedonic principle may sound 
pollyannish, the reader is urged not to aban- 
don the argument yet, for the events that one 
thinks will bring pleasure are likely to have 
different origins. This paper suggests four dif- 
ferent classes of goals from which many hu- 
man motives are derived. It is not suggested
that cognitive representations of these states 
exhaust the motive domain, but rather that 
they comprise important categories of human 
motives. 

Cognitive representations of goals exist in 
either a latent or active state, a difference 
that is captured by the terms motive and 
motivation, respectively. The latter state fol- 
lows cognitive activation of a goal and re- 
sembles a mental set. When a person acti- 
vates a representation of a goal, he prepares 
himself for a particular set of experiences, 
much as a person prepares himself to see and 
hear particular events when he switches the 
television channel from a news broadcast to 
the replay of a Gary Cooper movie. 


SENSORY MOTIVES


There is a small set of sensory events that 
people report as pleasant whose source can be 
traced to the pattern of stimulation within 
specific receptor areas and perhaps eventu- 
ally to rather specific sites in the central 
nervous system. Sweet tastes, cessation of 
pain, warmth, genital stimulation and orgasm, 
tactile stimulation, and kinesthetic discharge 
are among this group. Cognitive representa-
tions of these goal states are called sensory 
motives, and every theorist has postulated 
them as basic to man, although Troland 
(1928) and Young (1936) gave them special 
status. Thus, we affirm the idea that sensory 
hedonism is part of man’s motivational 
repertoire but reject the implication that it 
is the origin for all human motives. Sensory 
motives are different from the remaining mo- 
tives to be considered, for gratification of sen- 
sory motives is accompanied by discharge of
specialized receptors and their cortical pro-
jections. No such parallelism is assumed for 
the remaining three motives. 


THE MOTIVE TO RESOLVE UNCERTAINTY:
THE WISH TO KNOW


The goal of the second class of motives is 
the resolution of uncertainty, and the heart 
of this paper is devoted to it and its deriva- 
tives. The individual is continually creating 
representations of experience, some of which 
can be matched to reality easily, others with 
difficulty. I think of a birch tree and turn to 
look out the window and view one. The rep- 
resentation of a flamingo is not matched as 
easily. If the representation is of a desired 
experience whose realization is uncertain, the 
person becomes motivated to resolve the un- 
certainty. Uncertainty is characterized, in
part, by incompatibility between cognitive 
structures, between cognitive structures and 
experience, or between cognitive structures 
and behavior. Psychologists do not have many 
models of incompatibility that might aid 
understanding of this idea. Incompatibility 
between molecules is a result of structural 
differences between them. Incompatibility
between people is the result of responses that
prevent the two agents from cooperating or 
agreeing. Incompatibility between ideas oc- 
curs when the obverse of one follows from the 
other (Festinger, 1957). If cognitions are
viewed as symbolic propositions in a hier- 
archic organization, incompatibility means 
contradiction. If a person possesses an idea 
about a domain of experience, and another 
idea, activated from memory or freshly 
sensed, contradicts the older one, incompati- 
bility is generated. This view has its histori- 
cal roots in Festinger’s (1957) theory of cog-
nitive dissonance and is supported by the 
results of many empirical studies that were 
born of that view (Festinger, 1964; Zim- 
bardo, 1960, 1969). 

One source of uncertainty is encounter 
with an event that is discrepant from an 
established schema. The discrepant event can 
arise in the outside world (a friend’s failure to 
smile) or from within the body (intermit- 
tent throbbing of the forearm). The discrep- 
ant event recruits the individual’s attention 
and alerts him, as if the trigger of a gun were 
cocked. The person is now in a state of un- 
certainty and is primed to resolve it by as- 
similating, removing, or escaping from the 
discrepant event.⁴ If the person cannot realize
any of these goals, uncertainty mounts, and 
affective distress is likely to occur. This dis- 
tress is often called fear or anxiety. If he 
can interpret or act on the discrepancy in 
some way, uncertainty is controlled, and the 
distress will not occur. The specific reactions 
initiated to serve this primary motive to re- 
solve uncertainty will depend on each person’s 
individual history, and Hunt (1965) has made 
this dynamic the central idea in his notion of 
intrinsic motivation. However, Hunt ignored
three other sources of uncertainty with moti- 
vational significance. 

Two related sources of uncertainty arise 
from incompatibility between two ideas, or 
between an idea and behavior. A child who 
regards his father as wise overhears a relative 
criticize the father’s decision, or the child 
regards himself as honest and has just told 
a lie. The child now has two incompatible 
beliefs; he is uncertain about the one to 
which he should subscribe, and he is pressed 
to do something to resolve the inconsistency. 
If it cannot be resolved, uncomfortable feel- 
ings will be generated. 

4 Janis’ (1959) discussion of reactions to deci-
sional conflicts presents a parallel analysis, for he
noted that conflicts can be resolved by ignoring or
avoiding the information, discounting the com-
munication, intellectually denying threatening in-
formation, forgetting, and inventing new reasons for
a decision.

A final source of uncertainty is inability to
predict the future, especially if the doubt
centers on the experience of potentially unpleasant
events like punishment, physical
harm, failure, or rejection. Unpredictability 
implies that the person does not know what 
behaviors and mental sets to activate in prep- 
aration for the future, and potentially in- 
compatible dispositions or ideas can be gen- 
erated. To be unsure of the future affection 
of another is synonymous with entertaining 
simultaneously the contradictory beliefs “she 
loves me—she loves me not.” If the person 
cannot ignore the uncertainty, increase his 
belief in one outcome, or reassure himself, 
distress is likely to follow. 

Each of these sources of uncertainty leads 
to a primary motive to resolve it, and if no 
appropriate responses are available, distress, 
anxiety, fear, shame, or guilt may occur. The 
specific affect experienced is a joint function 
of the quality of afferent feedback and the 
source of the uncertainty. Examination of the 
psychological literature suggests a preference 
for using the word anxiety when the source 
of the distress is unpredictability, fear when 
the source is a discrepant event, and shame or
guilt for inconsistency between ideas, or be- 
tween idea and action. 

The person may select different strategies 
to resolve the uncertainty. One child may 
have learned that a close relationship to a 
parent is effective; he is often regarded as 
having a dependency motive. Another acti- 
vates the wish for adult praise for an accomplishment;
he is said to have a motive
for recognition. A third seeks a weaker child 
to bully; he is described as possessing a mo- 
tive for dominance.⁵ Since children are raised
by human beings rather than machines and 
since adults often resolve the child’s uncer- 
tainty, most people seek reassuring relation- 
ships with others when they are uncertain. 
But there is a small minority who wish to 
withdraw from people when they are uncer- 
tain, because others have not helped them 
resolve uncertainty in the past. Each of these
more specific motives and their goal-related
behaviors can be in the service of the pri-
mary motive to resolve uncertainty. The
cognitive representations of these more concrete
goals are regarded as secondary motives.

5 When the child is subordinate to another, he can
never be sure when the dominant one will demand
or coerce him to conform to some request. The
subordinate role always contains more uncertainty
than the dominant position with others.

The potential variety of secondary motives 
can be best appreciated by comparing the 
salient motives of contemporary Western soci- 
ety—achievement, dominance, sexuality, au- 
tonomy, affiliation, distinctive identity— 
with those promoted by the early classical 
Chinese philosophers, especially those of the 
eighth century Ch’an school. The salient 
goals to be attained were absence of thought, 
desire, or motivation. The dominant motive
was to be without a motive (Yu-Lan, 1953). 
Each society promotes a set of secondary mo- 
tives that helps to resolve uncertainty. The 
reasons for the variety in secondary motives 
are not clear, but they probably derive from 
specific environmental presses local to each 
society. For example, observations of bush- 
men living in Botswana suggest that the un- 
certain availability of a regular supply of 
meat makes a motive for food salient in the 
preoccupations of these people, while money 
and status seem to play these roles in West- 
ern society. 

Most of the popular motives normally as- 
cribed to children and adults by novelists or 
psychologists in Western culture, such as 
achievement, affiliation, power, dependency, 
nurturance, or succorance, can be derivatives 
of a primary motive to resolve uncertainty 
and alleviate subsequent affective distress. 
The affective distress, be it anxiety, fear, 
shame, or guilt, only emerges when the per- 
son cannot assimilate, remove, or act upon— 
in short, cope with—the original source of 
uncertainty. The aborting of uncomfortable 
feeling through activation of a coping reac- 
tion can be observed in many situations. For 
example, a 1-year-old watches his mother 
leave him alone in a strange room. If he is not 
actively engaged in any activity, he is more 
likely to cry than if his attention is absorbed 
in some action. A 10-month-old placed in a 
closed room cries within a few seconds. But 
if he is placed in an adjoining room with his 
mother and the door to the original room is 
left ajar, he will crawl into it. Although he is 
out of the mother’s sight, he does not cry. He 
becomes alerted, looks around, and crawls 
back to his mother. The availability of a 
response to an issue when alerted by the dis-
crepant scene blocked the emergence of fear 
(Rheingold & Eckerman, 1970). 

However, it is important to note that the 
child must be developmentally mature 
enough to recognize the separation from his 
mother as a discrepant event, or it will not 
generate uncertainty. Most American children 
do not begin to show fear when their mother 
leaves them with a stranger until they are 
9-12 months of age (Kotelchuck, 1971). 

The power of context-relevant behavior to 
contain fear is also seen in an experiment in 
which monkeys were trained to press a lever 
every few seconds in order to avoid a painful 
shock. After the “lever pressing” had become 
habitual, the experimenter changed the ar- 
rangement of shock so that the first “lever 
press” the animal made after 5 tranquil min- 
utes produced a shock, rather than eliminated 
it. After this training had been completed, 
the shock was eliminated completely at ran- 
dom intervals for periods as long as 50 min- 
utes. Under this last condition, the animal’s 
rate of lever pressing doubled (Stretch, Or- 
loff, & Dalrymple, 1968). The absence of 
shock for so long a time may have been a 
discrepant event that alerted the monkey 
and prompted him to exhibit the one response 
that was ascendant in this context, namely, 
pressing the lever. Similarly, Solomon and 
Wynne (1953) reported that dogs exposed to 
shock in an avoidance training situation 
eventually stopped showing autonomic signs 
of distress, but continued to display the 
avoidant behavior, as if the behavior inhibited 
fear. Lazarus’ (1966) work on human sub- 
jects also implies that the activation of cog- 
nitive or behavioral structures in uncertain 
situations aborts fear or anxiety. Indeed 
Mandler (1964) came close to the position, 
when he stated that, 


anxiety appears as the emotion of choice in those 
cases where the onset or offset of visceral arousal is 
not under the organism’s control. Where no other
response is readily available to prevent, undo, or
terminate the emotional effect of interruption (e.g.,
by substitution) we would expect the emotion ob- 
served to be anxiety or distress or fear... . It is 
thus lack of environmental control, this helplessness 
in the face of interruption that is responsible for the 
high frequency of negative affect in response to 
interruption [p. 179]. 
and later, Mandler stated that “the anxiety
phenomenon is seen as conditional upon the 
absence of appropriate environmental stimuli 
and associated responses [pp. 184–185].”

The varied schools of psychotherapy offer 
an example of this principle. Each strategy of 
therapy seems to be effective with a particu- 
lar set of symptoms and patients, although the 
means by which each method attains its re- 
sults are dramatically different. Consider two 
treatment strategies with a woman who feels 
afraid in crowds. A psychoanalyst may tell 
her that she is afraid of suddenly losing con- 
trol of herself and undressing in public. He 
reassures her that she can control these im- 
pulses by reminding herself of the historic 
cause of her fear the next time she is in a 
crowd. She believes this interpretation for it 
is congruent with her theory of personality 
and remediation. The next time she is in a 
crowd and feels anxiety mounting, she is re- 
minded of the office interpretation. She re- 
hearses it and reassures herself that she is 
capable of inhibiting her urge to undress in 
public. As she becomes engrossed in these 
thoughts, her anxiety diminishes. “Thinking 
about the interpretation” recruits her atten- 
tion, and anxiety is reduced. She feels bet- 
ter, and both doctor and patient are convinced 
of the validity of the dynamic explanation 
that helped her. 

This woman might have made an appoint- 
ment with a behavior therapist who told her 
that her fear was a conditioned reaction to 
crowds, and it is only necessary to break the 
conditioned habit. This deconditioning is ac- 
complished by teaching her to relax when she 
is afraid in a crowd. The feeling of relaxation 
will become conditioned to the sight of the 
crowd, and the fear will be countercondi- 
tioned. One technique to teach relaxation is 
the control of breathing. The doctor tutors 
the patient in the breathing exercises to be 
used when she is afraid. This woman also has 
faith in the prescription. The next time she is 
in a crowd and feels anxious, she reminds her- 
self of the suggestion and begins to concen- 
trate on her breathing. As her attention be- 
comes recruited to the task of relaxing, her 
anxiety recedes. The essential similarity be- 
tween the two treatment procedures is that 
the patient had faith in the therapist’s sug-
gestion, believed in the effectiveness of an 
engrossing action or thought sequence, and 
implemented it. The action required concen- 
tration and blocked anxiety. The woman was 
helped by both techniques, and it is likely 
that one basis of cure was the fact that both 
strategies required her to become preoccupied 
with an engrossing cognitive and/or behav- 
ioral routine that she believed would help her. 
Her faith in the “procedures” was probably 
as important as the procedures themselves, as 
studies of native medicine imply (Chaplin, 
1930; Iwai & Reynolds, 1970; Prince, 1968; 
Rao, 1966). 

Many social institutions are conveniently 
organized ways to provide routines to deal 
with uncertainty. Religion, work, school, and 
sex are complex ritualized involvements that 
bind attention through the acting out of well- 
practiced actions and turn the mind from the 
daily discrepancies, inconsistencies, and 
sources of unpredictability that cannot be 
easily denied, interpreted, or pushed aside. 
These ritualized solutions may fail when a 
significant portion of the population ques- 
tions their effectiveness. The person is then 
confronted with the discrepant experience of 
knowing large numbers of people with whom 
he feels similar and who do not share his be- 
liefs about the rituals that heal. This incon- 
sistency, which produces uncertainty, can be 
toxic to the effectiveness of the procedure. 

Although the term “uncertainty” is used 
here, the meaning intended is synonymous 
with the more popular phrases cognitive con- 
flict, cognitive dissonance, and cognitive dis- 
equilibrium. Hence, the motive to resolve un- 
certainty might be renamed the motive for 
cognitive harmony, consonance, equilibrium, 
or simply the motive to know, which Berlyne 
(1962) calls epistemic curiosity.⁶

6 If one substitutes uncertainty for optimal com-
plexity in the writings of Walker (1964) and Dem-
ber (1965), these positions become complementary to
the one presented here. Moreover, some of the
problems of social psychology are clarified by this
view.

The desire to resolve uncertainty is not
viewed as a deprivation or tension-reduction
process. The person is not deprived of the
state of certainty, in the same sense that he
is deprived of warmth when cold or water
when thirsty, and he is not in a state of arousal. 
Research by Driscoll and Lanzetta (1964) 
and Hawkins and Lanzetta (1965) supports
this idea, for they found no relation between 
experimentally induced uncertainty and in- 
dexes of arousal. Uncertainty is a state char- 
acterized by a particular organization of cog- 
nitive structures and often accompanied by a 
physiological state of “alerting.” This state
has an inherent lability and is usually fol- 
lowed by processes which produce a different 
organization of cognitive structures and, typi- 
cally, dissolution of the state of alertness. The 
new cognitive arrangement is neither a reduc- 
tion nor an attenuation of the first, but a 
different structure. Consider as an analogy 
the process of mitosis. Initially, the nucleus 
contains scattered chromatin material. This 
phase is followed by the ordered pairing of 
chromosomes—a state which is labile—fol- 
lowed by the splitting of the chromatids and 
eventual production of two cells. Each suc- 
cessive stage is not a reduction of the preced- 
ing stage, but a change in the arrangement of 
the structures of the earlier one. Some may 
feel that the phrase “resolve uncertainty” is 
defensive, implying that man dislikes the 
unknown and is not attracted to challenge 
and self-actualization. At the moment, this 
choice of words is clearly a matter of taste. 
Given the sparsity of empirical data on these
issues, the motive “to know” could have been 
substituted for “resolve uncertainty.” The 
first is more flattering to man than the second, 
but aside from that prejudice, the logic and 
implications of the argument presented seem 
to the writer to be almost identical. 


ANGER AND HOSTILITY 


The relation between the affect state of 
anger and the motive of hostility parallels the 
relation between the alerted affect state of 
uncertainty and the subsequent motive to 
resolve it. Let us consider first the conditions 
that elicit anger, as we considered the condi- 
tions that generated uncertainty. 


Interruption of a Response Routine 


An actual or potential interruption of a 
behavior sequence directed at a goal can
provoke anger. The child is building a tower
of blocks, and his brother knocks them down. 
The mosaic of internal sensations created by 
this event is one component of anger; the 
second component is the child’s perception of 
the cause of the interruption. If the cause is 
specifiable, anger usually occurs, and a fre- 
quent consequence of anger is the generation 
of a cognitive representation in which injury, 
physical or psychological, occurs to the agent 
of interruption. That class of cognitive repre-
sentation is called hostility. 

Anger and hostility usually occur together, 
but they can occur separately, and it is not 
necessary for either to occur every time a re- 
sponse routine is blocked. If the person can- 
not specify the agent of interruption, he may 
become neither angry nor hostile, but uncer- 
tain and afraid. Consider a man trying to 
open a stuck door. If he believes that some- 
one locked the door from the other side, or 
suspects the door is warped because he failed 
to treat it before summer began, anger oc- 
curs. But if the person believes that the door 
is neither locked nor warped and cannot gen- 
erate an explanation of why it will not open, 
uncertainty, and perhaps anxiety, will de- 
velop. The stuck door has become a discrep- 
ant event that cannot be explained. 

Some interesting predictions follow from 
this analysis. People who do not reflect upon 
possible explanations of unusually frustrating 
events are apt to become angry and hostile; 
those who pause to analyze the same situa- 
tion are more likely to become uncertain and 
anxious. The national unrest in America prior 
to the 1968 presidential elections made some 
adults anxious because they realized that no 
one person or group was responsible for the 
strife that tore at the nation’s spirit. The 
“conditions and the times” were held respon- 
sible. But it is difficult to become angry with 
an entity as abstract as “conditions and 
times.” Other citizens became angry at the 
hippies, the blacks, or the college students, 
because they believed these groups were the 
cause of the unrest. Many voted for George 
Wallace because of hostility to those who 
wanted McCarthy elected; they wished to 
hurt those they believed had frustrated them. 
The more knowledge one has about an uncer- 
tain situation that has no clear cause, the less 
likely that anger and hostility will occur. The
more knowledge one has about frustrating sit- 
uations, the more likely that anger and hos- 
tility will develop. 


A Second Occasion for Anger: Threat to 
Standards 


A threat to a person’s standards can pro- 
voke anger. A 12-year-old American girl be- 
comes angry when called stupid, ugly, or 
childish if competence, beauty, and maturity 
are firmly held standards. A threat to a 
standard is functionally similar to the inter- 
ruption of a response routine, for actions 
flow from beliefs, and a threat to the standard 
deprives the behavior of its cognitive founda- 
tion. A boy will not study if he believes he is 
dull; an adolescent girl will not initiate a 
heterosexual relationship if she is sure she is 
unattractive. Hence, direct interruption of 
behavior or indirect interruption through an 
attack on standards can generate anger and 
potentially hostility. 

The primary motive of hostility, like the 
motive to resolve uncertainty, can lead to a 
variety of secondary motives and behaviors, 
since there are many ways to hurt another. 
The wish to inflict physical harm, harsh 
criticism, and dominance are popular choices. 
Less frequently, sexuality, competence, recog- 
nition, and accumulation of material goods 
are selected to gratify the hostile wish. In- 
deed, any time a person believes he can hurt 
another by gaining a particular resource the 
hated one values, the basis for a secondary 
motive is created. Hence, motives for 
wealth, affection, and symbols of honor can be 
activated whenever hostility is generated. 
But since gratification of these motives can 
also resolve uncertainty, it appears that the 
same secondary motives and goal-related actions
can serve either the resolution of uncertainty
or the gratification of hostility. As a
result, behaviors that appear to gratify many 
of the secondary motives listed earlier are 
necessarily ambiguous as to their primary 
origin. 


THE MOTIVE FOR MASTERY


The motive to master or perfect a skill— 
called effectance by Robert White (1959)— 
is related to, but not identical with, the motive
to control uncertainty. A major difference
is that the mastery motive does not
seem to be preceded by the distinctive alert- 
ing state characteristic of uncertainty. A 4- 
year-old sees a pile of blocks in the corner 
and quietly sits down and builds an elaborate 
house. It is unlikely that he experienced any 
change in affect as he walked toward the 
blocks, or even as he began working. It was 
acknowledged earlier that the desire to per- 
fect a skill or conquer a challenging problem 
can arise from secondary motives related to 
uncertainty or hostility. A child may work at 
school to prevent criticism from the teacher 
and resolve uncertainty over adult acceptance. 
Or a boy may practice baseball to obtain a 
position that a rival wanted and, hence, grat- 
ify hostility. But the motive for mastery may 
have its own primary origins, based on three 
related themes: the desire to match behavior 
to a standard, to predict events, and to define 
the self. 

As the child acquires representations of the 
world, he comes to believe in them and tries to 
maintain them. Some of these representations 
are experienced directly; he knows that pud- 
ding is eaten with a spoon, handkerchiefs are 
put in pockets. Other rules he has only heard
about; bobsledding is exhilarating, marijuana 
is relaxing. These rules, like compasses, guide 
behavior and thought, for as Jonathan Richardson
noted over 250 years ago, “It is a
certain maxim, no man sees what things are, 
that knows not what they ought to be.” 

Each person attempts to simulate, by con- 
struction or imitation, these idealized states. 
The 4-year-old builds a house of blocks be- 
cause he has an idea of what a house of 
blocks should look like, and the blocks are 
available. Man climbs mountains because he 
has some representation of the scene from the 
top and a mountain is nearby. However, an 
additional factor must be present if the mo- 
tive is to be evoked, for it is foolish to expect 
that every time a child sees some blocks, he 
will stop and build a house. There has to be 
some uncertainty about attaining the idealized 
state. If the child is perfectly certain he can 
build a house of blocks, or certain that he 
cannot, he will not be motivated to build it, a 
suggestion in accord with the finding that 
search for information is, on occasion, curvilinearly
related to response uncertainty
(Driscoll & Lanzetta, 1964; Lanzetta & 
Driscoll, 1968; Lanzetta & Kanareff, 1962), 
although this empirical relation is not always 
affirmed (Hawkins & Lanzetta, 1965; Sieber 
& Lanzetta, 1964). Hence, one basis for the 
mastery motive is possession of a representa- 
tion of an idealized state of affairs, coupled 
with some uncertainty about the ability to 
match behavior and experience to that 
standard. This class of uncertainty is usually 
not accompanied by unpleasant feelings of 
anxiety but by a more subtle and more 
pleasant state that invites the term excite- 
ment. Whether excitement or distress is gen- 
erated by uncertainty depends, first, on 
whether the person can cope with the dis- 
crepancy and, second, whether he generated 
it or it was imposed upon him. Excitement is 
most likely to occur when the person gen- 
erates the uncertainty and has a relevant cop- 
ing action. Man seeks uncertainty when he 
believes that he can deal with it; he avoids 
uncertainty when he believes that he cannot. 

A second, closely related basis for mastery 
involves the wish to predict future events, for 
prediction permits the feeling of control. The 
attraction to new experience often derives 
from the desire to predict the self’s reaction 
in a novel context. One motive for the youth’s 
initial experimentation with mescaline, mari- 
juana, or LSD is the wish to know if the self 
“can take” the strong feelings and emotions 
produced by the drug. The continued usage 
derives, in part, from the desire to become 
adept at predicting one’s feelings and 
thoughts in the drugged state. When predic- 
tion becomes accurate, the person often stops 
this activity. Affect is most intense while
predictive control is being gained, not when 
it has been attained. This phenomenon is also 
observed in animals, for they seem most 
aroused when mastering a new habit, not 
when the habit has been learned (Fischer, 
1970). Man selects for mastery those tasks 
that generate some uncertainty, and for which 
he has some initial set of appropriate re- 
sponses. Uncertain challenges that do not 
engage any behavioral implementation are 
shunned. This dynamic leads to the third 
basis for mastery, the desire for self-definition.




Each individual wants to know the quality 
and uniqueness of his psychological character- 
istics; those attributes that define the self-as- 
object to the self-as-person. The attributes 
viewed as salient depend on the culture’s val- 
ues, and the child turns toward the culture to 
discover the characteristics he should try to 
master. But even after he learns them, he 
often requires others to help him evaluate
how well he is doing. Society tells the girl to 
be beautiful, but she can only know her 
attractiveness by comparing herself with 
women who define the total range of beauty, 
and deciphering the sincerity of flattering 
men. This view is a restatement of Festinger’s 
(1964) social comparison hypothesis. 

Since the performances and experiences of 
the self are a fundamental source of uncer- 
tainty, especially for the child, many of the 
mastery goals sought are attempts to gain 
information that will resolve some node of 
uncertainty surrounding the conceptualization 
of the self. When behaviors seem to be aimed 
at increased levels of uncertainty, the more 
fundamental goal is often to resolve uncer- 
tainty surrounding an attribute of the self. 

Hence, the motive for mastery can be grati- 
fied in almost any behavioral context one can 
imagine, from backyard to battlefield. Psychologists
have focused their attention too
narrowly on instances of this motive in intellectual
domains, and future work should
broaden the base of inquiry. 


HOW TO DESCRIBE MOTIVES


Psychologists have traditionally described 
the quantitative component of motives in 
terms of intensity or strength, because a 
motive was always linked to visceral sensa- 
tions and a predisposition to behave. A 
strength metric is appropriate for intensity of 
feeling and amplitude of response. But a 
mental representation of blue-green surf does
not possess strength in the same way that the
prick of a pin or the striking of a table does. 
Ideas seem better described by adjectives 
appropriate for concepts, such as articula- 
tion, complexity, and position in a hierarchi- 
cal structure. 

The articulation of a motive refers to the 
degree to which the goal can be specified and 
resembles the meaning intended when one
speaks of a well- or poorly articulated concept.
A well-articulated concept of justice is
one in which the central dimensions of the 
concept are clearly represented and amenable 
to statement. If a motive for recognition is
well articulated, the person has representa- 
tions of specific and relevant goals. Articula- 
tion refers to the degree of specification of the 
goal, not the number or variety of goals 
sought. Many young adults have a poorly 
articulated motive to “get more out of life,” 
a motivational state that has produced a 
spate of cartoons illustrating a man on top of 
a rocky mountain asking a wrinkled sibyl, 
“What is my destiny?” 

The complexity of a motive is similar in 
meaning to the complexity or breadth of a 
concept and congruent with Kelly’s (1955)
discussion of the complexity of categoriza- 
tion of another. The complexity of the con- 
cept “fruit” is defined by the number of 
exemplars recognized as belonging to the cate- 
gory. Although most adults have a motive 
for power, they differ in the number of al- 
ternative goals viewed as potential gratifica- 
tions of the motive (i.e., the breadth of the 
motive). A college student for whom leadership
roles in student affairs are an arena for
power may possess a varied set of goal representations
that include athletics, radical party
membership, or the college newspaper, or a 
simple motive in which president of the dor- 
mitory is the single goal sought. The complex- 
ity of a primary or secondary motive should 
increase with age, for language, introspection, 
and the honing of expectations in the frus- 
trating world of action all amplify the total 
variety of goal representations that might 
satisfy a motive. One corollary of this devel- 
opmental change is that there should be 
greater substitutability of goals for the older
than for the younger child. The 5-year-old 
with a motive for power has a limited under- 
standing of how to gratify it; the adult be- 
lieves this motive can be gratified by wealth, 
perfection of a valued skill, or identification 
with a group or person possessing power. 

Motives, like all concepts, exist in a hierarchy,
and each can be ascendant at different
times. The idea of position in a hierarchy
approximates the popular term preoccupation.
When a motive is ascendant in
the hierarchy, the processing of information
is selective in favor of the motive, and avail- 
able action routines that can gratify the mo- 
tive have lower thresholds. Ascendant mo- 
tives orient the person to relate his experience 
to the goal and to seek out experiences that 
are likely to gratify it. 

Probability of gratification is a major de- 
terminant of hierarchical position for the 
secondary motives derived from uncertainty. 
This statement is in accord with Atkinson’s 
(1958) assumption that motive arousal bears 
an inverted-U relation to subjective expec- 
tancy of success. If an activated motive is 
continually frustrated, a lowered expectancy 
of goal attainment (certainty of nongratifica- 
tion) will develop, and eventually the motive 
will assume a subordinate position in the hier- 
archy. Consistent gratification, which pro- 
duces a high expectancy of goal attainment 
(certainty of gratification), has a similar ef- 
fect. If a motive is a structure in long-term 
memory, then one possibility is that the 
longer it remains low in the hierarchy, the 
harder it will be to retrieve it. When retrieval 
becomes unusually difficult, the motive is said 
to be unconscious. Each person possesses 
hundreds of thousands of cognitive represen- 
tations of goals which lie latent in the shadow 
of unawareness. Some of these representa- 
tions can be activated into consciously expe- 
rienced motives; others seem to be unavail- 
able but apparently able to direct behavior. 
This observation has led to the postulation of 
unconscious motives. It is possible, however, 
for a person to experience cognitive frag- 
ments of a goal without recognizing the sig- 
nificance of the complete idea, just as it is 
possible for a person to perceive only a frag- 
ment of a solution to a problem. This phe- 
nomenon, labeled “tacit knowing,” is cap- 
tured in the child’s reply to the question, 
“Why did you smile just now?” The child 
answers, “I know, but I just can’t tell you.” 
Similarly, a child does not have to think, “I 
hate my mother and want her to be unhappy; 
therefore, I will fail in school.” He may have 
only a fleeting awareness of his mother in a 
state of upset following her scanning of his 
report card. The fleeting thought may be 
sufficient to activate relevant behavior. The 
3-year-old is not aware of the rules of grammar,
yet his sentences are surprisingly accurate.
We do not yet understand how a representation
of a goal can guide behavior, but
the selection of an action to gratify an un- 
conscious motive may resemble the child’s 
selection of the correct verb in a complex 
sentence that he is in the midst of uttering. 
Both problems may be clarified by the same 
solution. 
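
The inverted-U relation between expectancy of success and motive arousal, mentioned earlier in this section, can be made concrete with a small numerical sketch. The text gives no formula, so the product form p(1 - p) used here is only an assumed stand-in for any curve that is low at both extremes and highest in between; the function name `arousal` and the parameter `motive_strength` are likewise hypothetical.

```python
def arousal(p_success: float, motive_strength: float = 1.0) -> float:
    """Illustrative inverted-U: arousal is zero when gratification is
    certain (p = 1) or certainly absent (p = 0) and peaks at p = 0.5.
    The p * (1 - p) form is an assumption, not taken from the paper."""
    if not 0.0 <= p_success <= 1.0:
        raise ValueError("p_success must lie between 0 and 1")
    return motive_strength * p_success * (1.0 - p_success)


if __name__ == "__main__":
    # Tabulate arousal across a range of subjective expectancies.
    for p in (0.0, 0.1, 0.3, 0.5, 0.7, 0.9, 1.0):
        print(f"expectancy {p:.1f} -> arousal {arousal(p):.2f}")
```

Under this assumed form, both continual frustration (expectancy near 0) and consistent gratification (expectancy near 1) yield low values, which is the pattern the paragraph above describes for motives that sink in the hierarchy.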


A Comment on Measurement 


This conceptual attitude toward motive im- 
plies that the creation of sensitive assessment 
strategies may be informed by research on 
concept formation and utilization (Kintsch, 
1970). A useful technique to assess a con- 
cept’s relative position in a hierarchy is to 
present multiple instances of two or more 
concepts and to evaluate the degree to which 
the exemplars of each concept are clustered in 
recall (Mandler, 1969). This technique might 
be borrowed to assess the change in hier- 
archical position of two motives, say reas- 
surance from a friend and dominance of 
peers, following an experimental situation that 
created uncertainty. A related strategy would 
measure differential induction of motive con- 
cepts before and after an experience intended 
to activate the motive. An early study, using 
this design, found that college students who 
acquired the concept of “hostile aggression” 
more quickly than three other concepts pro- 
duced the most aggressive imagery to a spe- 
cial set of inkblots (Kagan & Moss, 1961). 
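
As a rough illustration of how clustering in recall might be scored, the sketch below computes the share of adjacent recalled items that come from the same concept. This index, the function name `repetition_ratio`, and the exemplar words are illustrative assumptions; they are not the procedure reported by Mandler (1969).

```python
from typing import Dict, List


def repetition_ratio(recall_order: List[str], category_of: Dict[str, str]) -> float:
    """Proportion of adjacent pairs in the recall sequence whose two
    items belong to the same category (0 = none, 1 = all)."""
    if len(recall_order) < 2:
        return 0.0
    same_category_pairs = sum(
        1
        for first, second in zip(recall_order, recall_order[1:])
        if category_of[first] == category_of[second]
    )
    return same_category_pairs / (len(recall_order) - 1)


# Hypothetical exemplars of two motive concepts.
category_of = {
    "hug": "reassurance", "praise": "reassurance", "comfort": "reassurance",
    "command": "dominance", "win": "dominance", "lead": "dominance",
}

clustered = ["hug", "praise", "comfort", "command", "win", "lead"]
alternating = ["hug", "command", "praise", "win", "comfort", "lead"]

if __name__ == "__main__":
    print(repetition_ratio(clustered, category_of))
    print(repetition_ratio(alternating, category_of))
```

A higher ratio for one concept's exemplars after an uncertainty-inducing experience than before would be read, on the paper's argument, as a sign that the corresponding motive has risen in the hierarchy.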

The assumption behind these methodo- 
logical suggestions is that events representa- 
tive of a goal engage cognitive representa- 
tions relevant to the motive. The external 
event is given more focused attention, and its 
symbolic form is more readily included in 
the resulting cognitive structure. Both pro- 
cesses should result in more organized retrieval 
and easier concept induction (Kintsch, 1970). 
Since physical and psychological distinctive- 
ness of the experimental materials will influ- 
ence quality of recall or concept induction, it 
is necessary to equate the stimuli on these 
two dimensions. This is not an easy task. 

These procedures are theoretical relatives 
of instruments like the Thematic Appercep- 
tion Test (TAT), where the constructed 
story is presumed to reflect the motive. However,
the TAT has insufficient sampling of
motivational information, does not pit one 
motive against another in a systematic way, 
and requires the subject to generate a co- 
herent and complex verbal response. These 
three constraints may have limited the sensi- 
tivity of the TAT as an index of motivation. 
Failure to acknowledge the specificity of sec- 
ondary motives may also have been a hidden 
source of error in earlier attempts at motive 
assessment. The young child’s motives are 
likely to be concerned with specific experi- 
ences and objects. The young child wants to 
be kissed by his mother, to play a game with
his father, or to see his sister cry. He should 
not be characterized as having an abstract 
motive for love, attention, or hostility. 


DEVELOPMENTAL ISSUES 
Changing Sources of Uncertainty 


If motives are viewed as cognitive struc- 
tures, we should expect developmental changes 
in both their organization and hierarchical 
position. The hypothesis that uncertainty of 
gratification influences the hierarchical posi- 
tion of some motives implies that the ascend- 
ant motives during any developmental period 
should, to some degree, refer to goals whose
attainment is uncertain. To illustrate, the
typical middle-class 2-year-old is certain of 
being fed and sheltered and not yet aware of 
the potential difficulty of learning to read or 
gaining peer acceptance. Hence, neither class 
of goals generates uncertainty, and these 
motives are subordinate in the hierarchy. The 
presence of the parent and signs of parental
nurturance, however, are uncertain goals.
Crying upon separation from the parent and
requests for affection, which occur frequently
among 2-year-olds, affirm the assumption that
motives for these goals are ascendant at this 
time. Four years later, when uncertainties 
surrounding parental presence have been re- 
solved, uncertainty over sex-role identity, 
integrity of moral attitude, and attainment 
of culturally valued skills become the bases 
for new secondary motives. Among most
American 10-year-olds, uncertainty over
school mastery and peer acceptance pushes the
motives for academic competence and friendship
into an ascendant position. In the middle
of adolescence, when uncertainty over
sexuality emerges, a motive for heterosexual 
involvement replaces grades for those youth 
who have resolved the uncertainty over in- 
tellectual ability and are prepared to cope 
with new doubts. The view that develop- 
mental changes in motives are partially influ- 
enced by changing foci of uncertainty is sup- 
ported by the change in the nature of the un- 
certainty over sexuality that has occurred 
among many American college women during 
the last 20 years. Earlier, the uncertainty 
concerned sexual virtue; today, it is sexual 
sophistication. 


Changing Cognitive Structures 


Since motives have been defined as cogni- 
tive constructs, it is reasonable to assume that 
developmental changes in cognitive processes
may have important implications for motiva- 
tion. Piaget (1952, 1967) insists that the 
young child is not capable of generalized ab- 
stract representations, and the author has 
argued, in a similar vein, that the first rep- 
resentations of experience are schemata and 
images, with symbols, concepts, and rules 
added to the repertoire later (Kagan, 1971). 
Hence, maturation may place serious con- 
straints on the nature of motives. Since Piaget 
hypothesizes a basic change in cognitive proc- 
esses at early adolescence—the shift to for- 
mal operations—it is possible that a new class 
of secondary motives also emerges at this 
time, and that this period marks a stage in 
motivational growth. 

The soundest basis for postulating a stage 
in psychological development occurs when 
biology has prepared the child for a change in 
cognitive structure, motive, affect, or be- 
havior, with experience playing the role of in- 
ducer. The biologists conceptualize this pro- 
cess in terms of an inherent biological compe- 
tence potentiated by external forces. To il- 
lustrate, at a particular time in development, 
certain ectodermal cells in the salamander 
acquire the potentiality to become the lens of 
the eye, if, and only if, inductive endodermal 
tissue is present in the vicinity to alter sur- 
face properties of cells and allow that compe- 
tence to become manifest (Jacobson, 1966). 
It is possible that biological developments, of
which we are still unaware, prepare the 11- 
year-old for major psychological changes, if, 
and only if, the proper inducing experiences 
occur.

It is suggested that the early adolescent 
has acquired a new cognitive competence, 
namely, the disposition to examine the logic 
and consistency of his existing beliefs. The 
emergence of this competence is facilitated by 
encounter with phenomena and attitudes that 
are not easily interpreted with existing ideol- 
ogy. These intrusions nudge the preadoles- 
cent to begin an analytic reexamination of 
his knowledge, and, as a consequence, new 
motives rise in the hierarchy. Piaget (1952) 
and Donaldson (1963) suggest that the early 
adolescent, in comparison with an 8- or 9- 
year-old, can deal with hypothetical premises 
that may violate reality. The early adoles- 
cent appreciates the discontinuity between 
the self-contained information in a hypo- 
thetical problem and the information he 
carries with him for more practical chal- 
lenges. The adolescent is able consistently to 
induce rules from events with multiple at- 
tributes, and he is not limited to a one-at-a- 
time analysis. The ability to reason about the 
hypothetical and to consider organization of 
multiple propositions simultaneously allows 
the adolescent to entertain the possibility that 
the goals he now desires might have been 
different, given an earlier history, and to 
appreciate that an experience is often de- 
pendent on events not in the immediate field. 
He knows that his mother’s anger can be 
provoked by any one of several violations, 
and he realizes that her anger at dinner might 
be a product of yesterday’s report card. 
Awareness of different bases for the same behavior
is analogous to the ability to generate
alternate proofs for the same conclusion. 
Consider the following agonizing self-intro- 
spection of an 18-year-old girl who has spent 
the night with a boy she does not love and 
chides herself in the morning, “The only sin I 
committed was the sin of thinking I would 
never be lonely.” This sentiment requires 
separation of sexual and affiliative gratifica- 
tions and recognition that each can lead to 
the same action. 


Of special relevance to this paper is the 
hypothesis, derived from Piaget, that the ado- 
lescent is disposed to examine his beliefs in 
sets, and to search for inconsistencies among 
them and between these beliefs and related 
actions. This inclination depends partly on
the previous competences, for critical examina- 
tion of the compatibility of a set of related 
beliefs requires the capacity to consider multi- 
ple rules simultaneously. Thus, the 13-year- 
old broods about the inconsistency among the 
following three propositions: (a) God loves
man. (b) The world contains many unhappy 
people. (c) If God loved man, he would not 
make so many people unhappy. The adoles- 
cent is troubled by the incompatibility that 
is immediately sensed when he examines these 
statements together. He notes the contradiction
and has at least four choices. He can deny
the second premise that man is ever un- 
happy; this is unlikely for its factual basis is 
too overwhelming. He can deny that God 
loves man; this is avoided, for love of man 
is one of the definitional qualities of God. 
He can assume that the unhappiness serves an 
ulterior purpose God has for man; this possi- 
bility is sometimes chosen. Finally, he can 
deny the hypothesis of God. 

The last alternative, which has become a 
popular form of the resolution in Western 
society, has profound consequences, for it 
invites the further implication that if this 
statement is not tenable, all beliefs held at 
that moment are also in jeopardy. 

Similar analyses are performed on other 
childhood premises (e.g., My parents are 
omniscient. Sexual activity is bad.) with simi- 
lar results. Soon the adolescent is faced with 
a major source of uncertainty. Many of his 
childhood beliefs are not tenable, and he has 
not yet constructed a new set. This uncom- 
fortable state leads to the ascendance of sec- 
ondary motives surrounding autonomy, ideo- 
logical commitment, sexual experimentation, 
and, if the goal actions that are activated by 
these motives are blocked, hostility. 

There are two classes of inducing experi- 
ences that allow the new competence to trans- 
form the existing motivational hierarchy. 
Puberty qua puberty is a universal biological 
inducer. The child’s body changes, and there 
is an increased salience of perceived sexual
feelings. As a result, masturbation increases, 
and this private violation of a social prohi- 
bition has to be rationalized. The adolescent 
also recognizes that he is fertile and must 
decide how to handle this power, and finally 
he must acknowledge that childhood actions 
and beliefs have to be transformed or re- 
placed. 

There are also local inducers specific to 
each culture that accelerate or retard this 
process depending on the degree to which they 
force the adolescent to confront incompati- 
bilities in his existing beliefs. Our culture 
seems to maximize encounter with dissonance 
at puberty. For example, many children enter 
junior high school, which, by contrast with 
the elementary school setting, contains a 
greater variety of belief systems. The child 
encounters children who have different atti- 
tudes toward religion, grades, sexuality, par- 
ents, and drugs. Each ideological position has 
many advocates, and, since the source of these 
attitudes cannot easily be discredited, the 
child must deal with them. 

Second, many schools begin tracking, a
harsh event that forces each student to think
about his intellectual capacities. Such a procedure
frightens those in the top track who
may not believe they are talented enough to 
warrant that assignment, while it saddens and 
angers those in the lower track who resent 
this categorization and must erect a ration- 
alization against the importance of academic 
accomplishment. Finally, Western culture 
emphasizes more than most that beliefs are 
a central part of an individual’s identity. If 
integrity of belief systems is so central to the 
conceptualization of the self, it is necessary to 
polish its rough edges, and it is likely that 
the surge of concern among American ado- 
lescents with the meaning of their existence, 
the significance of their role in society, or 
whether a goal is theirs or their parents’ 
forms the bases for major changes in the 
motivational hierarchy. 


DISCUSSION


The views in this paper obviously build
upon the work of many psychologists, especially
the writings of Atkinson, Berlyne, James, Hunt,
Festinger, McClelland, Murray, Walker, Dember,
Janis, and White. The present account differs
from these views by awarding greater significance
to the cognitive quality of motivation.
This paper treated motives as cognitive
representations of goals and considered the
four different motives of sensory pleasure, 
hostility, resolution of uncertainty, and mas- 
tery. It was suggested that these motives 
are different in origin and dynamics, and overt 
goal-directed behaviors currently labeled by 
the categories dependent, succorant, dominant, 
achievement, aggressive—to name a few— 
can be in the service of resolution of uncer- 
tainty, mastery, or hostility and, hence, are 
ambiguous as to their primary origin. The 
view that motives are cognitive representa- 
tions organized in a hierarchy suggests quan- 
tification strategies that resemble those used 
for concepts, rather than those used for cod- 
ing overt responses. It was suggested that 
articulation, elaboration, and hierarchical po- 
sition were three relevant dimensions to be 
assessed in describing motives. A developmental
approach to motive organization makes
the interaction of environmental sources of 
dissonance and cognitive maturation central 
causes of change in motivational structure. It 
was suggested that Piagetian theory con- 
tained implications for motivational develop- 
ment, especially during the shift from con- 
crete to formal operations during adolescence. 


REFERENCES 


Atkinson, J. W. (Ed.) Motives in fantasy, action
and society. Princeton, N. J.: Van Nostrand, 1958.

Atkinson, J. W. An introduction to motivation.
Princeton, N. J.: Van Nostrand, 1964.

Atkinson, J. W. Change of activity. In T. Mischel
(Ed.), Human action: Conceptual and empirical
issues. New York: Academic Press, 1969.

Bentham, J. Principes de legislation et traites de
legislation. (Trans. by C. M. Atkinson) London:
Oxford University Press, 1914.

Berlyne, D. E. Conflict, arousal and curiosity. New
York: McGraw-Hill, 1960.

Berlyne, D. E. Uncertainty and epistemic curiosity.
British Journal of Psychology, 1962, 53, 27-34.

Berlyne, D. E. Arousal and reinforcement. Nebraska
Symposium on Motivation, 1967, 15, 1-110.

Chaplin, D. Some aspects of Hindu medical treatment.
London: Luzac, 1930.

Dember, W. N. The new look in motivation. American
Scientist, 1965, 53, 409-427.




Donaldson, M. The study of children’s thinking.
London: Tavistock, 1963.

Driscoll, J. M., & Lanzetta, J. T. Effects of problem
uncertainty and prior arousal on predecisional
information search. Psychological Reports,
1964, 14, 975-988.

Festinger, L. A theory of cognitive dissonance.
Stanford: Stanford University Press, 1957.

Festinger, L. Conflict, decision and dissonance.
Stanford: Stanford University Press, 1964.

Fischer, K. W. The structure and development of
sensory-motor actions. Unpublished doctoral dissertation,
Harvard University, 1970.

Freud, S. Three contributions to the theory of sex
(1905). (Trans. by A. A. Brill) New York: Nervous
and Mental Disease, 1930.

Grossman, S. P. Eating or drinking elicited by direct
adrenergic or cholinergic stimulation of hypothalamus.
Science, 1960, 132, 301-302.

Hawkins, C. K., & Lanzetta, J. T. Uncertainty,
importance, and arousal as determinants of predecisional
information search. Psychological Reports,
1965, 17, 791-800.

Hinde, R. A. Animal behavior. (2nd ed.) New
York: McGraw-Hill, 1970.

Hull, C. L. Essentials of behavior. New Haven:
Yale University Press, 1951.

Hunt, J. McV. Intrinsic motivation and its role in
psychological development. Nebraska Symposium
on Motivation, 1965, 13, 189-282.

Iwai, H., & Reynolds, D. K. Morita psychotherapy:
The views from the West. American Journal
of Psychiatry, 1970, 126, 1031-1036.

Jacobson, A. G. Inductive processes in embryonic
development. Science, 1966, 152, 24-34.

James, W. The principles of psychology. Vol. 2.
New York: Holt, 1890.

Janis, I. L. Motivational factors in the resolution
of decisional conflicts. Nebraska Symposium on
Motivation, 1959, 7, 198-231.

Kagan, J. Understanding children. New York: Harcourt,
Brace & Jovanovich, 1971.

Kagan, J., & Moss, H. A. The availability of aggressive
ideas. Journal of Personality, 1961, 29, 217-234.

Kelly, G. A. The psychology of personal constructs.
New York: Norton, 1955.

Kintsch, W. Learning, memory and conceptual
processes. New York: Wiley, 1970.

Kotelchuck, M. A study of separation anxiety.
Unpublished manuscript, Harvard University, 1971.

Lanzetta, J. T., & Driscoll, J. M. Effects of uncertainty
and importance on information search
in decision-making. Journal of Personality and
Social Psychology, 1968, 10, 479-486.

Lanzetta, J. T., & Kanareff, V. T. Information cost,
amount of pay-off, and level of aspiration as
determinants of information seeking in decision-making.
Behavioral Science, 1962, 7, 459-473.

Lazarus, R. S. Psychological stress and the coping
process. New York: McGraw-Hill, 1966.
Mandler, G. The interruption of behavior. Nebraska
Symposium on Motivation, 1964, 12, 267-343.

Mandler, G. Words, lists, and categories: An experimental
view of organized memory. In J. L.
Cowan (Ed.), Studies in thought and language.
Tucson: University of Arizona Press, 1969.

McClelland, D. C. The achieving society. Princeton,
N. J.: Van Nostrand, 1961.

McClelland, D. C., Atkinson, J. W., Clark, R. A.,
& Lowell, E. L. The achievement motive. New
York: Appleton, 1953.

Miller, N. E. Learnable drives and rewards. In S.
S. Stevens (Ed.), Handbook of experimental psychology.
New York: Wiley, 1951.

Murray, H. A. Explorations in personality. New
York: Oxford University Press, 1938.

Piaget, J. The origins of intelligence in children.
New York: International Universities Press, 1952.

Piaget, J. Six psychological studies. New York:
Random House, 1967.

Prince, R. The therapeutic process in cross-cultural
perspective: A symposium. American Journal of
Psychiatry, 1968, 124, 57-62.

Rao, S. Culture and mental disorder: A study in an
Indian mental hospital. International Journal of
Psychiatry, 1966, 12, 139-148.

Rheingold, H. L., & Eckerman, C. O. The infant
separates himself from his mother. Science, 1970,
168, 78-83.

Schachter, S. The interaction of cognitive and
physiological determinants of emotional state. In
L. Berkowitz (Ed.), Advances in experimental social
psychology. Vol. 1. New York: Academic
Press, 1964.

Sieber, J. E., & Lanzetta, J. T. Conflict and conceptual
structure as determinants of decision-making
behavior. Journal of Personality, 1964, 32,
622-639.



Solomon, R. L., & Wynne, L. C. Traumatic avoid-
ance learning: Acquisition in normal dogs. Psy-
chological Monographs, 1953, 67(4, Whole No.
354).

Stretch, R., Orloff, E. R., & Dalrymple, S. D.
Maintenance of responding by fixed-interval
schedule of electric shock presentation in squirrel
monkeys. Science, 1968, 162, 583-585.

Teitelbaum, P. Disturbances of feeding and drink-
ing behavior after hypothalamic lesions. Nebraska
Symposium on Motivation, 1961, 9, 39-65.

Tolman, E. C. Purposive behavior in animals and
men. New York: Century, 1932.

Troland, L. T. The fundamentals of human motiva-
tion. New York: Van Nostrand, 1928.

Valenstein, E. S., Cox, V. C., & Kakolewski, J.
W. Reexamination of the role of the hypothalamus
in motivation. Psychological Review, 1970, 77,
16-31.

Valins, S. Cognitive effects of false heart rate feed-
back. Journal of Personality and Social Psychol-
ogy, 1966, 4, 400-408.

Walker, E. L. Psychological complexity as a basis
for a theory of motivation and choice. Nebraska
Symposium on Motivation, 1964, 12, 47-94.

White, R. W. Motivation reconsidered: The concept
of competence. Psychological Review, 1959, 66,
297-333.

Young, P. T. Motivation of behavior. New York:
Wiley, 1936.

Yu-Lan, F. History of Chinese philosophy. Vol. 3.
Princeton: Princeton University Press, 1953.

Zimbardo, P. G. Involvement and communication
discrepancy as determinants of opinion conform-
ity. Journal of Abnormal and Social Psychology,
1960, 60, 86-94.

Zimbardo, P. G. The cognitive control of motivation.
Glenview, Ill.: Scott, Foresman, 1969.




(Received January 5, 1971) 




Journal of Personality and Social Psychology
EFFECTS OF COMPATIBILITY, CROWDING, GROUP SIZE,
AND LEADERSHIP SENIORITY ON STRESS, ANXIETY,
HOSTILITY, AND ANNOYANCE IN ISOLATED GROUPS ¹


SEWARD SMITH AND WILLIAM W. HAYTHORN ²


Florida State University 


Fifty-six subjects, run in four sets of four dyads and two triads each, were 
subjected to laboratory conditions of isolation and confinement for 21 days. 
In addition to the group size variable (dyads versus triads), the 2 × 2 × 2 × 2
experimental design included a compatibility manipulation (compatible versus 
incompatible), crowding versus less crowding (as measured by cubic feet per 
man), and junior versus senior leadership. Analyses of subjective report mea- 
sures of subjective stress, state and trait anxiety, hostility toward isolation 
partner(s), and annoyance with physical features of the laboratory setting 
yielded statistically significant results indicating higher subjective stress and 
state anxiety during isolation than during pre- and postisolation base-line 
periods; greater reported hostility in junior-led than in senior-led, and in 
hypothetically incompatible than compatible, groups; a greater effect of crowd-
ing on subjective stress and state anxiety reports in three-man than in two-
man groups; greater reported hostility in less crowded than in crowded rooms;
greater annoyance with physical aspects of the situation in compatible than
in incompatible, and in two-man than in three-man groups; greater reported
anxiety in two-man than in three-man groups; and better adaptation to con- 
finement in three-man than in two-man groups and in less crowded than in 
crowded rooms. The results reflect the highly interactive nature of the rela- 
tionships between the subjective reactions of individuals in groups, on the 
one hand, and both social and physical aspects of the group setting on the 
other hand.


For centuries, hearty and adventuresome
man has accepted severe, isolated confinement
as an integral part of exploration and un-
usual duty assignments. These ventures have
frequently involved extreme crowding, re-
stricted interactions, boring constancy of
surrounds, and loss of contact with parent
society. From anecdotal reports (e.g.,
Brainard, 1929; Byrd, 1930, 1938), it is clear
that in addition to the often difficult physi-
cal environments involved, the psychological
milieu bodes potentially serious consequences
as well. It is only in recent years that probing
study of such isolated confinement has begun.
Comprehensive reviews of this expanding
group confinement literature are available


1 The research reported herein was conducted at
the Naval Medical Research Institute, Bethesda, 
Maryland. It was jointly supported by the Navy 
Department Bureau of Medicine and Surgery and the 
National Aeronautics and Space Administration. The 
views expressed herein are those of the authors and 
not necessarily those of either supporting institution. 

2 Requests for reprints should be sent to William 
W. Haythorn, Department of Psychology, Florida 
State University, Tallahassee, Florida 32306. 
elsewhere (e.g., Fraser, 1966; Haythorn, in
press; Smith, 1969), and need not be 
repeated here. Only those laboratory studies 
directly related to the present study will be 
discussed. 

In general, small groups in confinement 
face a large reduction in the usually available 
sources of emotional gratification and release. 
They frequently report monotony and bore- 
dom, interpersonal frictions, irritability, and 
negativity of mood (Haythorn, Altman, & 
Myers, 1966; Taylor, Wheeler, & Altman, 
1968). Members often feel lethargic and non- 
accomplishing. They may well withdraw from 
one another or exhibit increased territoriality 
(Altman & Haythorn, 1967a). Some handle
their confinement well, others, poorly indeed. 
Even in the relative safety and comfort of 
laboratory confinement, volunteer groups 
have experienced levels of stress, boredom, 
and interpersonal conflict that have been in 
some instances intolerable. 
This study was the last in a series of studies
of laboratory confinement research at the
Naval Medical Research Institute. There,
numerous studies of both individuals and
groups in isolated confinement have been con-
ducted in the last 7 years. Group confinement
durations of from 8 to 21 days have been
employed. In a 10-day study (Altman &
Haythorn, 1965, 1967a, 1967b; Haythorn
& Altman, 1967; Haythorn et al., 1966),
isolated dyads showed more psychological
stress and emotional symptomatology than
did controls. Pairs of men were composed
to be homogeneous or heterogeneous on
measures of dominance, dogmatism, affili-
ation, and achievement. Hypothetically in-
compatible isolated dyads had greater
problems, which led some to fail to endure
the full 10 days. Incompatibility with re-
spect to dominance and need for achievement
appeared to be particularly problematic.

A second study of dyads confined for 8
days involved experimental manipulation of
mission-length expectation, stimulation from
the outside, and availability of privacy
(Taylor, Altman, Wheeler, & Kushner, 1969;
Taylor et al., 1968). Fifty-three percent of
the groups aborted. As in the earlier study,
these groups showed higher stress. Groups
who had expected longer duration missions
comprised two-thirds of the aborts. These
long-duration-expecting abort groups experi-
enced the greatest levels of psychological
stress. There was somewhat surprising evi-
dence of higher stress in privacy conditions
(where subjects slept in separate rooms).
The very limited amounts of stimulation
available from the outside that were em-
ployed generally did not help tolerance for
confinement.

The study reported in this paper bears
the name Project RIM (restricted, isolated
monotony) and was planned to be a compre-
hensive, multidisciplinary investigation of 21-
day crowded confinement. It was a logical
extension of the previous Naval Medical Re-
search Institute work. The 21-day confine-
ment period was more than double the length
of prior studies. Steps were taken to ensure
that subjects would be more likely to try to
endure the full duration of confinement.
Longer preconfinement and postconfinement
periods gave greater opportunity for a
broader measurement package with more dis- 
ciplines (e.g., electrophysiology) being repre- 
sented. The space per man was considerably 
less than previously available and was se- 
lected to bracket the limited range likely to 
be available in deep-ocean and space habitats. 
It seemed particularly necessary to extend
the previous research to include three-man
groups to begin exploring the effects of group
size. It was hypothesized that triads would
be more likely to experience difficulties coping
with confinement than would dyads. Simi-
larly, the subjects used in the present study
were an older, better trained, and more ex-
perienced segment of the population than the
recruits used in earlier studies, to increase
the generalizability of results to relevant
populations. The earlier study of interper-
sonal compatibility (Haythorn & Altman,
1967) employed selection procedures that
composed groups from persons in only the
upper and lower tertiles on the various per-
sonality measures. For Project RIM, a group
composition manipulation was used that ex-
cluded no one. As such, the current procedure
represents a much more stringent evaluation
of personality-test predetermination of later
intragroup compatibility.

A final rationale guiding much of our effort
was awareness of the unique setting per-
mitting the study of wide arenas of social-
psychological functioning under highly con-
trolled circumstances. Virtually never does one
have the opportunity to study the emergent
social behavior of so many groups for so long a
time in such an ideal observational and
measurement situation.

The scope and diversity of data preclude
complete coverage in any single article. This
report, therefore, is addressed primarily to
reports by subjects of subjective stress,
anxiety, hostility, and other annoyances.


METHOD 


Complete and detailed documentation of the ra-
tionale, research procedures, and test program for
Project RIM are contained in previous reports ad-
dressed solely to methodology (Donenfeld, Smith,
Haythorn, Libert, & Brown, 1970; Walsh, Donenfeld,
Smith, Haythorn, Briley, Burgoon, Colson, & Holi-
man, 1970). Hence, only limited details are presented
here.3


Plan of the Study


Broadly, Project RIM was the study of two- and
three-man groups undergoing crowded confinement
for 21 days with very little to do. Four independent
variables were employed in a 2 × 2 × 2 × 2 factorial
design. In addition to size of the group, also studied
were the extent of crowdedness, seniority of the group
leader, and interpersonal compatibility. The crowded
conditions afforded about 70 cubic feet of usable
space per man, while in the less crowded rooms,
200 cubic feet were allotted. The six laboratory
rooms varied in length and width from 6 feet 9
inches × $ feet 4 inches for a two-man crowded
room, to 7 feet 11½ inches × 13 feet 8 inches for a
three-man uncrowded room. “Usable space” was con-
sidered to be the total volume of the room minus
space occupied by the refrigerator, oven, desk, food,
and water storage, etc. Group leadership was as-
signed on the basis of rank. Half of the groups were
led by enlisted men (E-4 through E-6) who were
senior peers to the followers (E-2 and E-3); the oth-
ers were led by junior peers (E-2 or E-3) who
usually had only a few days date of rank over their
followers.

Due to laboratory space limitations, only six
groups could be run at the same time. To minimize
disproportionality of subject numbers between two-
man and three-man group conditions, it was decided
that the six groups in each run would include four
dyads and two triads (eight and six subjects, re-
spectively). Altogether, six runs were completed,
with Runs 1 and 2 being initially planned as pilot
runs for procedural check-out and staff training.
Runs 3-6 were considered experimental runs, with
the groups in Runs 3 and 5 composed to maximize
incompatibility, hypothetically, and Runs 4 and 6
composed to maximize compatibility. Run 2 was
identical to Runs 4 and 6, while Run 1 was identical
to Runs 3 and 5 except for being shorter (14 rather
than 21 days). Each run included four dyads and
two triads, with half of each in crowded and half
in less crowded rooms.
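
The subject counts implied by this layout can be checked with a few lines of arithmetic. The sketch below is only a bookkeeping aid, not part of the original procedure; the constant and function names are illustrative. It assumes that every one of the four experimental runs (Runs 3-6) was filled with exactly four dyads and two triads, as the text states.

```python
# Group layout per run as described in the text: four dyads and two triads.
DYADS_PER_RUN = 4
TRIADS_PER_RUN = 2
EXPERIMENTAL_RUNS = 4  # Runs 3-6; Runs 1 and 2 were pilot runs


def subjects_per_run(dyads=DYADS_PER_RUN, triads=TRIADS_PER_RUN):
    """Return (two-man-group subjects, three-man-group subjects) for one run."""
    return dyads * 2, triads * 3


two_man, three_man = subjects_per_run()
total_two_man = two_man * EXPERIMENTAL_RUNS
total_three_man = three_man * EXPERIMENTAL_RUNS

print("per run:", two_man, "+", three_man, "=", two_man + three_man)
print("experimental runs:", total_two_man, "+", total_three_man,
      "=", total_two_man + total_three_man)
```

Under that assumption the totals come to 32 two-man-group and 24 three-man-group subjects, 56 in all, which agrees with the 56 subjects named in the abstract and with the cell sizes noted under Table 1.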

Interpersonal compatibility was based on a theory-
derived computation utilizing the control and affec-
tion measures from the FIRO-B (Schutz, 1958) and
need Achievement from the Edwards Personal Pref-
erence Schedule (Edwards, 1959). The formula for
determining compatibility was a modification of
Schutz’s (1958) procedure, eliminating his inclusion
variable on the grounds that confinement forced
inclusion to a considerable degree (Altman & Hay-
thorn, 1965), and adding the need Achievement vari-
able from the Edwards Personal Preference Schedule
on the grounds that compatibility with regard to
need achievement is highly important in isolated


3 Interested readers may obtain copies of the pro-
cedures documentation reports by writing to either
of the authors. 
and confined groups with tasks to perform (Altman
& Haythorn, 1967a; Haythorn et al., 1966). 

Control compatibility and affection compatibility 
scores from the FIRO-B are derived for dyads from 
individual scores indicating the individuals’ self- 
descriptions of the degree to which they express 
toward others and seek from others interpersonal 
behavior of the kind in question (control or affec- 
tion). The compatibility score for each interpersonal 
orientation is a summation of reciprocal (the degree 
to which the wanted or sought behavior of one 
person is expressed by the other), interaction (the
degree to which the two individuals are similar in 
level of behavioral activity of the kind under ques- 
tion), and originator (the degree to which the 
expressed behavior in question exceeds or falls short 
of the wanted behavior of that type) compatibility. 

For need achievement, absolute differences between 
the scores of pairs of subjects were used as measures 
of achievement compatibility, on the grounds that 
similar attitudes toward achievement would mini- 
mize interpersonal difficulties due to discrepant ex- 
pectations regarding task accomplishment. Support 
for this was provided by the earlier studies (see 
above).

This composition manipulation did not result in 
preexperimental mean differences between compatible 
and incompatible groups on any of the personality 
variables used in the manipulation. The difference 
is therefore only with regard to the intragroup 
configurations of individuals’ scores. 

The formula used for computing overall compati- 
bility was as follows: Compatibility = control com- 
patibility + affection compatibility + difference be- 
tween subjects in need Achievement scores. The 
lower these scores, the more compatible the dyad was 
hypothesized to be. For triads, the average compati- 
bility of the three dyads within the triad was taken 
as the group compatibility score.4
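
The scoring rule just described can be written out as a short sketch. This is an illustration under stated assumptions, not the authors' program: the function names and the sample scores are hypothetical, and the FIRO-B control and affection compatibility scores for each pair are taken as already computed, since their derivation is only summarized in the text.

```python
from itertools import combinations
from statistics import mean


def dyad_compatibility(control, affection, nach_a, nach_b):
    """Overall score for one pair; lower means more compatible.

    control and affection are the pair's FIRO-B compatibility scores
    (assumed to be computed elsewhere); nach_a and nach_b are the two
    members' need Achievement scores.
    """
    return control + affection + abs(nach_a - nach_b)


def group_compatibility(members, control, affection, nach):
    """Average the dyad score over every pair in the group.

    A dyad has a single pair, so its own score is returned; a triad
    averages the scores of its three constituent dyads.
    """
    scores = []
    for a, b in combinations(members, 2):
        pair = frozenset((a, b))
        scores.append(
            dyad_compatibility(control[pair], affection[pair], nach[a], nach[b])
        )
    return mean(scores)


# Hypothetical scores for three men, for illustration only.
nach = {"A": 12, "B": 15, "C": 20}
control = {frozenset("AB"): 4, frozenset("AC"): 6, frozenset("BC"): 2}
affection = {frozenset("AB"): 3, frozenset("AC"): 5, frozenset("BC"): 1}

print(group_compatibility(["A", "B"], control, affection, nach))
print(group_compatibility(["A", "B", "C"], control, affection, nach))
```

Keying the pairwise scores by an unordered pair reflects the fact that the formula treats a dyad symmetrically; whether the original computation weighted the three components equally beyond what the formula shows is not stated.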

The computer computed a group compatibility 
score taking the 14 men who served in each run 
of the experiment and assembling them into every 
possible set of four dyads and two triads. The com- 
puter examined all 151,200 possible sets of six groups 
composed of combinations of the eight followers with 
the six leaders and was programmed to select either 
the most or least compatible set of four dyads and 
two triads. As indicated above, on half of the runs, 
subjects were assigned to maximally compatible 
groupings; least compatible compositions were em- 
ployed for the other half. 
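
The figure of 151,200 possible sets can be reproduced by a simple count. The sketch below is a reconstruction, not the original program, and the function name is illustrative. It assumes each group has exactly one of the six leaders, that a triad takes two followers and a dyad one, and that room and crowding assignments are not part of what is being counted.

```python
from math import comb, factorial


def count_group_sets(n_leaders=6, n_followers=8, n_triads=2):
    """Count sets of one-leader groups: triads get two followers, dyads one."""
    n_dyads = n_leaders - n_triads
    if 2 * n_triads + n_dyads != n_followers:
        raise ValueError("followers do not fill the groups exactly")
    # Choose which leaders head the triads.
    leader_choices = comb(n_leaders, n_triads)
    # Hand the distinct followers out to the leaders; the two followers
    # inside a triad are unordered, hence one division by 2! per triad.
    follower_assignments = factorial(n_followers) // (factorial(2) ** n_triads)
    return leader_choices * follower_assignments


print(count_group_sets())
```

With six leaders and eight followers this gives 15 × 10,080 = 151,200, matching the number reported, which suggests the leader and follower roles were fixed before the search over compositions was run.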


Conditions of the Experiment 


Based on the premise that crews in long-term con- 
finement would be likely to experience difficulty par- 
ticularly during lengthy unoccupied portions of their 
missions, we chose to study confinement under con- 
ditions of minimally filled time. Hence, subjects had 


4 Inquiries concerning further details of the com- 
patibility manipulation should be addressed to the 
authors.
few duties to perform, little recreational material
was available, and the only daily scheduled item was 
a task and/or questionnaire session occupying from 
10 to 150 minutes. Subjects were in lighted rooms 
containing a clock and a calendar; but, aside from 
playing a few games, eating, sleeping, conversing, and 
writing, there was little to do. In order to emphasize
coping rather than aborting behavior, stringent vol- 
unteering procedures were employed; subjects re- 
ceived a monetary award and were provided with 
exceptionally good and varied food. 


Laboratory Facilities 


The study utilized the Naval Medical Research 
Institute’s “deep isolation” laboratory, which con- 
sists of six virtually soundproof, air-conditioned In- 
dustrial Acoustics Company test rooms and an asso- 
ciated control center. A general description of the 
basic facility is available in a prior report (Smith, 
Myers, & Edmondo, 1967). For Project RIM, the 
rooms were outfitted with special triple-deck bunk 
beds, one chair per man, a desk, storage file drawers, 
refrigerator-freezer, microwave oven, bottled water 
(for washing and drinking), and a chemical toilet. 
Where necessary to meet requirements of experimen- 
tal crowding conditions, room space was reduced by 
false walls and barriers. The rooms were constantly 
monitored by means of microphones and a television 
camera. Questionnaires and tasks were presented to
subjects on a videomonitor. The highly automated 
control center contained video- and audiomonitoring 
and recording equipment, as well as various test-pre-
senting, -programming, and -scoring instrumentation.


Subjects 


Eighty-one volunteer Naval enlisted men (E-2 
through E-6) participated in the study. After 
dropping the 25 pilot study subjects, there remained 
56 subjects, data from which are included in the 
analyses reported here. They were recruited from 
various advanced Navy schools by an extended vol- 
unteering procedure designed to screen out all except 
the most serious participant. Subjects received a 
cash award ($75-$100) to make their pay status 
comparable to that for unusual duty. The men 
ranged in age from 18 to 32, averaging 20.8 years. 
Intelligence was measured by the General Classifica- 
tion Test portion of the Navy Classification Battery. 
The General Classification Test range was 45 to 75, 
with a mean of 61.5, representing a Wechsler Adult 
Intelligence Scale IQ of approximately 117. 


General Procedure 


Several run cycles of approximately 33 days dura-
tion were carried out to complete the research design.
There were 7 preconfinement days, during which time 
base lines of psychological, biochemical, and electro- 
physiological measures were taken. Physical and psy- 
chiatric examinations were conducted, and subjects 
were thoroughly briefed and trained on test and 
room-living procedures. They then entered into the 
21-day period of isolated confinement, during which
time the men in each room were totally isolated from 
all other persons. Although it was necessary to resup- 
ply food and the chemical toilet three times during a 
run, this was accomplished without face-to-face
contact. Upon completion of confinement, subjects 
entered a 5-day postconfinement period. Various 
retrospective and postconfinement measures were 
given, and the subjects underwent final medical and 
psychiatric checks. During the pre- and postconfine- 
ment periods, they lived together in a nearby 
barracks building, engaging in some light duties.


Dependent Variables 


A large number of dependent variables were em- 
ployed by the multidisciplinary research staff. These 
included blood biochemistry, urine endocrinology, 
electroencephalogram evaluations of sleep stages and 
central nervous system functioning (e.g., Alpha slow- 
ing), medical and psychiatric assessments, and broad- 
ranged psychological functioning. The latter area in- 
cluded individual and group tasks evaluating per- 
formance and intellectual efficiency; measures of 
subjective stress, mood, symptomatology, interper-
sonal functioning, personality, and demography;
periodic audiorecordings of free time interactions; 
and carefully controlled, systematic observational 
data gathered by the monitor staff and automated 
videotape recordings.


Measures of Stress, Hostility, and Annoyance


Two measures of psychological stress (or anxiety) 
were given to Project RIM subjects: the Subjective 
Stress Scale (Kerle & Bialek, 1958) and the Spiel-
berger (1966) State-Trait Anxiety Inventory (STAI).

The Subjective Stress Scale consists of 15 
Thurstone-scaled words or phrases ranging from 
“wonderful” (scale value of 0), through “indifferent” 
(scale value of 48), to “scared stiff” (scale value
of 94). The subject's extent of subjective stress is 
the scale value of the word or phrase he selects to 
indicate how he felt at some point in, or during 
some period of, time. In this study, for instance, 
subjects were asked six times during the confinement 
period to indicate how they had felt during the 
prior 24 hours. This stress measure was used in
both the 8- and 10-day previous Naval Medical 
Research Institute group confinement studies. 

Spielberger’s test (also used in the 8-day study) 
consists of two scales, one measuring “trait” 
(STAI-T) and the other “state” (STAI-S) anxiety.
Trait anxiety is conceived of as a relatively stable 
individual differences index, while state anxiety is 
viewed to be more transitory, being situationally 
linked. The state anxiety measure was also taken 
six times during confinement, referring each time 
to the previous 24 hours.

Hostility toward partners was assessed by the
Hostility Scale (Berkun, Burdick, & Woodring,
1963). The Hostility Scale contains 15 Thurstone-
scaled phrases ranging from “delighted with” (scale
value of 0), through “indifferent toward” (scale
value of 46), to “raging mad at” (scale value of 97).
During Project RIM, on Days 1, 5, 9, 13, and 17,
subjects circled the choice that best described how
they had felt toward their partners at the point
when they felt least favorable during the prior
24 hours.


TABLE 1

MEAN SPIELBERGER STATE ANXIETY AS A FUNCTION OF GROUP SIZE

                 Pre               Days during confinement                  Post-        Post
Condition     base line      2       6      10      14      18      21   retrospection base line
Two man a       29.8    [illegible] 31.4    33.2    33.4    33.6    33.6     32.3         29.1
Three man b     29.3       27.5     28.0    29.1    29.4    30.7    31.0     29.0         28.9

a n = 32.
b n = 24.


The annoyance checklist assembled for Project 
RIM was tailored generally to environmental fea- 
tures that might have been annoying to RIM sub-
jects (e.g., lighting, odors, size of the room, food,
lack of sink and running water, crowdedness, and 
dirt and clutter in the room). Respondents were 
asked on Days 2, 7, 12, 16, and 20 to indicate 
whether each of 29 categories “never,” “occasion- 
ally,” or “frequently” annoyed them during the prior 
24 hours. The annoyance checklist score included a 
point for every item checked “occasionally” and 2
points for each marked “frequently.” 
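
The checklist scoring rule is simple enough to state as code. The following is a minimal sketch of that rule only; the function name and the sample response pattern are hypothetical and do not come from the study's data.

```python
# Points per response category, as described for the annoyance checklist.
POINTS = {"never": 0, "occasionally": 1, "frequently": 2}
N_CATEGORIES = 29


def annoyance_score(responses):
    """Sum 1 point per 'occasionally' and 2 points per 'frequently'."""
    if len(responses) != N_CATEGORIES:
        raise ValueError("expected one response per checklist category")
    return sum(POINTS[r] for r in responses)


# Hypothetical response pattern for one subject on one test day.
example = ["never"] * 20 + ["occasionally"] * 6 + ["frequently"] * 3
print(annoyance_score(example))
```

On this rule a single day's score can range from 0 (nothing annoying) to 58 (all 29 categories frequently annoying), which gives a sense of scale for the group means reported in the tables.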

During confinement on a prearranged schedule, 
each of these tests was presented to the subjects on 
their closed-circuit television monitors. The pre- 
recorded tests used a television editorial style where 
the instructions and later test items slowly moved 
from the bottom to the top of the screen. The men 
responded by marking IBM answer sheets collected 
later. This system helped protect the privacy of 
answers and limited the amount of possible test 
reminiscing. 


RESULTS 


A five-way analysis of variance treating one 
within- and four between-subjects sources of 
variance, and accepting both disproportionate 
and unequal condition sizes was used for the 
data analyses reported here. It employs least 
squares techniques and was specially devel- 
oped and computer programmed for this proj- 
ect. The analysis paradigm used was that of 
Method 2 described by Overall and Spiegel 
(1969).5 In the discussion of results that


5 This program was developed and programmed 
by John Berterman, of the Planning Research Cor- 
poration, Washington, D. C. It is written in 
FORTRAN IV, for the IBM 7094. Inquiries should 
be addressed to the authors. 


follows, all significant sources of variance 
are reported. 

Of the 35 groups (81 subjects) who began 
confinement, all except one group completed 
the full duration (1 subject in a compatible, 
senior-led, crowded, two-man group left con- 
finement on Day 5, thereby aborting the 
group). In all, 1,597 man days of confine- 
ment were compiled. The 56 subjects whose 
data are reported herein all completed their 
21-day stay (1,176 man days). In general, 
then, this is a report of coping behavior. 


Time in Confinement 


TABLE 2

MEAN ANNOYANCE CHECKLIST SCORES AS A FUNCTION OF GROUP SIZE

                         Days during confinement
Condition              2       7      12      16      20
Two-man groups        13.6    16.8    18.1    18.5    17.4
Three-man groups      11.7    14.9    11.7    11.6    11.7


For this confinement-enduring population
as a whole, both the state anxiety (STAI-S)
and the Subjective Stress Scale measures
showed low but significantly heightened psy-
chological stress during confinement relative
to base-line data (“base line” versus “during”
sign test, χ² = 28.51, df = 1, p < .001, for
STAI-S; for the Subjective Stress Scale, sign
test χ² = 7.87, df = 1, p < .01; for STAI, F =
5.33, df = 1/6, ns, F.05 = 5.99; for the Sub-
jective Stress Scale, F = 12.25, p < .05). For
all subjects combined, state anxiety increased
gradually throughout the 21 days (F = 5.60,
df = 5/196, p < .001). The Subjective Stress
Scale also showed significant changes during
confinement (F = 2.89, df = 5/200, p < .02),
but with its highest peak on Day 5 and a
lesser one on Day 13. On the annoyance
checklist, subjects indicated that several en-
vironmental features annoyed them as early
as the second day, with a significant increase
in the number of items reported as annoying
(F = 5.26, df = 4/160, p < .001) by Day 7,
remaining high throughout the rest of con-
finement. The Hostility Scale did not change
significantly during the 21 days, but the hos-
tility indicated toward partners was already
surprisingly high by the first afternoon of
confinement (X̄ = 61.75, approximately the
scale value of “offended at”). In sum then,
psychological stress was generally low but
significantly elevated above base line during
confinement, while “people” and “thing”
annoyances were moderate to high.
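
As a quick consistency check on the two sign-test statistics, the tail probability of a chi-square value with one degree of freedom can be obtained from the complementary error function. The sketch below uses only that identity; it is not a reanalysis of the data, and the function name is illustrative.

```python
from math import erfc, sqrt


def chi2_df1_pvalue(chi2):
    """Upper-tail probability of a chi-square statistic with df = 1.

    With one degree of freedom the statistic is a squared standard
    normal deviate, so P(X > x) = erfc(sqrt(x / 2)).
    """
    return erfc(sqrt(chi2 / 2.0))


reported = [
    ("STAI-S", 28.51, 0.001),
    ("Subjective Stress Scale", 7.87, 0.01),
]

for label, chi2, bound in reported:
    p = chi2_df1_pvalue(chi2)
    print(f"{label}: chi-square = {chi2}, p = {p:.2g}, below {bound}: {p < bound}")
```

Both values fall under the thresholds given in the text: the STAI-S statistic is far below .001, and the Subjective Stress Scale statistic is below .01 but not below .001.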


Group Size 

As shown in Table 1, two-man-group sub-
jects registered significantly higher state anx-
iety (for main effect of group size during
confinement, F = 6.14, df = 1/40, p < .02)
throughout confinement than did those in
three-man groups. This is true both on a 24-
hour retrospection day by day during con-
finement, and a 21-day retrospection upon
completion of confinement (for the latter,
F = 4.43, df = 1/40, p < .05). Neither pre-
nor postconfinement base lines show signifi-
cant group size differences.

Table 2 shows the annoyance checklist
data for the same comparison. In addition
to the significant main effect of group size
(F = 5.26, df = 1/160, p < .001), there was
a significant Group Size × Time interaction
(F = 6.91, df = 4/160, p < .001). The three-
man-group subjects were less complaining
later in the run than were their two-man
counterparts, but there were no appreciable
differences between them earlier.


Group Size and Crowdedness 


In Figure 1 the second-order interaction
(F = 3.49, df = 5/200, p < .01) of group
size and crowdedness over time for the
Subjective Stress Scale data is presented.
While the groups were highly similar through
the first 9 days of confinement, it appears
that crowdedness had a greater effect on
three-man groups than on dyads the remainder
of the time. Least crowded three-man groups
indicated feeling the least stress of all, while
crowded three-man groups reported the most.
The Spielberger state anxiety data for the
same interaction are shown in Figure 2
(F = [illegible], df = 5/196, p < .03).

While the two-man groups were similar
but higher under both crowdedness condi-
tions, the three-man less crowded condition,
as measured, was the least anxious of all,
particularly in the early days of confinement.

Fig. 1. Subjective Stress Scale scores over time as a function of group size
and crowdedness.

Fig. 2. Spielberger state anxiety over time as a function of group size
and crowdedness.


Group Size and Seniority of Leader


The Spielberger state anxiety data also re-
veal a significant Time × Group Size × Se-
niority of Leadership interaction (F = 2.70,
df = 5/196, p < .03). As shown in Figure 3,
senior leadership in the three-man condition
led to less reported anxiety than did junior
leadership. The reverse of this was true for
two-man groups. For both group sizes, dif-
ferences attributable to seniority of leader-
ship tended to increase with time, but more
so for three-man than for two-man groups.

Fig. 3. Spielberger state anxiety over time as a function of group size
and seniority of leader.


TABLE 3

MEAN HOSTILITY SCALE SCORES AS A FUNCTION OF INTERPERSONAL COMPATIBILITY

                            Days during confinement                Post-      During
Condition           1       5       9      13      17      21  retrospection  confinement mean
Incompatible a     66.5    68.2    63.9    62.1    59.6    57.4      9.2         62.95
Compatible a       57.0    46.9    49.1    48.9    49.0    52.8     45.8         50.62

a n = 28.


Compatibility


Subjects in hypothetically incompatible
groups registered significantly higher (F =
8.54, df = 1/40, p < .01) hostility toward
their partners throughout confinement than
did groups composed to maximize compati-
bility (see Table 3). This difference is prob-
ably quite psychologically significant, too, in
that compatible subjects more often selected
“indifferent toward” or more favorable de-
scriptors, while incompatible subjects selected
more generally negative phrases such as
“offended at” to describe feelings toward their
partners. In contrast with these people an-
noyance data, incompatible subjects reported
significantly less thing annoyances than did
compatible subjects (F = 10.34, df = 1/40,
p < .01). These annoyance checklist data are
shown in Table 4. There were no significant
compatibility main effects on either the Sub-
jective Stress Scale or state anxiety measures.


TABLE 4

MEAN ANNOYANCE CHECKLIST SCORES AS A FUNCTION
OF INTERPERSONAL COMPATIBILITY

                         Days during confinement
Condition           2       7       12          16      20      X̄
Compatible         14.7    18.8   [illegible]   18.7    18.2    17.78
Incompatible       10.9    13.1    12.2         12.4    11.7    12.06


Seniority of Leader

The interaction between junior- versus
senior-leader groups and time was significant
for the Hostility Scale data (F = [illegible],
df = 5/199, p < .01). Although senior-leader-
group subjects displayed greater hostility on
Day 1, the junior-leader-group subjects dis-
played greater hostility during the remainder
of confinement (in the range of “offended at”
as compared with “indifferent toward”).
These data are shown in Figure 4.


Seniority of Leader and Compatibility


The significant interaction (F = 6.80,
df = 1/40, p < .02) between seniority of
leadership and compatibility (for Hostility
Scale data) is shown in Table 5. The greatest
hostility was clearly registered by the incom-
patible groups led by junior leaders. The
average response that these subjects used to
describe their partners fell in the range be-
tween “fed up with” and “hot under the
collar about.”


TABLE 5

MEAN HOSTILITY SCALE SCORES AS A FUNCTION
OF SENIORITY OF LEADERSHIP AND
INTERPERSONAL COMPATIBILITY

Condition         Junior leader a    Senior leader a      X̄
Compatible a        [illegible]        [illegible]       50.65
Incompatible        [illegible]           52.3           68.00
X̄                     61.75               51.9


Crowdedness


Crowdedness was shown above to be ap-
parently more of a problem in three-man
than in two-man groups. A potentially sur-
prising result is that greater hostility toward
partners (Hostility Scale) was revealed by sub-
jects in less crowded groups than by crowded
subjects (F = 4.77, df = 1/40, p < .04).
These data are given in Table 6.


Discussion


The results of this experiment, while being
generally in line with most predictions, pro-
duced several surprises as well. Definite evi-
dence emerged, indicating that three-man
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Fic. 4. Hostility Scale scores over time as a function of seniority 
of group leader. 


groups generally fared better than did dyads. 
The same was true for groups with senior 
leaders, but in both cases there were notable 
exceptions, The ability of virtually all sub- 
jects to remain in confinement for the full 21 
days was unexpected even though attempts 
were made to assure that they would. Strong 
negative feelings toward partners were experi- 
enced by many men, but these were generally 
kept covert. There seemed to be an implicit 
understanding that restraint in dealing with 
one’s partner was important. Hence, few 
angry confrontations occurred. It was clear, 
however, that many persons used an “I can 
take anything for 21 days” orientation. Such 
a style of adaptation probably would not 
be indefinitely tenable. In fact, even in this 
study, 41% of the subjects revealed through 
postconfinement psychiatric evaluations that 


during the 21 days they had manifested 
disruptive if not serious symptomatology.° 

The general hypothesis that nothing about 
human behavior is simple is certainly true for 
this experiment. The interactive relationships 
among the four independent variables over 
time reveal that conclusions about one vari- 
able cannot be made with precision without 
knowledge of the others. 


Subjective Stress or Anxiety 


The psychological stress or anxiety report- 
edly experienced during confinement was 
greater than base-line levels. Some increases 


6 This figure was derived from the evaluations 
made by our project psychiatrist, Maurice Steinberg, 
whose many contributions to the study are gratefully 
acknowledged. 


TABLE 6
MEAN HOSTILITY SCALE SCORES AS A FUNCTION OF CROWDEDNESS

                        Days during confinement                               During confinement
Condition        1      5      9      13     17     21     Postretrospection  mean
Less crowdedᵃ    65.0   58.7   61.3   61.1   56.6   57.9   54.0               60.1
More crowdedᵃ    58.5   55.9   51.8   49.9   52.0   52.3   51.0               53.4

ᵃ n = 28.




throughout the 21 days were in evidence, 
but, overall, the levels were low. This find- 
ing is consistent with results from industrial 
long-term confinement studies (McDonnell- 
Douglas, 1968; Page, Dagley, & Smith, 
1964) that typically report successful com- 
pletions and low stress, but is incon- 
sistent with earlier Naval Medical Research 
Institute studies in which 30%-50% of the 
groups aborted. This latter difference seems 
attributable to population differences (re- 
cruits versus more experienced and older 
men), more stringent conditions for volun- 
teering, and the added incentive of unusual 
duty pay. 

In the current experiment, however, some 
conditions led to more subjective stress or 
anxiety than did others. Three-man groups 
usually reported less stress than did dyads. 
In general, it appears that having two part- 
ners rather than one represented an enrich- 
ment of an otherwise bland environment. 
Combining leadership and group size, triads 
led by senior men registered the lowest state 
anxiety. Among dyads, junior leadership ap- 
peared more advantageous. Possibly the ab- 
sence of a peer follower colleague in the 
senior-leader dyads may have made the hier- 
archical structure more anxiety producing. 
In the interaction between group size and 
crowdedness, triads in less crowded rooms 
reported the least amount of stress of all. 
Triads in the more crowded rooms, on the 
other hand, were subject to considerably 
more stress, and there is at least some sug- 
gestion (from the Subjective Stress Scale) that 
crowdedness may be more stressful for a triad 
than a dyad. Perhaps this is due to the in- 
creased probability of members of a crowded- 
room triad having to maneuver around some- 


TABLE 7
ADAPTATION TO CONFINEMENT AS A
FUNCTION OF GROUP SIZE

                    Adaptation
Group size     Good   Medium   Poor
Dyads          9      5        17
Triads         16     2        6

Note. χ² = 7.75, p < .05.
TABLE 8
ADAPTATION TO CONFINEMENT OF TRIAD MEMBERS
AS A FUNCTION OF CROWDEDNESS

                      Adaptation
Crowdedness      Good   Medium   Poor
Less crowded     11     0        1
More crowded     5      2        5

Note. Median split χ² = 4.68, df = 1, p < .05.


one or finding it more difficult to share the 
cramped common space. 

Interesting corroboration for these findings 
comes from data independently derived by 
the project psychiatrist, Maurice Steinberg. 
On the basis of his psychiatric interviews and 
observations, the subjects were assigned a 
position along a 5-point confinement adapta- 
tion scale. Those persons who were rated less 
adaptive indicated significantly higher stress 
during confinement than did the men rated 
as having been more adaptive. His psychiatric
criteria also reflected the adaptive superiority
of the triad (χ² = 7.75, df = 2,
p < .05) as shown in Table 7. In addition,
as shown in Table 8, virtually all triad
members who were rated as less adaptive had
spent their confinement in more crowded
rooms (χ² = 6.92, df = 2, p < .05).
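
The chi-square values quoted for Tables 7 and 8 can be checked from the cell counts. The following is a minimal sketch of a Pearson chi-square test of independence, not necessarily the authors' own procedure. The function name `chi_square` is illustrative, and the last cell of Table 8 (more crowded, poor adaptation) is assumed to be 5, the value implied by the triad totals in Table 7.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


# Columns are good, medium, and poor adaptation.
table7 = [[9, 5, 17],   # dyads
          [16, 2, 6]]   # triads
table8 = [[11, 0, 1],   # less crowded triads
          [5, 2, 5]]    # more crowded triads (last cell assumed, see lead-in)

stat7, df7 = chi_square(table7)
stat8, df8 = chi_square(table8)
print(f"Table 7: chi-square = {stat7:.2f}, df = {df7}")
print(f"Table 8: chi-square = {stat8:.2f}, df = {df8}")
```

Under these assumptions the sketch gives about 7.74 for Table 7 and about 6.92 for Table 8, in line with the values of 7.75 and 6.92 reported in the text.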

Taken in sum, these data support the no- 
tion that selection of three-man crews for 
confinement missions is likely to be beneficial 
as long as there is at least 200 cubic feet 
of usable space available per man. This con- 
clusion, although admittedly drawn from lim- 
ited experiences of young all-male crews, 
clearly counters for this situation the popu- 
larly held adage that “three’s a crowd.” 

One finding from the Spielberger STAI is
extremely interesting and may have important
implications. Upon completion of
confinement, subjects revealed through the
state anxiety measure that their feelings of
anxiety of the moment had returned to base-line
levels. In direct contrast, the presumably
stable trait anxiety scale reflected a large
pre- to postconfinement drop. Only seven of
the men failed to show a reduction, the average
pre/post difference being 5.8 points
(F = 29.61, df = 1/40, p < .0001). While a
recency effect contrasting the just-completed,
difficult experience with times of usual stress
might be an explanation, prior evidence from
test-retest use of the trait scale does not support
this interpretation (Spielberger, Gorsuch,
& Lushene, 1969). It is possible that subjects
in long-term-confinement studies gain considerable
emotional growth from having had this
unusual and insight-producing experience, in
terms of facing self and inescapable partners
in more direct fashion than ever before. Such
an explanation gains some support from recently
collected sensory deprivation data.
Persons who have undergone multiday, dark, 
quiet, solitary isolation show statistically sig- 
nificant postconfinement increases in “open- 
ness to experience” (Fitzgerald, 1966), a 
measure, which, when administered preconfine- 
ment, successfully predicts tolerance for sen- 
sory deprivation (the greater the openness, 
the greater the isolation endurance and toler- 
ance). This interpretation of the trait anxiety 
data, however, is highly speculative. 


People and Thing Annoyances 


In contrast with the Subjective Stress Scale 
and the STAI, which reflect general anxious 
discomfort, the Hostility Scale and annoyance 
checklist presumably measure more targeted 
sources of irritability or frustration. One in 
crowded confinement has literally no place to 
vent his frustrations other than to target the 
people and things of his unending immediate 
surround or those on the outside responsible 
for his plight and fate. In spite of the 
success of most of the Project RIM sub- 
jects in enduring their 21-day confinement, 
their Hostility Scale and annoyance checklist 
data revealed such frustrations throughout 
confinement, beginning surprisingly early. 

Large differences were found in hostility 
(Hostility Scale) toward partners as a func- 
tion of both compatibility and seniority of 
leadership. Less hostility was generated in 
those groups composed to maximize interper- 
sonal compatibility than was so for hypo- 
thetically incompatible groupings. In similar 
fashion, groups with junior leaders admitted 
feeling considerably greater hostility as con- 
finement progressed than did those with senior 
leaders. As reflected in Table 5, the condition 
that seems to account most directly for these 
findings combines interpersonal incompatibility
with junior leadership. That incompatibility
should lead to partner-directed hostility
is in line with obvious predictions. The lead- 
ership matter is less obvious. It seems likely 
that senior leadership was more easily accepted 
due to recognition of greater experience and 
maturity (increased status congruency; cf. 
Adams, 1953; Exline & Ziller, 1969). In con- 
trast, competition for the actual leadership 
role was probably more common in those 
groups led by a junior peer whose credentials 
were certainly more open to debate. 

An unexpected finding was the reported 
feeling of greater hostility among subjects in 
less crowded conditions. Perhaps subjects in 
more crowded conditions became better tuned 
to the need to adapt to their partners. This 
explanation, unfortunately, is not testable 
with the present data. 

Various environmental and situational fea- 
tures were irritants even as early as the 
second day of confinement. Generally the 
things that subjects found most annoying 
(annoyance checklist) throughout confine- 
ment were lack of reading material; lack of a 
shower facility; toilet facilities; lack of a TV 
and radio; the questionnaires; lack of a sink 
and running water; lighting; and temperature 
and humidity. Things they found least annoy- 
ing were mission rules and regulations; con- 
trol center personnel; noises in the room; 
odors; lack of a telephone; cooking facilities; 
and equipment in the room. It would appear, 
then, that creature comfort and entertain- 
ment needs were uppermost in the problem 
hierarchy. 

The finding that dyads were more com- 
plaining than triads appears consistent with 
other data showing that triads generally func- 
tioned more easily. The significantly greater 
“thing” annoyance of compatible groups over 
incompatible is less easy to explain. One pos- 
sibility is the potential for interpersonal con- 
flict to overshadow other sources of annoy- 
ance. Hence, by contrast with the interper- 
sonal adjustment problem, environmental 
annoyances may have seemed less significant 
to members of less compatible groupings. 
Subjects in more compatible groups did not 
particularly enjoy confinement either. How- 


ever, since their social needs were being rela- 
tively better met, their targets of irritation 
were apparently more strongly focused on the 
limited environment and recreation rather 
than on each other.


Tolerance for Confinement 


Although this report reflects various
intercondition differences in the management of
the confinement experience, it would be well
worth recognizing that these were highly
self-selected, coping individuals. As such,
these results are underestimates of the problems
many people would face. Many members
of the general society would consider
participation in such a venture pure insanity. And,
indeed, as evidenced in other studies, a very
large proportion of the population would fare
poorly in crowded confinement. By making
data available to potential volunteers
that would virtually guarantee exclusion of
unsuitable and less than fully dedicated
participants, few persons in fact would volunteer.
Probably this discouragement of all but
the most self-confident and dedicated few had
the most to do with the high proportion of
completers in this study as compared with
prior Naval Medical Research Institute
studies that made informed and realistic
self-screening less possible. Another highly
significant feature of any confinement experience
is the quality of food available. Because
food represents one of the few gratifications
available during confinement, it is not surprising
that poor food is frequently one of
the major sources of confinement complaint


self-select into such an experience who will
do very poorly. While this may not be much
of a problem in studies where a return to
more enriched surrounds is easily possible,
in the real world of isolated duty stations,
hyperbaric undersea habitats, man in space,
and the like, serious consequences of poor
selection may result. At this writing, for
example, a fatal shooting of a member of a
small scientific team manning a floating Arctic
ice island is under investigation.


CONCLUSION


The data reported in this paper are indicative
of how subtle changes in environmental
conditions may influence subjective reactions
to isolation and confinement. The independent
variables that were employed all evaluated
fairly narrow extremes. For example, both
the 75- and 200-cubic-feet crowdedness conditions
represented relatively little available
space. Also, the senior leaders employed were
not particularly advanced beyond their junior
counterparts in most cases. Further, the
compatibility manipulation depended on those
levels of the three personality factors (control,
affection, need achievement) that the
14 subjects happened to possess. The computer
merely made assignments to maximize
or minimize the compatibility by composition
rather than by selection. Finally, both
triads and dyads would obviously be considered
rather small groups. In spite of these
range limitations, potent differences were
found in some instances that suggest the
following conclusions:

1. Compatibility can be manipulated by
fairly simple assignment procedures, and
groups composed to be, hypothetically, maximally
compatible adapt better to each other
in confinement, but may then focus their
irritations on things or nonpersonal aspects
of the situation.

2. A triad may be a better crew size than
a dyad (disregarding duty requirements), but
particularly and perhaps only if senior (or
respected) leadership and adequate space to
avoid extreme crowding are available.

3. Within the constraints of the limited 


` 


o groups and with jeer Perouility 
= compositions. 


4. Extreme crowding may be more stressful
or anxiety producing for larger than for 
smaller groups. 

5. Even in groups comprised of men well 
beyond the late adolescent age of the recruits 
used in earlier studies—men who have had 
in some instances a fair degree of experience 
in their military careers—isolation and con- 
finement in groups for relatively short periods 
of time (21 days) can often produce reports 


of significantly heightened negative affect, 
Twenty-one male volunteer undergraduates were measured for attitude and
attitude difference to a pro-French speaker and a pro-English speaker while
heart rate (heart rate and heart rate level change) and heart rate variability
(heart rate variability and heart rate variability change) were measured. Both
heart rate variability and heart rate variability change are predictive of
attitude, and the level of heart rate is not an interesting variable in this
predictive relationship.


Recently a number of researchers have at- 
tempted to join the two fields of social psy- 
chology and psychophysiology. This holds 
promise of enriching both areas, for as 
Shapiro and Crider (1969) have noted: 


a growing understanding of the social psychology 
of the psychophysiological experiment points up 
sources of variation in physiological measures and 
conversely, the remarkable sensitivity of these mea- 
sures to social attributes such as race, sex, and status 
leads to a fuller appreciation of the impact of social 
factors on individual behaviour [pp. 37-38]. 


A number of studies (Anisfeld & Lambert,
1964; Lambert, Anisfeld, & Yeni-Komshian,
1965; Lambert, Hodgson, Gardner, & Fillenbaum,
1960) have used tape-recorded
messages to study evaluative reactions toward
a speaker. The assumption behind this
research is


that a listener’s attitude toward members of a par- 
ticular group should generalize to the language they 
use and consequently his evaluational reactions to 
the spoken language should be similar to his reactions 
when actually in contact with a group member 
[Anisfeld & Lambert, 1964, p. 89]. 


The study by Lambert et al. (1960) used a 
6-point scale on which 14 traits were rated. 
In that study, the same passage was read in 


1 The author would like to thank Robert A. Altemeyer
for his aid and constructive comments, Rene
Villa for his infectious interest in the studies
produced by Lambert, Hodgson, Gardner, and Fillenbaum
(1960), John Adamson for his constant support,
F. Chebib for his guidance with the statistical
analysis, and Gerta Hes for her aid in the final
preparation of this manuscript.

Requests for reprints should be sent to J. Alan
Burdick, Department of Psychiatry, University of
Manitoba, Winnipeg, Manitoba, Canada R3E 1Y9.
French and English, and the subjects (who
were from two separate populations of French 
Canadians and English-speaking Canadians) 
were asked to judge the physical character- 
istics of a speaker from his voice. It was 
found that both French- and English-speaking 
subjects evaluated the English speaker higher 
in all desirable traits. These studies lead to 
the conclusion that a spoken language (even 
when not understood) is a powerful influence 
in the shaping of attitude. While we did not 
use a communication in French, we used 
English spoken with a French accent. The 
study by Larimer (1968) changed Lambert’s 
technique by the use of a 7-point scale, and 
used seven 2-minute propaganda speeches. 
This 7-point scale would allow the use of 
factor analysis of the type used by Osgood 
and Suci (1955) to objectively quantify the 
questionnaire data, and while Larimer did not 
do this, the data in the present study were 
factor-analyzed to obtain such a measurement 
of attitude.

The use of heart rate as a physiological
parameter in investigations of arousal, affective,
and/or cognitive tasks has a long history
in psychology and has become increasingly
important in social psychology (Shapiro &
Schwartz, 1970, cover this background carefully).
Evidence has been reported that suggests
that a relationship between heart
rate and attitude does exist. For example,
Buckhout (1966) has reported that a change 
in attitude is associated with a decreased 
heart rate. Kagan and Moss (1962) noted 
that “large changes in heart rate tend to be 
more selective than changes in conductance 
with respect to the psychological nature of 
the stressor (AHR) [p. 242].” They further 
noted that


the men who were above average intelligence, who 
actively sought status goals as adults and who were 
interested in and confident about their intellectual 
competence as adolescents, had the greatest increases 
in heart rate during the administration of most of 
the intellectual tasks [pp. 248-249].


As a whole, the dimension of heart rate level 
change is said to be sensitive to whether the 
subject wants to accept environmental inputs 
(heart rate decreases) or reject them (heart 
rate increases) (Lacey, Kagan, Lacey, & Moss, 
1963). In addition, Elliott (1969) has sug- 
gested that “the critical features controlling 
tonic HR acceleration are the instigation, 
anticipation and initiation of responses, and 
the presence of incentives [p. 211].” 

In almost all of the measurements of physio- 
logical functioning, another dimension has 
been of interest to the psychophysiologist, 
that is, the variability of the parameter 
studied. In general, these investigations were 
seen as of interest to the measurement of 
“traits” or “types.” In this context, the level 
of heart rate variability has been said to be 
related to “personality types.” Cardiac labiles
were reported as “manifestly anxious, hysterical,
and overtly reactive,” while cardiac
stabiles were reported as “phlegmatic, repressed,
resilient, and lacking affect” (Lacey
& Lacey, 1958).

Since heart rate level and heart rate vari- 
ability are reported to be correlated, the 
present study used a form of analysis to 
control for this interaction. The coefficient 
of temporal variability, which has been 
shown to highly correlate with other meth- 
ods for the measurement of heart rate 
lability (Burdick, 1968), was also used as a 
measure of heart rate variability. 


METHOD 


Twenty-one male undergraduates from the de- 
partmental subject pool were tested in a psycho- 
physiological experiment measuring electrocardiogram 
(EKG) (by means of standard lead I, recorded on 
a Grass V polygraph) while they listened to two 
tape-recorded positions (passage order was counter- 
balanced) on French Canadians. It is likely that 
the subject material was of some interest because 
of the illegal activities of the Quebec Liberation 
Front, since this was of national interest at the time 
of the experiment. 


Upon the subjects’ (two at a time) arrival in the 
experimental room, the experimenter introduced 
himself and asked them to be seated at a table and 
chair. He played the following instructions from 
a tape. 


This is an experimental investigation of the
extent to which people's judgments about a speaker
are determined by his voice. You know of the
common phenomenon of having a voice on the
radio or telephone summon up a picture in the
listener’s mind of the person speaking. In this
experiment we will see how reliable this commonly
accepted idea is. I will be playing you two
tape-recorded sections of two people reading
material about the same topic. In this case we have
asked them to write and read a section on their
own personal idea of the position and importance
of French Canada. It so happens that their positions
are quite different, but we wanted them to
speak their own thoughts so that their voice would
be most natural for each of them. After each
of these is played, I would like you to fill out
this speaker evaluation sheet,² so that we can
measure your accuracy in your perception of that
individual. So as you listen to each speech, try
to form an impression of what the person you
are listening to is really like.

We are interested in the measurement of heart
rate as an index of how nervous you are. Two
measurement plates will be attached to both arms
in order to record this. Since previously published
work has shown that the ability to perceive the
physical characteristics of a speaker is grossly
impaired by a (quote) state of nervousness (unquote)
as measured by heart rate, I would like you to
relax as much as you can.


The speaker-evaluation-sheet forms were in the 
form of a standard 7-point semantic differential 
scale. 

The speaker evaluation list was as follows: (a)
tall-short, (b) handsome-ugly, (c) amusing-boring,
(d) intelligent-stupid, (e) fat-thin, (f) weak-strong,
(g) not religious-religious, (h) self-confident-shy,
(i) kind-mean, (j) gentle-cruel, (k) friendly-disagreeable,
(l) nice-evil, (m) interesting-irritating,
(n) not trustworthy-trustworthy, (o) blond hair-dark
hair, (p) dependent-independent, (q) good
disposition-bad disposition, (r) wise-foolish, (s)
helpful-not helpful, (t) passive-active.

One passage read by a male graduate student with 
a French accent was: 


[pro-French condition] For 200 years French 
Canada has existed side-by-side with English 


2The experimenter handed the subjects the sheets 
at this time. (Speaker-evaluation-sheet instructions 
are in part from Osgood & Suci, 1955.) Four items 
added to the questionnaire used by Larimer (1968) 
to establish face validity of the cover story were 
(a) fat-thin, (b) weak-strong, (c) blond hair—dark 
hair, and (d) dependent-independent. 
Canada. Throughout these 200 years, while English
Canada was becoming more and more americanized,
French Canada was developing a distinctive
Canadian culture. Where English Canada was de- 
pendent, first upon England and then upon the 
United States for its ideas, French Canada was 
progressing—trying out new ideas, introducing 
new reforms, both political and social. Where 
English Canada was content to live in the past, 
French Canada was progressing into the 20th cen- 
tury. In fact, it would be stated that any progress 
Canada has made has been largely due to French 
Canada. 

Although English Canada comprises a sizable 
majority within Canada at large, French Canada 
has contributed to the culture of Canada way 
out of proportion to its number. Because of 
its language and culture, French Canada has 
been able to make great contributions to the arts 
and literature. Its films are becoming world renowned.
Its leaders are becoming widely known,
not only within Canada, but throughout North 
America. In the future, French Canada will be 
a leader in Canadian thought. 

When one looks realistically from all points of 
view, he must conclude that if Canada is to sur- 
vive, French Canada must be preserved. Canada 
would not be complete without French Canada
[Larimer, 1968, p. 283].


While this message was read, an EKG was taken; 
at the end of this passage, the subjects were asked 
to fill out a rating of the speaker evaluation sheet 
on the person who read the message. 


TABLE 1
FACTOR ANALYSIS OF SPEAKER EVALUATION SHEETS
WITH HIGHEST LOADED PAIRS

Factor                          Loading   % of variance
I (Evaluative)                            30.7
  Kind-mean                     .83
  Friendly-disagreeable         .75
  Handsome-ugly                 .74
  Helpful-not helpful           .70
  Interesting-irritating        .68
  Nice-evil                     .67
II (Activity-potency)                     14.0
  Passive-active                −.64
  Self-confident-shy            .61
  Weak-strong                   ase)
  Gentle-cruel                  00
III (Physical appearance)                 9.9
  Fat-thin                      -10
  Amusing-boring                .60
  Blond hair-dark hair          .92
IV (Religiousness)                        8.3
  Not religious-religious       −.63
  Blond hair-dark hair          SS
  Interesting-irritating        AT

Note. 5% significance level = .43; 1% significance level
= .56.


TABLE 2
FACTOR SCORE DIFFERENCES BETWEEN CONDITIONS

Factor   Mean pro-French   Mean pro-English   M diff   SD of differences   t
I        +2.58             −2.58              5.17     7.08                3.34**
II       +1.70             −1.70              3.40     3.59                4.34**
III      +.32              −.32               .64      2.69                1.10
IV       −.63              Oo}                1.29     2.33                2.45*

* p < .05.
** p < .01.


After the sheets had been rated and collected,
another passage (pro-French speech in first half, pro-
English speech in other half) was played that, in
half the testing procedure, was a recording of a
psychiatrist whose native language is English.


[pro-English condition] There was a period of
time when Canada could have existed with two
nations within the boundaries of one country. That
time has past. The conditions of the world have
changed. No nation can exist for long with two
separate and distinct languages and cultures within
its borders. Progress is impeded. Communication
is hampered. And, eventually, such a situation
leads to national stagnation. Therefore, the only
possible solution to such a dilemma is for Canada
to become one culture with one language
predominant.

Canada has always been closely related to the
English-speaking world. French Canada, on the
other hand, has made little contribution to the
development of Canada. Mainly, its contribution
takes the form of nostalgia; a remembering of a
freer past. Canada’s culture and economy have
always been closely linked to the culture and
economy of English-speaking people. There continues
to be friendly and free exchange of goods
and cultural events between Canada and other
English nations. In short, culturally, Canada is an
English country. Therefore, each minority, including
the French Canadian one, should be
required to learn to communicate in English. Such
a situation is not only desirable, but absolutely
essential in this day and age if Canada is to move
into the 20th century and take its place among
the leaders of the world. Canada must present a
united country. The country would be united if
French Canada were assimilated [Larimer, 1968,
p. 238].


DATA ANALYSIS 
TABLE 3
PRODUCT-MOMENT CORRELATIONS BETWEEN ALL VARIABLES

Variable                              1       2       3       4       5       6       7
1. Factor I change
2. Factor II change                   −.54*
3. Factor III change                  a3      −.06
4. Factor IV change                   −.08    Soules  .27
5. Heart rate                         .17     -2S     −.02    −.05
6. Heart rate variability             = 93"   .20     −.08    −.13    −.63**
7. Change in heart rate               =A7     .09     −.17    .07     .18     .00
8. Change in heart rate variability   −.50*   .21     .10     −.12    −.24    00S*    −.20

Note. df = 19.
* p < .05.
** p < .01.


EKG data for each individual were analyzed
for each condition by measuring (to the
nearest .5 millimeter and then converting to
beats per minute) 60 successive R to R
intervals (60 × 21 × 2 = 2,520 measurements
for the group). Each record was scored for
mean rate (heart rate) and the ratio of
the square root of the mean-square successive
difference divided by the mean rate (coefficient
of temporal variability), the measure
of heart rate variability.
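
The variability index described above can be written out directly. The sketch below assumes that the heart rates are already expressed in beats per minute. The function name and the sample values are hypothetical and are not data from the study.

```python
import math


def coefficient_of_temporal_variability(rates_bpm):
    """Root mean-square successive difference divided by the mean rate."""
    if len(rates_bpm) < 2:
        raise ValueError("need at least two heart rate values")
    # Differences between each value and the one that follows it.
    successive_diffs = [b - a for a, b in zip(rates_bpm, rates_bpm[1:])]
    mean_square = sum(d * d for d in successive_diffs) / len(successive_diffs)
    mean_rate = sum(rates_bpm) / len(rates_bpm)
    return math.sqrt(mean_square) / mean_rate


# Hypothetical beat-to-beat rates (beats per minute), for illustration only.
sample_rates = [70.0, 72.0, 68.0, 70.0]
ctv = coefficient_of_temporal_variability(sample_rates)
mean_rate = sum(sample_rates) / len(sample_rates)
print(f"mean rate = {mean_rate:.1f} bpm, CTV = {ctv:.4f}")
```

In the study each record would contribute 60 values rather than the four used here. Dividing by the mean rate makes the index relative to the subject's heart rate level.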
The 42 speaker evaluation sheets were subjected
to principal component analysis, and
factor scores for the first four factors were
calculated for each sheet separately. The
differences between these factor scores (scores
from the pro-French condition were always
subtracted from the values obtained for the
pro-English condition) became the criterion
variable (Y′).

The heart rate and heart rate variability 
measures in the pro-French condition were 
subtracted from the heart rate and heart 
rate variability measures in the pro-English 
condition. The resulting change scores (heart 
rate and heart rate variability) for each indi- 
vidual were treated as independent measures 
(X). The level of heart rate and heart rate 
variability was the mean of both conditions 
for each subject. Multiple linear regressions 
were calculated where appropriate. The use of 
difference scores aided in the reduction of the 
variance from between-individual scores; the
mean level scores aided in the reduction of
“error” (variance), which is posed by the 
problem of how to analyze data that are af- 
fected by the “law of initial values” (Wilder, 
1968). 
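
The two derived scores described above can be sketched in a few lines of Python. The subject labels and the numbers are hypothetical, chosen only to show the arithmetic: the change score is the pro-English value minus the pro-French value, and the level is the mean of the two conditions.

```python
def change_and_level(pro_english, pro_french):
    """Return (change, level) for one subject's pair of condition scores."""
    change = pro_english - pro_french        # pro-French subtracted from pro-English
    level = (pro_english + pro_french) / 2   # mean of both conditions
    return change, level


# Hypothetical heart rate variability scores: (pro-English, pro-French).
subjects = {"S1": (5.0, 6.5), "S2": (7.25, 4.25), "S3": (3.0, 3.0)}
scores = {sid: change_and_level(e, f) for sid, (e, f) in subjects.items()}

for sid, (change, level) in scores.items():
    print(sid, change, level)
```

Using the difference removes each subject's own baseline from the comparison, while the level keeps that baseline available as a separate predictor.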


RESULTS 


A principal component analysis of the 
speaker evaluation sheets was performed, and 
four factors were identified which accounted 
for 63% of the total variance. The results 
are found in Table 1. 

Factor scores for each factor were calcu- 
lated for each subject for each condition. A 
t test (paired) was calculated between the 
two conditions, and the results are presented 
in Table 2. 

Table 3 presents the first-order correlations 
between the variables. 

Multiple linear regressions of Factor I score 
differences as criteria and heart rate measures 
as predictors were calculated, and the results 
are found in Table 4. 


TABLE 4
MULTIPLE LINEAR REGRESSIONS WITH THE CHANGE VALUE OF FACTOR I AS CRITERIA

                                                Heart rate     Change in      Change in heart
Criterion                Intercept  Heart rate  variability    heart rate     rate variability   Fᵃ
Y′ change in Factor I    330        +.062                      −.534          −1.397*            2.856
Y′ change in Factor I    24.666     −.151       −1.241*        −.230                             2.996
Y′ change in Factor I    17.101     −.088       −.782          −.375          −.819              2.655

ᵃ df = 3/17.
*p < .05.


DISCUSSION


The principal component analysis of the speaker
evaluation sheet showed that this questionnaire is
made up of independent dimensions of semantic space.
Like the factor analysis produced by Osgood and Suci
(1955), it produces four factors, which account for a
little more than half of the total variance (63%). The
first factor seems to be “evaluative” in nature (and it
is this factor that would be labeled “attitude” by
Osgood & Tannenbaum, 1955). The second factor is a
mixed “activity-potency” dimension, which Osgood and
Suci found to separate into two factors. This
difference may be totally due to the use of different
bipolar adjectives in each factor analysis. The third
and fourth factors are more difficult to interpret;
they are labeled from the highest loaded pair,
“physical appearance”² and “religiousness.”

Table 2 shows that the two speakers were 
rated significantly different on three of the 
four factors. In short, the pro-English speaker 
is rated as being of higher evaluation (kinder, 
friendlier, etc., Factor I) than the pro- 
French speaker. The pro-English speaker is 
rated more active and stronger (Factor II) 
than the pro-French speaker; and the pro- 
English speaker is rated less religious than 
the pro-French speaker (Factor IV). 

Osgood and Tannenbaum (1955) have identified the
evaluative factor as measuring “attitude,” and the
data from this study seem to support their opinion
and further suggest that parameters of heart rate
variability are significantly related to this
dimension of attitude. It is interesting to note that
although the factors are independent, the changes in
factor scores are not independent between Factors I
and II (r = −.54) and Factors II and IV (r = .61).
This could imply that when attitude is measured
(Factor I), potency and activity (Factor II) are not
truly independent when assessing changes of
evaluation.

Objectively judged, the speakers’ voices 
were different in quality. The pro-English 
speaker was a psychiatrist who had a deep 
voice and appeared to use it in a rapport- 
seeking manner. The pro-French speaker was 
a psychological graduate student who spoke 
with hesitation, more slowly and with a tenor 
voice. Neither speaker agreed totally with the 
speech he read. The choice to use a person
with a French accent to read the pro-French
speech was made to increase the identification
of the speaker with a social group. Unlike in the
study by Lambert et al. (1960), the speech was
read not in French but in English. There is a
difference of opinion as to the importance of
French Canada in Canada. Manitoba contains a large
French population, but tends to keep an “open mind”
on the opinions voiced here; however, it was expected
that the pro-English speaker would be more
acceptable than the pro-French speaker.

Overall heart rate (mean heart rate level
change = −.51, SD = 3.9) was higher, and
the heart rate variability (mean heart rate
variability change = [illegible], SD = [illegible])
decreased when the subjects listened to the
pro-English speaker when compared to the
pro-French speaker, but none of those changes
approached statistical significance. The mean
heart rate level for both conditions was a 
little high for the group (mean heart rate = 
83.7, SD = 10.9), but a “normal” heart rate 
may range greatly from person to person. 
Clinically, 83.7 beats per minute would be 
seen as normal for a subject in a psychologi- 
cal experiment with EKG electrodes attached 
to both arms. Heart rate and heart rate varia- 
bility were significantly negatively correlated 
(r= —.65, df= 20, p < .01) as found previ- 
ously. There was a significant first-order cor- 
relation between heart rate variability and 
the change on Factor I (r= —.53, p< .01) 
and the change in heart rate variability and 
the change in Factor I (r= —.50, p< .05). 
In this experiment, the heart rate “labile” 
tended to change his attitude (Factor I) between
the conditions less than the cardiac
“stabile”; this seemed to be counter to the 
clinical description of these types as reported 
by Lacey and Lacey (1958). Likewise, the 
more a subject changed in his attitude, the 
less change was seen in his heart rate varia- 
bility scores; that is, change in attitude was 
associated with an increase of stability in the 
variability of the heart rate. 

If we divide the subjects at the median
heart rate variability score (Mdn = [illegible]
5.64, SD = 3.85) and call the subjects above
the median “labiles” and those below the median
“stabiles,” we can compare some of these
relationships more completely. The labiles
have a mean difference score on the Factor I
dimension of 3.50 (SD = 7.98, n = 10), while
the stabiles’ mean difference score is 6.62 (SD
= 6.07, n = 11). The labiles have a mean
heart rate variability change score of −1.13
(SD = 4.81), while the stabiles’ score is .05
(SD = 3.02). The mean heart rate of the
labiles is 79.5 (SD = 11.67), while the stabiles
have a mean level of 87.6 (SD = 8.86). The
significant negative correlation between heart
rate variability and the evaluative factor could
be interpreted in several ways. It could be
suggested that cardiac labiles evaluated the
pro-English speaker more equally with the
pro-French speaker because they “saw
through” the cover story and refused to
conform³ to the experimenter’s expectancy,
which anticipates the pro-French speaker to
be evaluated much lower than the pro-English
speaker. This is somewhat supported by the
evidence that Buckhout (1966) reported,
where anticonformers had lower heart rates
than conformers. On the other hand, if the
cover story were effective, one would expect
those members of the group who found the
messages most “arousing” (i.e., the cardiac
stabiles with higher heart rate level) to
evaluate the two points of view as being most
different.⁴ Both explanations are weakened by
the fact that none of these differences is
significant and that heart rate level fails to relate
significantly, at any level, to Factor I values.

³ The two extreme individuals who evaluated the
pro-French speaker as being “better” than the
pro-English speaker were both labiles; all other
individuals evaluated the two conditions as equal or the
pro-English speaker as better.

⁴ Mean absolute change for Factor I for the
stabiles was 6.67, while for the labiles it was 6.27.

The negative correlation of heart rate variability
change and the evaluative factor change suggests
that those subjects who changed most in their heart
rate variability from the pro-English speaker
condition to the pro-French speaker condition
reported that the pro-English speaker was higher or
more equal in evaluation than the pro-French
speaker, as compared to those subjects who
changed least in their heart rate variability.
The positive correlation between heart rate
variability and change in heart rate variability
shows that the cardiac labiles produced the most
change in the heart rate variability dimension but
did not rate much difference between the two
conditions. From the heart rate data, one could
speculate that the stabiles were more
“nervous” or “aroused” than the labiles. Is
this difference in mean level of heart rate
giving us a false impression of the strength of
the association between heart rate variability
(or heart rate variability change) and attitude?
The data in Table 4 present the information to
answer this question. From the first
equation we see that if we assess the predictive
power of change in heart rate variability and
“control” for both heart rate and heart rate
change, the predictive value of heart rate
variability change is still significant at the .05
level. Likewise, heart rate variability is also a
significant predictor at the .05 level when the
same two variables are controlled. Neither
heart rate nor change in heart rate affected the
predictive relationship between attitude and
heart rate variability (or, separately, heart
rate variability change). It is possible to go
one step further and tell whether heart rate
variability (or heart rate variability change)
is predictive if the other is held constant
(along with the heart rate parameters); and
the answer to this is that neither heart rate
variability nor heart rate variability change
is a significant predictor of attitude if the
other is held constant. It is likely, therefore,
that replication of our results will hinge
strongly on the Heart Rate Variability ×
Change in Heart Rate Variability interaction
(r = .60) that is found in this study.
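
The idea of “controlling” for some predictors while examining another can be sketched with an ordinary least-squares fit. The Python example below is only an illustration under stated assumptions: the function name, the six subjects, and all of their values are invented, and the sketch estimates coefficients only (it does not perform the significance tests reported in Table 4).

```python
def fit_linear_regression(predictors, criterion):
    """Return [intercept, b1, ..., bk] that minimize squared error.

    Solves the normal equations (X'X)b = X'y by Gauss-Jordan elimination.
    """
    rows = [[1.0] + [float(v) for v in row] for row in predictors]
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    aug = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * y for r, y in zip(rows, criterion))]
        for i in range(k)
    ]
    for col in range(k):
        # Partial pivoting keeps the elimination numerically stable.
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("predictors are collinear")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(k):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][k] for i in range(k)]


# Hypothetical subjects: (heart rate, change in heart rate,
# change in heart rate variability) and their Factor I change scores.
predictors = [
    (80, 1.0, -2.0), (92, -0.5, 0.5), (75, 2.0, -1.0),
    (88, 0.0, 1.5), (84, -1.5, 0.0), (79, 0.5, -3.0),
]
factor_i_change = [8.0, 2.0, 5.5, 1.0, 4.0, 9.5]

coefficients = fit_linear_regression(predictors, factor_i_change)
print([round(b, 3) for b in coefficients])
```

The last coefficient is the weight for change in heart rate variability with the two heart rate terms held constant, which corresponds to the kind of comparison made in the first equation of Table 4.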

Our method of measuring “lability” differs
from some others in that the heart rate
measurement was taken during the stimulus
period, not during a “rest” state with no
stimulus. It was noted in a former paper
(Burdick, 1968) that the test-retest reliability
of such rest background activity is poor
(p = .48, df = 12, ns) and that other
research has also reported the same observation
(Johnson, 1963). The results from the present
study lead us to the conclusion that heart rate
variability is a highly sensitive parameter of
attitude, and this sensitivity to slight changes
in the affective environment might lead to the
poor test-retest reliability in the rest condition.

The overall conclusion is that change in
attitude is predictable from a subject’s affective
state at the moment (as measured by the
mean heart rate variability) and from the
amount of change in this variability of heart
rate. The affective state that the subject happens
to be in when he is tested is directly
related to his psychophysiological reaction,
and this interaction is related to attitude
measurement. While we have assumed the
nature of the process underlying attitude to
be affect (following Osgood & Tannenbaum,
1955), a strong case could be argued for a
cognitive nature rather than an affective one.
The particular form of analysis used in this
experiment (i.e., the use of differences between
two conditions) was helpful in the
reduction of “error,” but, at the same time, it
limited the amount of generalization that
could be made from this experiment. A study
presently underway (Burdick, Adamson, Chebib,
& Koulack, 1972), in which a number of
visual stimuli are presented, uses a different
form of analysis, and the topic of the
stimuli is not of immediate public importance.
While the present experiment could be
criticized for the use of a complex design with
relatively few subjects, it is clear that heart
rate variability must be seen as an important
physiological correlate of attitude as measured
in this experiment.


ROLE OF RESPONSIBILITY AND VIOLATED EXPECTANCY
IN THE AROUSAL OF DISSONANCE*
University of North Carolina        Duke University


It was hypothesized that a violation of expectancy will result in dissonance 
only when an individual feels some responsibility for the creation of the 
expectancy. Subjects were led to believe that their partner in a game might 
possess a trait that would hurt their chances of winning money. Some subjects 
were made to feel some responsibility in creating the trait expectancy, and 
others were not. All subjects lost money either because their partner
possessed the expected trait or an unexpected characteristic. Measures of
attraction to the partner were employed as indicators of dissonance reduction,
and the results showed that when negative consequences followed the
confirmation of the expectancy, the subjects experienced dissonance regardless
of who was responsible for the creation of the expectancy. However, when
negative consequences followed a violated trait expectancy, dissonance was
experienced only when the subject felt responsibility for forming the expectancy.


Despite the large volume of research on the 
theory of cognitive dissonance (Festinger, 
1957), there remains considerable disagree- 
ment about the necessary and sufficient condi- 
tions for the arousal of dissonance (Abelson, 
Aronson, McGuire, Newcomb, Rosenberg, & 
Tannenbaum, 1968). In an attempt to indi- 
cate necessary conditions, Aronson (1968) 
argued that dissonance should be aroused
whenever events disconfirm expectancies,
especially expectancies about the self. Taking
a somewhat different approach, Bramel
(1968) proposed that the violation of any
expectancy should arouse dissonance because
it indicates to the individual that he is
incompetent and therefore unworthy. Yet a
third view, offered by Carlsmith and Freedman
(1968), is that dissonance will be
aroused only when the cognitions that are
discrepant with each other were expected or
foreseeable. These views of what is necessary
for the arousal of dissonance appear to be
incompatible. However, the purpose of the 
experiments to be reported here was to show 
that these different views complement each 
other in specifying the necessary conditions for 
the arousal of cognitive dissonance. 

The position that discrepant cognitions 
must be foreseeable in order for them to 
arouse dissonance or increase its magnitude 
has recently received rather compelling sup- 
port from an experiment by Cooper (1971). 
In this experiment, subjects worked with a 
partner on a cooperative task in which the 
partner’s timidity or braggartliness cost the 
subject money. Subjects were forewarned that 
their partner might be timid or braggartly, 
and some were simply assigned to that part- 
ner, while others were given the impression 
that they could choose another partner if they 
wished (although none actually did). Some 
subjects then found that they lost because 
the warning proved correct, while others 
found that they lost because their partner 
had the opposite trait to that which they had 
been led to expect. This violation of the ex- 
pectancy was unforeseeable to the subjects at 
the time they chose their partner. The mea- 
sure of dissonance reduction was the posttask 
attractiveness of the partner, attractiveness 
being a dimension by which subjects could 
“justify” their choice of partner. Cooper 
found that only subjects who lost because
their partner possessed the expected trait
showed evidence of dissonance reduction.
Those who lost because their partner had the
opposite of the expected trait and those who
had no opportunity to select a different
partner exhibited little or no evidence of
dissonance reduction.

It is apparent that Cooper’s data are in- 
consistent with the position that a discon- 
firmed expectancy is a sufficient condition for 
the arousal of dissonance. Cooper interpreted 
the effects of his foreseeability manipulation to 
mean that a person must feel responsible for 
the consequences of his commitments in order 
for negative consequences to arouse or in- 
crease the magnitude of dissonance. Respon- 
sibility, however, is a broader concept than 
foreseeability, and it suggests at least one 
way in which disconfirmed expectancies could
arouse dissonance, namely, that a person may 
feel more responsible for an expectancy that 
he holds. In general, a person should feel 
more responsible for an expectancy that he 
derives through his own judgmental proc- 
esses than for one that is given to him by a 
highly credible communicator. In Cooper’s 
experiment, the expectancy was given to the 
subject by the experimenter, and, conse- 
quently, the subject should have felt little or 
no responsibility for it. Thus, when the ex- 
pectancy was disconfirmed, it aroused little or 
no dissonance. We suggest that if, on the 
other hand, the subject had been allowed to 
establish his own prediction about whether 
his partner might be overly timid or bold, 
he would have felt responsible for the re- 
sultant expectation and would have experi- 
enced dissonance when that expectation was 
disconfirmed. 

The present experiments were designed to 
allow examination of the interaction between 
responsibility for expectancies, on the one 
hand, and the role of confirmation or discon- 
firmation of those expectancies, on the other, 
as determinants of cognitive dissonance when 
negative consequences occur. 


EXPERIMENT I
Method 


Design Overview 


Subjects were led to believe that they would play 
a game with a partner with the possibility of winning
a large sum of money for a good team performance.
They were told, however, that someone who was too 
timid or braggartly could ruin the partnership. Sub- 
jects were then led to believe that they could choose 
one of two people to be their partner. Each subject 
was shown apparent personality profiles of the two 
people. Half of the subjects were told that the per- 
sonality tests themselves were accurate and that they 
could get a very good picture of what traits their 
partner would possess from the tests themselves (test 
responsible). The other half of the subjects were
told that experiments with the personality tests had 
shown that while the information given in the 
personality profile was not, by itself, sufficient to 
make accurate predictions about a person’s behavior, 
individuals who were sensitive and who worked at 
integrating the information could make very accurate 
predictions about the person’s behavior (subject 
responsible). One profile indicated that the described
person had a tendency to be braggart, and the other 
described a more timid person. After the subjects 
chose which one they wanted for their partner, 
they began playing the game for money. In all cases, 
the partner’s actions caused the subject to lose 
money. However, half of the subjects lost money 
because the partner they picked possessed too much 
of the trait they had expected (confirmed expect- 
ancy), and half lost because the partner possessed 
the opposite trait (violated expectancy). After the 
game, the subjects were asked to indicate how at- 
tracted they were to their partner, and this consti- 
tuted the main dependent measure. A separate set 
of conditions was run to measure how attracted 
subjects would be to their partner if they did not 
have the opportunity to choose her. 


Subjects 


Ninety-seven female subjects were recruited from 
introductory psychology classes at Duke University 
to participate in an experiment entitled “Cooperation 
and Sensitivity.” In all, the data from 90 subjects 
were included in the final analysis since 7 subjects 
expressed some suspicion about the design, and their 
data were eliminated (3 in subject responsible- 
violated expectancy, 2 in subject responsible-con- 
firmed expectancy, 2 in test responsible—confirmed 
expectancy). Subjects were run in groups of 4. 


Procedure 


When the subjects arrived at the experimental 
room, they were asked to take seats at a table which 
had been partitioned so that each subject could not 
see who was sitting on either side of her. After the 
subjects were seated, the experimenter handed each 
girl two copies of the “Worland Personality Inventory,”
which contained a list of 20 adjective pairs.
The subjects were asked to indicate on the 10-point 
scales accompanying each adjective pair the descrip- 
tion that they felt best described themselves and to 
fill out both inventories exactly the same. 

After the subjects had completed the inventories, 
the experimenter explained that the purpose of the
experiment was to test a new theory of intelligence.
The subjects were told that most intelligence tests 
examine only what a person knows, not how he acts 
on this knowledge. It was explained that many psy- 
chologists felt that how one acts on his knowledge 
may be more important than what facts he knows. 
The experiment in which subjects were to partici- 
pate was aimed at demonstrating the importance of 
how one acts on what he knows and what he does 
not know. 

Subjects were told that they would be in a part- 
nership with another person and that to increase 
people’s interest, it would be possible for them 
to win over $5.40 for a good performance. It 
was explained that they would be asked a number 
of multiple-choice questions, and their task was to 
choose an answer to the question and then indicate 
whether or not they thought they had chosen the 
correct answer. It was pointed out to the subjects 
that in many situations in “real life” it is often 
more important that one be able to decide whether 
or not he has made the right decision than actually 
making the correct choice. The reason for the part- 
nership was that in real life it is rare that a decision 
only affects one person and that the experimenters 
were interested in making the study more like a 
real-life situation. Subjects were told that they would 
be answering the questions separately but that the 
performance of their partner and themselves would 
be the deciding factor as to whether they won any 
money. It was explained that if both their partner 
and themselves arrived at the correct answer on the 
question and stated that they were correct, each 
would win 18¢; if one person got the question cor- 
rect and said she was correct, and the other answered 
the question wrong but said that she was wrong, 
each would win 12¢; and if they both got the answer 
wrong and said that they had given incorrect answers,
both members would win 9¢. It was further
pointed out that if either partner made a mistake 
in his answer, neither would win any money. The 
experimenter interjected, “Thus, you can see that 
it doesn’t pay to be too timid or brash in your 
decisions.” 
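
The payoff rule described above can be written out as a small Python function. This is a sketch under one reading of the instructions: a “mistake” is taken to mean that a partner's confidence label did not match the actual correctness of her answer. The function and argument names are hypothetical.

```python
def payoff_cents(a_correct, a_says_correct, b_correct, b_says_correct):
    """Cents won by EACH partner on one question.

    A partner is 'calibrated' when her label matches the actual correctness
    of her answer; any mislabel by either partner pays nothing to both.
    """
    a_calibrated = a_correct == a_says_correct
    b_calibrated = b_correct == b_says_correct
    if not (a_calibrated and b_calibrated):
        return 0
    number_correct = int(a_correct) + int(b_correct)
    return {2: 18, 1: 12, 0: 9}[number_correct]


# Both right and sure of it; one right and one knowingly wrong;
# both knowingly wrong; a too-timid partner; a too-brash partner.
print(payoff_cents(True, True, True, True))
print(payoff_cents(True, True, False, False))
print(payoff_cents(False, False, False, False))
print(payoff_cents(True, True, True, False))
print(payoff_cents(True, True, False, True))
```

Under this reading, a partner who labels correct answers as incorrect (too timid) or incorrect answers as correct (too brash) costs both players the payoff for that question, which is what the experimenter's closing remark emphasizes.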

Subjects were told that the experimenter would 
communicate to them through earphones and that 
when he asked them for their answers, they were 
first to indicate their answer to the question by 
holding up a card and then hold up another card 
indicating whether they felt their answer was correct 
or not. After both partners had answered the ques- 
tion, the experimenter indicated to each how the 
other had answered and the correctness of both 
answers. Thus, the experimenter communicated to 
the subjects through earphones, and the subject 
communicated to the experimenter through use of 
flash cards. The subjects were also told that the 
order of questions would be different for them and 
their partners, so that they would be answering the 
same questions but in different orders. 

After the nature of the game had been explained 
to the subjects, the experimenter told them that 
two of them would be allowed to choose their partners
and that the “choosers” would be randomly
decided by drawing slips of paper from a box. The 
choosers would then be given the personality inven- 
tories of the other two people on which they could 
base their choice. Subjects then drew slips from a 
box, and, in fact, they all drew slips labeled 
“chooser.” After inspecting the slips, the experimenter 
asked two of the subjects to come with him to 
another room. The experimenter seated the two sub- 
jects in another room and told them they were 
choosers and that he would get the personality in- 
ventories of the other two subjects for them. The 
experimenter then went into the original room where 
the two other subjects had waited and told these 
subjects that they were the choosers and that he 
would shortly give them the inventories of the 
other subjects. 


Responsibility Manipulation 


Test responsible. The experimenter randomly desig- 
nated one room of subjects to be in the test respon- 
sible condition. He entered that room and before 
giving the two subjects the inventories, he said: 


The Worland Personality Inventory has been used 
in a number of experiments and has proved to be 
an extremely effective instrument in predicting the 
personality of others. Even though no personality 
test can be perfectly accurate, you can be pretty 
sure that your partner will be much as described 
in the inventory. 


These instructions were designed to place full respon- 
sibility for the expectancy on the experimenter and 
the “personality test.” 


Subject responsible. The two subjects who were chosen to be in the
subject responsible condition were told:


The Worland Personality Inventory has been 
used in a number of experiments, and an interest- 
ing discovery has been made. While the informa- 
tion given in the test itself cannot give an ac- 
curate picture of the personality of the person who 
took the test, it has been found that people are 
socially sensitive, and those who work to try and 
integrate the information provided in the inventory 
can make amazingly accurate predictions about the 
personality of the test taker. Thus, with the infor- 
mation provided in the test and with work on your 
part, you should be able to form an accurate 
description of the other two subjects. 


Emphasis was put on the fact that the subject 
would have to form her own expectancy, and her 
competence to do so was challenged. 


Expectations about Partner 


After this introduction, the subjects were assured 
that they would be able to have the partner they 
chose, even if they both chose the same partner. 
They were admonished not to talk with each other
while making their decisions and were then handed
two completed copies of the Worland Personality
Inventory which had supposedly been filled out by
the other two subjects. Actually, the experimenter
had substituted two standard inventories. Both
inventories gave the description of people who saw
themselves as being fairly intelligent, friendly, capa- 
ble, productive, thoughtful, creative, hardworking, 
happy, and sociable. The ratings for the timid and 
braggart partner differed by only 1 point on each 
of these traits, and the direction of variance was 
randomly chosen. However, the inventory of the 
timid partner indicated that she was generally pas- 
sive, dependent, not confident, introverted, and not 
impulsive. The inventory of the braggart partner, on 
the other hand, indicated that she was generally 
aggressive, independent, extraverted, ambitious, im- 
pulsive, and somewhat distant. 

All subjects received an inventory describing a 
timid person and one describing a braggart person. 
Subjects were given 8 minutes to decide on a part- 
ner, and they indicated their choice to the experi- 
menter by handing him the inventory of the chosen 
partner first and then giving him the inventory of 
the rejected partner.


Partner's Actual Traits


After the subjects had chosen their partner, they
all returned to the original experimental room. The
experimenter again explained the procedure which
was to be followed in answering the questions, and
then each subject was handed a booklet containing
15 multiple-choice questions. The questions were
designed to appear fairly difficult, and a number did
not have any correct answer. (These problems were
identical to those used by Cooper, 1971.) This made
it possible to manipulate the number of questions
the subject and partner answered incorrectly without
creating a great deal of suspicion or making one or
the other look too stupid. Subjects then put on the
earphones, and the experimenter retired to the
control room where he could see all the subjects and
speak to each, individually, through the earphones.
After he determined that the communication system
was in proper working order, he informed the
subjects that they could begin working on the problems.
Subjects were informed after each question how they
had done and how their partner had scored.
In all, the subject was informed that she had
answered in a correct combination on all but two
of the questions, that on one question she had
labeled a correct answer as incorrect, and that on
another she had labeled an incorrect answer as
correct.

Timid partner. If the subject had been randomly 
chosen to have a timid partner, it was made evident 
that her partner tended to be much too cautious. 
The timid partner made seven correct estimates,
seven incorrect estimates in the timid direction
(correct but labeled incorrect), and one incorrect
estimate in the braggart direction (incorrect but
labeled correct).

Braggart partner. If the subject had been assigned
to have a braggart partner, it became evident that 
her partner was too brash and tended to be too sure 
of herself. In all, the braggart made seven correct 
estimates, seven incorrect estimates in the braggart 
direction, and one incorrect estimate in the timid 
direction. 

At the end of the experiment, each subject was 
informed by the experimenter, “I’m sorry, but you 
only won 92¢.” It was clearly evident to the subject
that the reason she did not win more was
because her partner had been too brash or timid. 

The experimenter then entered the room and dis- 
tributed the main dependent measure to the subjects. 
The subjects were told as the experimenter handed 
out the questionnaire that their answers on this ques- 
tionnaire would be very helpful in evaluating the 
sensitivity of the intelligence test and in examining 
what conditions affected it. The questionnaire con- 
tained five questions, each with 31-point scales 
running from “strongly agree” to “strongly disagree.” 
The two questions that were included to measure 
dissonance reduction stated, “I think my partner 
would be an enjoyable person to know” and “I think 
I would like my partner as an individual.” The other 
three questions, which were included as “fillers,” 
asked the subjects how intelligent they thought their 
partner was, how interesting they found the experi- 
ment, and how useful they felt the experiment would 
be to science. 


Design Summary 


The design was a 2 × 2 × 2 factorial. Subjects
were either led to believe that they could base their
expectation about their partner on the personality 
test (test responsible) or that the subject herself 
had to put effort into building up an expectation 
about her partner (subject responsible). Subjects 
chose either a timid (expect timid) or a braggart 
(expect braggart) partner, and it turned out that 
the partner they actually did get either had (con- 
firmed expectancy) or did not have the expected 
trait (violated expectancy). In all cases, the partner 
caused the subject to lose money because she pos- 
sessed too much of either the timid or braggart trait. 
Subjects were asked to rate how much they were 
attracted to their partner, and this measure 
constituted the main dependent measure. 
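
The eight cells of this design can be enumerated mechanically. The following Python sketch is purely illustrative and is not part of the original study; the factor labels come from the summary above, while the variable names are assumptions introduced here.

```python
from itertools import product

# Factor levels as named in the design summary; variable names are illustrative.
RESPONSIBILITY = ("test responsible", "subject responsible")
EXPECTED_TRAIT = ("expect timid", "expect braggart")
OUTCOME = ("confirmed expectancy", "violated expectancy")

# Full crossing of the three two-level factors.
cells = list(product(RESPONSIBILITY, EXPECTED_TRAIT, OUTCOME))

for responsibility, expected, outcome in cells:
    print(f"{responsibility} / {expected} / {outcome}")

print(len(cells))  # number of cells in the 2 x 2 x 2 design
```

Because the content of the expectancy did not affect the results, the analyses reported below collapse over the expected trait, leaving the 2 × 2 crossing of responsibility and outcome.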


Results 


From Cooper’s (1971) personal responsi- 
bility hypothesis, it can be predicted that 
subjects in the test-responsible-violated-
expectancy condition should have experienced
little if any dissonance since they were not 
responsible for the expectancy about their 
partner and could not have foreseen at the 
time of their choice that they would lose 
money because this expectancy would be vio- 
lated. However, subjects in the confirmed
expectancy conditions should have experi-
enced dissonance since they had some respon- 
sibility for their loss of money. Their partner 
did possess the expected trait, and the fact 
that she might cause the subject to lose money 
should have been foreseeable at the time of 
choosing. The prediction that subjects in 
the subject-responsible—violated-expectancy 
condition should have experienced the greatest 
amount of dissonance can be gleaned from 
our responsibility for expectancy hypothesis. 
Not only should dissonance arise from the 
fact that the subject’s chosen partner caused 
her to lose money, but that the partner also 
disconfirmed an expectancy for which the sub- 
ject was responsible. Thus, greatest dissonance 
and attraction for the partner should occur 
in the subject-responsible—violated-expectancy 
cell and least dissonance and attraction should 
be found in the test-responsible-violated-
expectancy condition.


Test of the Hypotheses 


Attraction to the partner served as the mea- 
sure of dissonance reduction, as it was felt 
that the subject could justify her choice by 
increasing the attractiveness of her partner 
(cf. Aronson & Mills, 1959; Cooper, 1971). 
Two questions, asking how likable and enjoy- 
able a person the partner was, served as the 
main dependent measures. Under each ques-
tion was a 31-point scale with 1 representing
the most flattering and 31, the least attrac- 
tive position. Because both questions were felt 
to measure attraction, the scores from them 
were combined for the analyses.4
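
The combined measure can be sketched in a few lines of Python. This is a minimal illustration that assumes the two items are combined by simple summation, as the note to Table 1 describes; the function and constant names are hypothetical and do not come from the original study.

```python
SCALE_MIN, SCALE_MAX = 1, 31  # 31-point scale; 1 = strongly agree


def attraction_score(likable: int, enjoyable: int) -> int:
    """Sum two 31-point ratings; lower totals mean greater attraction."""
    for rating in (likable, enjoyable):
        if not SCALE_MIN <= rating <= SCALE_MAX:
            raise ValueError(f"rating {rating} is outside the 31-point scale")
    return likable + enjoyable


# Hypothetical responses from one subject, for illustration only.
print(attraction_score(10, 12))
```

Under this scoring a lower total indicates greater attraction, which is the direction in which the cell means reported below should be read.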

Subjects expected either a timid or braggart
partner, and some subjects played the game
with a timid partner, others, with a braggart
partner. The content of the expectancy
(whether expect timid or expect braggart) or
the actual traits of the partner (whether timid
or braggart) should not have had any effect
on the arousal of dissonance, and an analysis
of the data revealed no differences in the re-
sults as a function of the content of the
expectancy or the trait of the partner. Thus,
for the sake of clarity and brevity, the re-
sults are presented as a 2 × 2 factorial with
test responsible-subject responsible and con-
firmed expectancy-violated expectancy as the
variables.


4 The results from the likable and enjoyable ques-
tions were very similar, with the only difference being
the fact that differences on the likable question
reached greater levels of significance than did those
obtained on the enjoyable question.


TABLE 1


MEANS AND ANALYSIS OF ATTRACTION SCORES:
EXPERIMENT I


                                Result of expectancy
Responsibility for          Violated        Confirmed
expectancy                  expectancy      expectancy

Subject responsible         19.32a          25.55
  n b                       19              20
Test responsible            37.00           27.46
  n                         25              26


a 1 = highest attraction score; 62 = lowest attraction score.
The attraction scores were computed for each subject by sum-
ming his responses to the questions, “I think I would like my
partner as an individual,” and “I think my partner would be an
enjoyable person to know.” Each question was accompanied
by a 31-point scale running from “strongly agree (1)” to
“strongly disagree (31).”

b Differences in the number of subjects in the test responsible
and subject responsible conditions are due to the fact that
slightly more subjects (five versus two) expressed suspicions
about the design in the subject responsible condition, and due
to faulty planning, the last group of subjects was run as a test
responsible rather than a subject responsible group.

A glance at the results presented in Tables
1 and 2 indicates that both the expectancy
and personal responsibility hypotheses re-
ceived strong support. In the subject respon-
sible conditions, greater attraction to the
partner was found when she violated the sub-
ject’s expectancy than when she possessed the
expected trait, and the opposite trend oc-
curred in the test responsible conditions
(interaction F = 23.30, df = 1/86, p < .001).


As predicted by the expectancy hypothesis,
the greatest attraction to the partner was
found in the subject-responsible-violated-
expectancy condition. The difference between
the attraction scores in this cell and those of
any of the other conditions was significant
beyond the .05 level (subject responsible–con-
firmed expectancy, F = 6.54, df = 1/86; test
responsible–confirmed expectancy, F = 12.41,
df = 1/86; test responsible–violated expect-
ancy, F = 58.54, df = 1/86).


TABLE 2


ANALYSIS OF VARIANCE SUMMARY TABLE:
EXPERIMENT I


Source                               df    MS         F

Responsibility for expectancy (A)     1    …79        35.95*
Result of expectancy (B)              1    60.88       1.03
A × B                                 1    1381.64    23.30*
Error                                86    59.30


*p < .001.

The personal responsibility hypothesis 
(Cooper, 1971) received support from the 
finding that least attraction for the partner was 
exhibited by subjects in the test-responsible— 
violated-expectancy condition. The significant 
difference (F = 19.20, df = 1/86, p < .001) 
between the confirmed and violated expect- 
ancy cells in the test responsible condition 
replicated Cooper’s (1971) finding. Subjects 
in the test-responsible–violated-expectancy
condition were also significantly less attracted
to their partner than were subjects in the two
subject responsible conditions (confirmed ex-
pectancy, F = 24.55, df = 1/86, p < .001;
violated expectancy, F = 58.54, df = 1/86,
p < .001).
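
The F ratios in Table 2 follow from dividing each effect mean square by the error mean square. The short Python sketch below, offered only as an arithmetic check and not as the authors' own procedure, recomputes the two ratios whose mean squares are legible in the table; the function name is an assumption introduced here.

```python
def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F statistic for an ANOVA term: effect mean square over error mean square."""
    if ms_error <= 0:
        raise ValueError("error mean square must be positive")
    return ms_effect / ms_error


MS_ERROR = 59.30  # error mean square from Table 2 (df = 86)

interaction_f = f_ratio(1381.64, MS_ERROR)  # A x B interaction
result_f = f_ratio(60.88, MS_ERROR)         # result of expectancy (B)

print(round(interaction_f, 2), round(result_f, 2))
```

Rounded to two decimals, both ratios agree with the tabled values of 23.30 and 1.03.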


Other Questions 


Besides these two attraction questions, the
subjects were asked to indicate on 31-point
scales (1 = strongly agree; 31 = strongly dis-
agree) their agreement with the following
questions: “I think my partner is intelligent”;
“I feel this experiment may be useful to
science”; “I found this to be an interesting
experiment.” These questions produced no
differences that reached the traditionally ac-
ceptable significance level. The only differ-
ence that approached significance was that
the timid partner was seen as being some-
what more intelligent than the braggart
partner (F = 3.31, df = 1/86, p < .10).


EXPERIMENT II


Following the completion of the first study, 
a second study was conducted with the hope 
of collecting data that would add further 
clarity to the results of the first experiment. 
While postexperimental interviews indicated 
that the subjects in the first study had been 
able to form the desired expectancies about 
their partner from the personality inventories, 
there had been no independent measure of 
these expectancies. It was decided to replicate 
one of the test responsible conditions from 
the first experiment and to obtain an indepen-
dent measure of the expectancy of the partner
formed by the subjects.

It was also decided to obtain a measure of 
how subjects would rate their partner if they 
had no choice in picking her. Presumably, 
there should be very little dissonance aroused 
if the partner was assigned to the subject. 
Cooper (1971) demonstrated that subjects 
who were not allowed to choose their partner 
and who should not have experienced disso-
nance when their partner performed poorly
gave their partner attraction scores similar to 
those given by subjects whose chosen partner 
violated the expectancy given them by the 
experimenter. Translated into the language of 
this experiment, it means that subjects who 
had no choice should rate their partner about 
the same as did subjects in the test-responsi- 
ble-violated-expectancy condition. 

Thus, a second experiment was undertaken 
to provide data on the expectancies held by 
subjects and on the attraction that subjects 
would feel toward a partner who had been 
assigned to them and who caused them to lose 
money. 




Method 
Subjects 


Fifty-eight female subjects were recruited from 
introductory psychology classes at Duke to partici- 
pate in a study entitled “Cooperation and Sensitiv- 
ity.” The data from six subjects were deleted from 
the final analysis because they expressed doubts 
about the true intention of the experiment (four in 
the test-responsible-braggart-confirmed-expectancy 
condition and one each in the no-choice conditions 
of braggart-confirmed expectancy and timid—vio- 
lated expectancy). 


Choice Conditions 


The test responsible conditions of expect timid- 
confirmed expectancy and expect braggart-confirmed 
expectancy were replicated step-by-step from the 
first study. 


No-Choice Conditions 


The instructions to these subjects were exactly 
the same as to those subjects in the other condi- 
tions of the first study, with the following excep- 
tions: (a) The experimenter told all subjects that 
he would assign them a partner, and (b) the sub- 
jects were not told exactly how accurate they could 
expect the inventories to be in describing their part- 
ner, but were told that they could get an idea about
their partner from the inventory.5 Otherwise, the
subjects in these conditions received the same two 
personality profiles and won the same amount of 
money as did subjects in the first study. In all, four 
no-choice conditions were run: expect timid-con-
firmed expectancy, expect timid-violated expectancy,
expect braggart-confirmed expectancy, and expect
braggart-violated expectancy.


Dependent Measures 


In addition to answering the five questions that
the subjects in the first study answered, the subjects 
in this experiment were given two additional ques- 
tionnaires. Immediately after they had either chosen 
or were assigned a partner, they were given a ques- 
tionnaire that stated: “Check the box which best 
describes your anticipation about your partner.” 
Below this were two 31-point scales with the end 
points of the first labeled “dominant” and “sub- 
missive” and the end points of the second reading 
“timid” and “braggart.” From the answers to these 
questions, the success of the manipulation of expec-
tancy could be gauged. The second questionnaire was
given to the subjects immediately after they had
completed the experiment. Subjects were asked the
single question, “How accurate do you believe the 
Worland Personality Inventory was in describing 
your partner?” and a 31-point scale running from 
“very accurate” to “not at all accurate” followed. 
The purpose of this question was to see if the actual 
performance of their partner confirmed or violated 
the subjects’ expectancies.


Results


Expectancy


It was hoped that subjects in the expect
timid conditions would predict that their
partner would be relatively submissive and
timid and that subjects in the expect braggart
conditions would anticipate a dominant and
braggart partner. The results indicated that
the timid–braggart manipulation was success-
ful. Subjects in the expect timid conditions
anticipated a timid and submissive partner.
There were no significant differences between
any of the expect timid conditions, as the
means on the timid question ranged between
10.00 and 11.57 (1 = timid, 31 = braggart),
and on the submissive question, the means
from the three expect timid conditions fell be-
tween 7.38 and 10.12 (1 = submissive, 31 =
dominant). Subjects in the expect braggart
conditions looked for a braggart (mean range
between 20.12 and 22.08) and dominant
(mean range between 21.29 and 23.75) part-
ner. There were no differences that approached
significance between the data obtained in the
three expect braggart conditions. However,
analyses of variance performed on the results
yielded highly significant differences between
the expectancies of subjects in the expect
timid and expect braggart conditions (timid–
braggart, F = 156.26, df = 1/46, p < .001;
submissive-dominant, F = 357.82, df = 1/46,
p < .001).


5 Instructions given subjects in the no-choice con-
dition were originally the same as those given sub-
jects in the subject responsible condition. However,
during pretesting, no-choice subjects questioned the
elaborate instructions since they could not use the
data to base their choice of partner. Thus, the
compromise instructions were employed.


Accuracy of the Personality Inventory


After the subjects had completed the game 
with their partner, they were asked to indicate 
on a 31-point scale (1 = very accurate, 31 =
not at all accurate) their response to the ques- 
tion “How accurate do you believe the Wor- 
land Personality Inventory was in describing 
your partner?” It was predicted that subjects 
in the confirmed expectancy conditions would 
see the inventory as rather accurate and those 
in the violated expectancy conditions would 
rate the personality inventory as inaccurate. 
The results obtained on this question con- 
firmed the prediction, since the subjects’ re- 
sponses in the confirmed expectancy conditions 
averaged around 9.00, and the responses from 
the subjects in the violated expectancy condi- 
tions averaged 21.44. This difference was 
highly significant (F = 77.95, df = 1/46, p < 
.001), and there were no significant differences 
in the responses of the subjects in the test re- 
sponsible and no-choice conditions. Thus, the 
subjects were correctly able to perceive the 
brashness or timidness of their partner dur- 
ing the game. 


Attractiveness of Partner 


The interest in the attractiveness scores was 
to see how the subjects in the no-choice con- 
ditions, who supposedly were not experiencing 
dissonance, would rate their partners, as op- 
posed to the subjects who were allowed to 
choose their partners. As with the attractive- 
ness scores in the first experiment, data from 
the likable and enjoyable questions were com- 
bined. As can be seen from the data presented
in Tables 3 and 4, there were no differences
in the ratings of the subjects within the no-
choice conditions. However, the subjects in
the test responsible conditions rated their
partner significantly more attractive than did
subjects in the no-choice conditions (F =
14.05, df = 1/46, p < .001). It should be
noted that the scores from the test responsible
conditions obtained in this study are similar to
those obtained in the first study for the cor-
responding conditions. Further, there is strik-
ing similarity between the results from the
no-choice conditions and those from the
test-responsible–violated-expectancy condition
from the first study.


TABLE 3


MEANS OF ATTRACTION SCORES:
EXPERIMENT II


Condition                                  Attraction

No choice
  Expect timid-confirmed expectancy          35.00
  Expect timid-violated expectancy           33.62
  Expect braggart-confirmed expectancy       34.88
  Expect braggart-violated expectancy        34.25
  [illegible]                                 7
Test responsible
  Expect timid-confirmed expectancy          24.28
  Expect braggart-confirmed expectancy       23.09
  n                                          [illegible]


Note. 1 = highest attraction score; 62 = lowest attraction
score. The attraction scores were computed for each subject by
summing his responses to the questions “I think I would like
my partner as an individual,” and “I think my partner would
be an enjoyable person to know.” Each question was accom-
panied by a 31-point scale running from “strongly agree (1)”
to “strongly disagree (31).”


Other Questions 


As in the first study, there were no differ- 
ences in the subjects’ responses to the ques- 
tions about the partner’s intelligence or about 
the interest or usefulness of the study. 


Discussion 


There were two potential sources of disso- 
nance present in these studies. First, disso- 
nance could arise when the partner chosen by 
the subject caused her to lose money. The sec- 
ond potential source involved the violation of 
the expectancy; the subject expected the part- 
ner to act in one manner, and she, in fact,
behaved in an opposite manner. Based on
Cooper’s (1971) personal responsibility hy-
pothesis and on the present responsibility for
expectancy hypothesis, it was predicted that
subjects in the subject-responsible-violated-
expectancy condition would suffer the great-
est dissonance. Their chosen partner cost them
money and violated an expectancy for which
they were responsible. Subjects in the two
confirmed expectancy conditions should have
experienced less dissonance than would sub-
ject-responsible—violated-expectancy subjects. 
The confirmed expectancy subjects lost money 
due to their partner’s action, and this loss 
should have been foreseeable, since the partner 
possessed the expected personality trait. Sub- 
jects in the test-responsible—violated-expect- 
ancy cells should have experienced little, if 
any, dissonance, despite the facts that they 
lost money and their chosen partner discon- 
firmed their expectancy. These subjects were
neither responsible for the expectancies they 
held about the partner, nor was the violation 
of these expectancies and subsequent loss of 
money foreseeable at the time they made 
their choice of partners. 

The results of the study offered strong sup- 
port for all the predictions, since the subjects 
in the subject-responsible—violated-expectancy 
condition were significantly more attracted to 
their partner than were subjects in any of the 
other conditions; the subjects in the test- 
responsible—violated-expectancy condition were 
significantly less attracted to their partner 
than were subjects in the other cells.

TABLE 4


ANALYSIS OF VARIANCE SUMMARY TABLE:
EXPERIMENT II


Source                            df    MS        F

Treatments                         5    275.57     3.10*
Error                             46     89.01
Test responsible vs. no choice     1    437.65    14.05**


*p < .05.
**p < .001.


Bramel’s (1968) reasoning may be em-
ployed to explain why the violation of an ex-
pectancy under one set of instructions should
lead to little, if any, dissonance-reducing be-
havior, and why the violation of the same
expectancy under another set of instructions
should lead to such strong attempts at disso-
nance reduction. Bramel suggested that the 
violation of an expectancy, regardless of its 
content, may cast serious doubt on the indi- 
vidual’s capability to make predictions about 
his world and his sensitivity to events going 
on around him. This threat to the person’s
competence results in dissonance and an at- 
tempt to regain self-esteem. Following this 
line of reasoning, it may be predicted that 
violations of expectancies that result in 
threats to the individual’s competence and 
self-concept will result in dissonance. When an 
individual feels responsible for an expectancy 
that he established on his own and this ex- 
pectancy is later violated, the disconfirmation 
should directly threaten the individual’s self- 
esteem, since it calls into question his sensitiv- 
ity and ability to arrive at predictions. The 
instructions given to the subject in the sub- 
ject responsible conditions were designed to 
make her feel responsible for the expectancy 
and to involve her ego in the formation of the 
expectancy. Since it was made clear to the sub- 
ject that she had the responsibility for form- 
ing the expectancy, the violation of the ex- 
pectancy should have cast doubt on her abil- 
ity to predict how other people will act. 
According to Bramel, this threat to the sub- 
ject’s competency to predict how others will 
act and her sensitivity to others should have 
resulted in dissonance. 

The instructions in the test responsible con- 
ditions relieved the subject of all responsibil- 
ity for forming the expectancy about her 
partner. The experimenter and the inventory 
were responsible for the expectancy. The sub-
ject’s ability to predict and her sensitivity
were not on the line in forming this expect- 
ancy. Thus, the violation of the expectancy 
in the test responsible condition should not 
have threatened the subject’s self-concept, and 
little dissonance should have followed the vio-
lation.

The results of this study suggest that disso- 
nance will result from the disconfirmation of 
an expectancy when the subject feels responsi- 
ble for that expectancy, but not when the sub- 
ject feels no responsibility for that expect- 
ancy. The interpretation based on Bramel’s 
(1968) self-concept hypothesis, that the vio-
lation of the former expectancy threatens the
subject’s self-esteem, while the violation of 
the latter does not, explains why this differ- 
ence should occur. However, before this ex- 
planation can be accepted, there are some 
questions about the design of the study that 
need to be dealt with. 

The first involves the strength of the ex-
pectancies held by the subjects in the various
conditions. It might be argued that the sub-
jects in the four conditions of the first study
held expectancies of different strengths, and
that this difference in the expectancies could
account for the obtained data. While no inde-
pendent measures were taken of the subjects’
expectancies in the first experiment, there are
a number of points that argue against this
strength of expectancy explanation. First, it
is difficult to see how this explanation could
account for the finding of more dissonance-
reducing behavior in the subject-responsible–
violated-expectancy condition than in the con-
firmed expectancy conditions but less in the
test-responsible-violated-expectancy condition.
Second, to explain the differences in the test
responsible and subject responsible condi-
tions, it would be necessary for the stronger
expectancies to be formed by subjects in the
subject responsible conditions. It is difficult
to understand how this could occur, since the
subjects in the test responsible conditions
were given a “very accurate” and supposedly
widely used personality inventory which told
them what to expect. Subjects in the subject
responsible conditions had to work with less
accurate information on a task with which
they had little familiarity to arrive at the
expectancy. If anything, one would expect
stronger expectancies to be formed by sub-
jects in the former condition. Last, the data
from the second experiment indicated that
expectancies formed in the confirmed and
violated conditions were similar, as was the
strength of the expectancies formed by the
subjects in the no-choice and test responsible
conditions. Thus, a prediction based simply
on differences in the strength of expectancies
could not account for the obtained results, nor
would it fit the experimental procedure.

A second and somewhat more pesky prob-
lem involves determining how a person expe-
riencing no dissonance would behave. The
[...] of means in the first study supported
the predictions. However, in order to say
that all results are directly in line with the
personal responsibility (Cooper, 1971) and
self-concept (Bramel, 1968) explanations, it
[would be] convenient to say that the subjects
in the test-responsible-violated-expectancy
condition were not experiencing dissonance.
From the present results, it is impossible
to posit this without reservation since there
was no control [...] experiment, and [...]
slightly different from those given to the test
responsible and subject responsible subjects.
There are, however, two factors that would
indicate that test-responsible-violated-expect-
ancy subjects were responding as if they were
not experiencing dissonance. First, these sub-
jects gave responses very similar to those
given by the subjects in the no-choice cells,
who should not have been experiencing
dissonance. While the instructions to the no-
choice subjects were slightly different, and it
is difficult to predict how this difference might
have affected their responses, it was demon-
strated that the expectancies held by these
no-choice subjects were similar to those held
by test responsible subjects. This finding
would suggest that a comparison between the
no-choice and the test responsible subjects
[does] have some meaning. The second point
[...] that subjects in the test-respon-
sible-violated-expectancy condition were not
experiencing dissonance comes from Cooper’s
(1971) study. In this study, the experimental
and control subjects were given the same in-
structions with the only difference between
the two groups being that the control sub-
jects were assigned a partner, whereas the
experimental subjects were allowed the free-
dom to [...] this expectancy in the experimental
conditions gave responses similar to those ob-
tained from the control subjects.

Because of the lack of firm control data,
the possibility arises that test-responsible-
violated-expectancy subjects became less at-
tracted to their partner than would a control
group of subjects. Possibly, these test-respon- 
sible-violated-expectancy subjects felt that 
their partner had “faked” the scores on her 
personality inventory so as to appear more 
attractive. This alternative, however, is un-
likely. First, if the subjects in this condition
thought their partner was somehow cheating, 
they would be expected to report the experi- 
ment as worthless and having little scientific 
value. There were no differences in the re-
sponses to questions about the interest and
scientific worth of the experiment given by
the subjects in any of the conditions of the 
experiment. Second, if the partner were cheat- 
ing on her scores, it would be expected that 
she should try to present herself as being as 
attractive as possible. Neither the timid nor
braggart inventory presented a particularly 
attractive person. Third, the question of why 
the subjects in the subject-responsible-vio-
lated-expectancy condition rated their partner
so attractive would need to be explained. It is
possible that they did not feel as cheated by 
the partner, but why should their partner 
receive the highest attraction scores? These 
three points seem to argue against the possi- 
bility that the subjects in the test-responsible— 
violated-expectancy condition were any more 
negative toward their partner than subjects 
experiencing no dissonance would have been.

In all, the results of this experiment suggest 
two points about what situations are likely to 
arouse dissonance. First, they offer additional
support for Cooper’s (1971) hypothesis that 
some feeling of personal responsibility for 
the discrepant belief or behavior is necessary 
if dissonance is to occur. Subjects who were 
personally responsible for their choice of part- 
ners or for the expectancy about her evi- 
denced dissonance-reducing behavior, while
subjects who had little or no responsibility
for their partner or for the expectancy be-
haved as if they were not experiencing disso- 
nance. 

This experiment also suggests that disso- 
nance will follow the disconfirmation of an 
expectancy for which the individual is per- 
sonally responsible, but will not result when 
the subject has no responsibility for the ex- 
pectancy. Bramel’s (1968) idea that disso- 
nance will result from the violation of an
expectancy that casts the person’s predictive
competence and sensitivity in doubt ade-
quately explains this finding. The violation of
an expectancy for which the individual is re-
sponsible should threaten his self-esteem,
while the violation of an expectancy for which
he has no responsibility should not.

According to the responsibility for expec- 
tancy hypothesis, dissonance should occur 
whether the result of the violated expectancy 
is good or bad. The simple violation should be 
sufficient to call into question the predic- 
tor’s decision-making ability. In the present
study, a negative consequence always followed
the violation of the expectancy. Even though
this negative consequence also followed the
confirmation of expectancies and should not
have affected the form of the results, future
research might vary the responsibility variable
and have the violation of expectancy followed
by a positive outcome.

A second researchable question suggested
by this expectancy hypothesis involves the
temptation to abdicate or escape from respon-
sibility in order to avoid dissonance arousal.
Dissonance is an uncomfortable state of affairs
and one that most individuals would like to
avoid. If an individual feels a need to build
an expectancy about a future event and can
see that this expectancy might be discon-
firmed, he runs the risk of experiencing
dissonance if he develops the expectancy on
his own. It could be predicted that in such a
situation he would seek out an ally to assume
some of the responsibility for the expectancy.
It is interesting that if the individual is look-
ing merely to diffuse responsibility for the
expectancy, the person whose help in forming
the expectancy is sought need not be any
more of an expert than the individual or have
any additional knowledge on the topic than
the individual himself.


It was hypothesized that relative to persons with chronic high self-esteem,
individuals with chronic low self-esteem would be made uncomfortable by 
success because of its inconsistency with their negative self-appraisal and that, 
consequently, further success would be avoided. However, it was thought that 
this would be true only in the case of low self-esteem persons who were 
convinced of their low status and who were made to feel personally responsible 
for a success experience. This reasoning was confirmed. After a self-produced
success, only subjects certain of their low self-esteem failed to show any 
improvement in their subsequent performance; subjects uncertain of their 
low self-esteem improved significantly following a self-produced success. If 
the success was due to “luck” rather than self-produced, both uncertain and 
certain low self-esteem subjects showed significant improvement. This was 
especially true for subjects who were lucky and certain of their low self-esteem— 
they improved more than any other group including the four high self-esteem 
groups who also improved significantly. The results are discussed in terms of 
their implications for enhancing the self-appraisals of individuals with low self-
esteem.


Aronson and Carlsmith (1962) found that 
subsequent to an unexpected success, subjects 
produced a failure, presumably in an attempt 
to minimize the extent of their success. They 
explained these findings in terms of cognitive 
consistency theory, arguing that unexpected 
or incongruent outcomes are shunned in order 
to maintain a consistent conception of one- 
self and the world. Hence, among persons who 
expect failure, success is rejected because it 
constitutes an inconsistency. 

Since Aronson and Carlsmith’s (1962) 
study, numerous attempts have been made to 
replicate the rejection-of-success phenomenon 
(e.g., Brock, Edelman, Edwards, & Schuck,
1965; Cottrell, 1965; Lowin & Epstein, 1965; 
Silverman, 1964; Ward & Sandvold, 1963). In 
some cases, the self-enhancing properties of an 
unexpected success have proved stronger than
the need to maintain consistency. Instead of 
rejecting the unexpected success, failure-ex- 
pectancy subjects avidly embraced success. 
Indeed, from the standpoint of reinforcement 
theory, one would predict this outcome: indi- 
viduals deprived of success should desire it all 
the more intensely. The failing student, for 
example, should cherish a grade of “A,” while 
the honor roll student might hardly notice it. 
However, in other cases, failure-expectancy 
subjects have rejected success, thereby main- 
taining consistency at the expense of positive 
outcomes. 

Since consistency and self-enhancement obviously
conflict where unexpected success is concerned,
researchers have repeatedly asked, “What determines
which of these motives will be dominant over the
other?” Most experimenters have looked to differential stimulus
conditions for the answer. However, it seems 
equally plausible to consider the possibility of 
differential psychological states among the 
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jects who denied and protested the experi- 


veridical feedback on a personality test) pro- 
ceeded to compete more vigorously in the 
experimental task. Low self-esteem subjects 
who accepted the induction behaved in ac- 
cord with the consistency principle. Similarly, 
Coopersmith (1967) delineated two distinct 
behavior patterns among low self-esteem boys, 
one indicative of the self-enhancement effect 
and the other reflecting the consistency princi- 
ple. Some of the boys who held themselves in 
low esteem were nonetheless held in high es- 
teem by their peers and teachers. The overt 
behavior of these boys was marked by high 
achievement and a strong desire for improve- 
ment. It appeared that the low self-appraisal 
of these boys whetted their appetite for praise 
and success. On the other hand, boys whose 
low self-ratings were matched by low ratings
from others did not strive to achieve and
seemed to be locked into a self-perception of
failure. Apparently, these individuals had
given up all hope of improvement.

It was hypothesized that these chronic low 
self-esteem subjects differed as to the cer- 
tainty of their self-appraisal and that perhaps 
certainty about a low self-appraisal is a crucial 
factor in determining whether the person seeks
self-enhancement or attempts to maintain consistency.
Once the individual has acquired a
secure and stable self-appraisal, he may direct 
his energy toward maintaining the status quo. 
On the other hand, the individual who is un- 
certain and unconvinced of the validity of his 
self-appraisal may be motivated to seek self- 
enhancement both for its desirability per se 
and because it provides a means for reducing 
uncertainty in a favorable direction (cf. Jones 
& Gerard, 1967, pp. 124-125). 

Apparently, then, the uncertain low self- 
esteem person is free to seek out and accept 
esteem-enhancing events such as success. But 
what about the person with chronic low self- 
esteem who is convinced of his lowly status? 
Must he forever run in the vicious circle of 
seeking outcomes consistent with his negative 
self-appraisal which in turn produces further 
validation of his lowly self-appraisal? Or are 
there conditions under which he can tolerate 
and even accept “inconsistent” success? Mettee
(1971) demonstrated just such a condition.
He found that “failure” subjects accepted
an unexpected success if it was guaranteed to 
be transient and hence was irrelevant to their 
long-term self-appraisal. A related type of 
“inconsistent” success which may also be 
irrelevant to a person’s chronic negative self- 
appraisal and thus may be acceptable to him 
is the “lucky break” or windfall. Although it 
is objectively inconsistent and unexpected, a 
stroke of luck may not violate the subjective 
need for consistency, because it is a chance 
event which the individual had no part in 
producing. Because a “lucky” success is not 
self-determined, it should not be relevant to
a chronic failure’s self-percept, and hence 
should not conflict with his low self-appraisal, 
regardless of how certain he is of its validity. 
The present study combined these two lines 
of reasoning concerning certainty of self- 
appraisal and perceived self-responsibility for 
events that violate the self-appraisal. Indi- 
viduals with high and low self-esteem were 
evaluated to determine the amount of cer- 
tainty attached to their self-appraisal. They 
then performed a task in which the outcome 
was either produced by their own effort or 
determined by chance. Halfway through the 
task, all individuals were led to believe that 
their performance to that point was highly 
successful. It was predicted that subjects’ 
reactions to the success would depend not 
only on their self-appraisal, but also on the 
amount of certainty invested in that appraisal 
and the perceived self-responsibility for the 
success outcome. Low self-esteem certain indi- 
viduals should reject or minimize a success 
for which they perceive themselves responsi- 
ble; however, they may accept a success that 
is due to a windfall or a “lucky break.” In 
other words, the consistency needs accompanying
a high level of certainty should be
triggered only by a self-determined success. 
On the other hand, the low self-esteem uncer- 
tain person should accept both the self- 
determined and the lucky success since he is 
unconvinced of his low self-appraisal and 
should be relatively free to seek the self- 
enhancement he desperately needs. The im- 
pact of the self-determined success, however, 
should be greater than that of the lucky 
success since it is not only a positive outcome
but also provides information pertinent to 
the self-evaluation—information that should 
reduce uncertainty in a positive direction. 

For the high self-esteem person, both the 
consistency and self-enhancement hypotheses 
predict the same behavior: acceptance of 
success. Since they are accustomed to success, 
they will probably be rather blasé about posi- 
tive outcomes and should find both the self- 
determined and the lucky success moderately 
attractive. High self-esteem uncertain individ- 
uals in particular should welcome success, 
since it would enable them to fortify their 
tenuous hold on a high status position and 
would substantiate their precarious high self- 
appraisal. 


METHOD 
Overview 


College-age females were asked to complete a ques- 
tionnaire concerning “opinions and attitudes of 
women” as part of a project on the psychology of 
women. The questionnaire, which consisted of the 
two primary self-esteem subscales of the California 
Psychological Inventory, was used to measure both 
self-esteem and certainty. Subjects whose scores 
fell in the upper and lower third of the scoring 
distribution were designated high and low self- 
esteem, respectively. High- and low-certainty groups 
were formed within the self-esteem categories on the 
basis of how “certain” the subjects felt about their 
responses to the self-esteem items. These girls were 
contacted and asked to participate as a control 
population in a study of drug side effects. The experi- 
mental task consisted of 20 trials, 1 minute in dura- 
tion, during which the subject matched geometric 
figures on a display board. At the halfway point 
(i.e., after 10 trials), all subjects were given success
feedback. From the outset, half of the subjects
believed that their scores were due wholly to their 
skill and performance output (skill condition); the 
others believed that their scores were the result of 
chance or luck and did not reflect their own per- 
formance. Performance on the second block of trials 
constituted a measure of their tendency to accept 
or reject success. 

Thus, the basic design was a 2 × 2 × 2 factorial,
with two levels each of self-esteem, certainty, and 
personal responsibility for success. 
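
The factorial layout can be sketched as follows. This is only an illustration: the factor and level names are labels chosen here, and the error-degrees-of-freedom formula assumes a single covariate (first-half score), which is consistent with the 72 subjects reported below and the 63 within-error degrees of freedom in Table 1.

```python
from itertools import product

# Hypothetical labels for the three two-level factors described in the text.
FACTORS = {
    "self_esteem": ("high", "low"),
    "certainty": ("certain", "uncertain"),
    "responsibility": ("skill", "luck"),
}


def design_cells(factors=FACTORS):
    """Return every combination of factor levels as a list of dicts."""
    names = list(factors)
    return [dict(zip(names, levels)) for levels in product(*factors.values())]


def ancova_error_df(n_subjects, n_cells, n_covariates=1):
    """Within-error df for a between-subjects ANCOVA (assumed formula)."""
    return n_subjects - n_cells - n_covariates


cells = design_cells()
error_df = ancova_error_df(n_subjects=72, n_cells=len(cells))
```

With eight cells and one covariate, 72 subjects leave 63 error degrees of freedom.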


Subjects 


Subjects were females, ages 18-25, enrolled in 
college summer school sessions. Questionnaires were 
handed out in classrooms; subjects scoring in the 
upper and lower third of the self-esteem distribution 
were contacted by telephone and solicited for their 
participation in a study of the side effects of medication
prescribed by psychiatrists for their patients.
The girls were offered $1.50 for their participation. 
Subjects were told that they were selected at random 
from the class enrollment lists. Seventy-two subjects 
were obtained in this way; refusal rate was 
approximately 10%. 


Measures 


A questionnaire consisting of the 81 items com- 
prising the two primary self-esteem subscales (self- 
acceptance and social presence) of the California 
Psychological Inventory was used to assess chronic 
level of self-esteem. Level of certainty was also 
assessed using this questionnaire. After answering 
each true-false question, the subjects were asked to 
indicate their level of certainty about their response 
by circling one of three possibilities: “C” was certain;
“SD” was some doubts; and “U” was uncertain
about whether the statement was in fact true (false) 
of them. Scores were assigned by allotting 2 points 
for each C response, 1 point for each SD response, 
and no points for a U response. Post hoc analysis 
showed that approximately the same number of posi- 
tive and negative endorsements were assigned to each 
category, for example, 50% of the responses scored 
“certain” were positive self-esteem responses, and 
50% were negative responses. The upper and lower 
thirds of the self-esteem distribution were designated 
high self-esteem and low self-esteem, respectively. 
Within each self-esteem category, certainty scores 
were dichotomized, and certain and uncertain groups 
were formed. Splitting the certainty scores within 
each self-esteem distribution resulted in the same 
certainty median for both self-esteem groups. A cor- 
relation coefficient calculated on self-esteem and 
certainty scores proved to be nonsignificant (−.13),
thus demonstrating the independence of the two
indexes (a scatter plot ruled out the possibility of a 
curvilinear relationship). Moreover, the mean self- 
esteem scores of the low self-esteem/certain and low 
self-esteem/uncertain groups were the same; this was 
also true of the high self-esteem groups (i.e., there
was no relationship between certainty and esteem 
within the low self-esteem and high self-esteem cells). 
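
The scoring and grouping rules described above can be sketched as follows. The point values (2, 1, 0) and the upper/lower-third and median splits come from the text; the function names, the handling of ties at the median, and the integer-division definition of a "third" are assumptions made for illustration.

```python
from statistics import median

# Points per certainty rating, as described in the text.
CERTAINTY_POINTS = {"C": 2, "SD": 1, "U": 0}


def certainty_score(ratings):
    """Sum the certainty points over one subject's item ratings."""
    return sum(CERTAINTY_POINTS[r] for r in ratings)


def esteem_groups(esteem):
    """Label the lower third 'low' and the upper third 'high'; drop the middle."""
    ranked = sorted(esteem, key=esteem.get)
    third = len(ranked) // 3
    groups = {s: "low" for s in ranked[:third]}
    groups.update({s: "high" for s in ranked[len(ranked) - third:]})
    return groups


def certainty_groups(certainty):
    """Median split; scores at the median go to 'uncertain' (an assumption)."""
    cut = median(certainty.values())
    return {s: "certain" if v > cut else "uncertain" for s, v in certainty.items()}
```

Because certainty was dichotomized within each self-esteem category, `certainty_groups` would be called once for the high group and once for the low group rather than on the pooled sample.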


Procedure 


The self-esteem and certainty scores of each subject 
were unknown to the experimenter. Instead, she was 
given a list of subjects to be contacted and the
experimental condition (skill or luck) to which they
had been randomly assigned. Thus, although the
experimenter knew the basic hypotheses of the ex- 
periment, she was blind to the self-esteem and cer- 
tainty levels of each subject during the session. 
Subjects were told that the session was concerned 
with determining base-line rates for performance on 
a motor task. The task consisted of matching geo- 
metric figures; the figures were represented on a dis- 
play board, and the subject was given a deck of 
cards that duplicated the figures on the board. There
were 49 figures, varying along dimensions of color,
size, shape, and pattern. The task consisted of 20
trials, each 1 minute long, in which the subjects were
to match as many cards as possible. After the task 
was described,³ the subjects were given either the
skill or luck manipulation as follows. 


[Skill] As you can imagine, this task involves a 
lot of basic skills—visual-motor coordination, per- 
ceptual ability, quick reaction time, and some 
concept-formation ability. These are the raw ele- 
ments for many different kinds of operations; 
actually, almost any task you can think of requires 
some of them, whether the task is a simple motor 
operation or an intellectual one. Any deficits in 
these fundamental operations would cause severe 
impairment in everyday functioning and would 
probably be immediately apparent. For example
. . . [the experimenter gave several examples of
various degrees of impairment in order to make 
this manipulation equivalent in length to the luck 
manipulation]. People who are proficient in this 
task are probably also successful in most of the 
motor tasks and intellectual operations that they 
undertake. Indeed, the skills which you have built
up in doing everyday activities will probably help 
you to be successful on this task, since it taps 
them directly. If you do well, it is probably be- 
cause these basic skills are so highly developed. 

Besides your raw performance, we’re also interested 
in your feelings while you were doing the task, 
so there’ll be a short questionnaire for you after 
you finish. 

[Luck] As you can imagine, this task utilizes 
several basic skills. However, since the drugs we'll 
be testing later are usually used with a psychiatric 
patient population, there will be things other than 
the drugs affecting performance. As I explained 
earlier, so far, there is no way to separate out 
what is due to the drug’s side effects and what is 
the result of psychopathology. What I need is 
some kind of crossover group—for example, a 
group of normal people who take the medication 
before performing or a group of patients who are 
taken off medication before performing. Neither 
of these groups was possible, so I decided instead 
to use a group of normal people for whom I 
simulated a “psychiatric-patient” condition. You 
will be a member of that group. Psychiatric pa- 
tients are characterized by a different attitude 
about things than normal people, regardless of their 
specific diagnosis. I don’t know if it’s something 
that builds up while they’re hospitalized or what, 
but they often have a very fatalistic attitude 
toward things. They tend to feel as if everything 
is predestined or due to chance; they view things 
as inevitable and have very little sense of them- 
selves or their abilities in working on a task. An 
attitude like this is going to affect their perform- 
ance on the task, but it’s hard to predict exactly 
how. We think that this attitude difference is an
important one to study, so I’d like you to be performing
under those same contingencies that psychiatric
patients assume for themselves; that is,

³ A full description of the task will be supplied

that your score will be determined by luck or fate 
and is not really the direct result of your performance.
Here’s how we're going to do this: at
the end of each trial, you will pick up two slips 
from the jar. Each slip will have the name of a 
dimension written on it, like large or yellow. We 
will use those dimensions, which you picked out 
at random, to assign penalties or bonus points to 
your score. When I count the cards you have 
matched, I will only count those which contain the 
first dimension you picked—if it were yellow, I’d 
ignore all the blue and red ones you’d matched. 
For the second slip you picked, I will be assigning 
you bonus points from a random number table. 
You can get 1, 2, or 3 bonus points. I’m just reading
down this list of random numbers and using
the ones, twos, and threes as they appear. If the 
second slip you had picked was “triangle,” for 
example, I would give you bonus points for all 
the cards you had matched which were both yellow
and triangle. You don’t have to remember all the 
details, but the point is that your score is being 
determined by luck: how the deck happens to be 
shuffled on a particular trial, what two slips you 
happen to pick, and what’s happening on the 
random-number table. The more cards you have 
on the board, the greater chance you have of 
making luck pay off, but practice or training won’t 
help you since so much of your score depends on 
luck. 

Obviously, your scores won’t be telling me any- 
thing about the way you performed, since they are 
numbers that are determined by chance. What I 
am interested in is your experience of performing 
this task—what did it feel like to you. So there'll
be a short questionnaire when you’re done. 


All subjects were then told that there would be 
three practice trials so they could “familiarize them- 
selves with the movements and get used to being 
timed.” At this point, the subjects were also fore- 
warned about the rest period midway through the 
task and given a rationale for it. The experimenter 
explained that it would offset fatigue. Subjects then 
completed the practice trials and the first 10 trials 
of the task. For all subjects, the experimenter 
recorded the number of cards that the subject had 
actually matched on that trial. For subjects in the 
luck condition, the experimenter merely pretended 
to consult the random-number table, etc.; however, 
the number actually recorded was the number of 
cards matched during the trial. At the end of 10 
trials, the subjects were told to take a break. While 
counting up the last trial scores, the experimenter 
casually noted that the subject seemed to be doing 
quite well; after the subject responded, indicating a 
desire to know her score, the experimenter said, 
“Let's see . . . I guess your average would come out 
to about .. . let me add it up. . . ” The score that 
the experimenter reported was actually 7 points 
higher (X + 7) than the subject’s true average (X);
X was also the score reported as being what most 
people averaged. She continued the feedback by saying,
“Very few people do that well as a 10-trial
average; usually people average 7 or 8 points lower
than that. A few people do as badly as (X − 7) per
trial. But, as I say, your score of (X + 7) is actually
very good; very few people get that high an aver- 
age.” For the luck group, the format of the feedback 
was the same except that “you seem to be quite 
lucky” was substituted for “you seem to be doing 
quite well.” Also, the sentence “You’ve been picking 
good dimensions, and the random-number table is 
working in your favor” was added to reinstate the 
notion of nonresponsibility for the success. 
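
The arithmetic behind the bogus feedback can be sketched as follows. The 7-point offset and the three reported figures (X + 7, X, and X − 7) come from the procedure above; the function name and dictionary keys are hypothetical.

```python
def bogus_feedback(trial_scores, inflation=7):
    """Compute the three figures quoted to the subject after the first block.

    trial_scores: cards actually matched on each of the first 10 trials.
    inflation: points added to the true average (7 in the procedure above).
    """
    true_avg = sum(trial_scores) / len(trial_scores)
    return {
        "reported_own": true_avg + inflation,   # quoted as the subject's average
        "reported_typical": true_avg,           # quoted as what most people average
        "reported_low": true_avg - inflation,   # quoted as a poor average
    }


feedback = bogus_feedback([28] * 10)
```

A subject whose true average was 28 would thus be told 35, in line with the pilot finding that the inflation brought most averages to approximately 35 of a possible 49.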

The subjects’ scores were each elevated by 7 points 
in order to heighten the impact of the success and 
bolster the subjects’ belief that their scores were quite 
high. The maximum score possible, of course, was 49, 
the number of spaces on the board. It was estab- 
lished during pilot testing that the addition of 7 
points brought most subjects’ average score to ap- 
proximately 35, close enough to the objective ceiling 
to seem like an accomplishment, but still low enough 
to allay suspicions. Pilot testing also showed that the 
subject’s score range on the first 10 trials was well 
over 7 points, so that it would be difficult for a 
subject to make an estimate of her own average to 
compare with the bogus feedback. Dependence on 
bogus feedback was further strengthened by time 
pressure and the large number of unfilled spaces on 
the board which prevented the subjects from calculating
their own scores via counting the cards
remaining in their hands or the unfilled spaces on the
board. To make this calculation even more difficult,
the cards were removed from the subject's hands and
from the board immediately following each trial and
before the experimenter determined the subject's
score.

After the first half of the feedback was given, the 
experimenter shuffled the cards for a few more 
minutes so that subjects could mull over the feed- 
back they had just received. This interpolated time 
was intended to eradicate the effects of fatigue and 
to make it more difficult for the subjects to directly 
compare, in an objective manner, their first- and 
second-half performance. The experimenter then ad- 
ministered Trials 11-20. Upon completion of the 
twentieth trial, the subjects filled out the postexperimental
questionnaire without receiving any information
about their performance on the last 10
trials.


Dependent Measure 


Although the subjects believed that all 20 trials 
comprised a single unitary task, the averages for the 
first and second 10 trials were calculated separately 
for analysis. In all cases, these averages were based 
on the actual number of cards per trial, not the 
inflated (X + 7) feedback. For subjects in the luck
condition, their scores were those calculated from 
the number of cards actually matched, not the “luck-determined”
scores (which were never actually calculated).
Second-half performance was considered to
be an index of whether first-half success was accepted
or rejected by the subjects.

The postexperimental questionnaire focused on
subjective performance evaluations and the subjects’
recollections of their psychological state during
the task. Some manipulation checks and filler items
were also included.


Debriefing 


Neither the “psychiatric patient” cover story nor 
the inflated (X + 7) feedback was questioned by any 
subject. Both task conditions were explained, and the 
experimenter said she was interested in looking at 
the effects of task instructions on performance.
Subjects were then thanked and paid. 


RESULTS 


An analysis of variance was performed on 
the averages for the first 10 trials to deter- 
mine whether differences existed prior to the 
success feedback. There were no initial dif- 
ferences due to the dispositional variables 
(self-esteem and certainty); however, there 
was a main effect due to level of responsibility
(F = 6.71, p < .05). Subjects performed
better with skill than with luck instructions;
the mean scores were 29.0 and 26.9 for the
skill and luck conditions, respectively. Using
an analysis of covariance, the scores for the
second 10 trials were adjusted for the first-half
differences. The adjusted second-half
scores represent the degree to which performance
changed differentially following a prior
success and thus constituted the main dependent
measure.⁴

Table 1 shows the results of the analysis 
of covariance on Trials 11-20. There was an 
overall effect due to the level of responsibility
(F = 6.55, p < .05), with subjects improving
more in the luck condition than in the
skill condition. Although the self-esteem main
effect did not reach significance, the Self-Esteem
× Level of Responsibility interaction
was significant (F = 12.08, p < .01). High
self-esteem subjects improved slightly more
in the skill than in the luck condition,
whereas low self-esteem subjects improved
markedly more in the luck condition. The
Certainty × Level of Responsibility interaction
was also significant (F = 9.25, p < .01);


⁴ Adjusting the final scores by the first-half scores
provides an equivalent base line for all of the groups
so that comparisons can be made directly among
second-half scores.
TABLE 1
ANALYSIS OF COVARIANCE: FINAL SCORES ADJUSTED BY INITIAL SCORES

Source                 df      MS        F
Self-esteem (A)         1     7.78     2.058
Certainty (B)           1     6.43     1.701
Responsibility (C)      1    24.75     6.55*
A × B                   1     3.29     <1
A × C                   1    45.65    12.08**
B × C                   1    34.98     9.25**
A × B × C               1    18.45     4.88*
Within error           63     3.78
Total (N − 2)          70      —        —

* p < .05.
** p < .01.

certain subjects improved much more in the
luck than skill condition, while uncertain subjects
maintained the same level of improvement
regardless of task instructions.

The fact that both self-esteem and certainty
interacted significantly with level of responsibility
was due chiefly to the two low self-esteem/certain
groups. Inspection of Figure 1
shows that under the skill condition, the
low self-esteem/certain subjects showed essentially
no improvement over their first-half
scores, far less than that of any other
group; in the luck condition, these subjects
improved more than any other group. The
superiority of low self-esteem/certain subjects
in the luck condition over those in the skill
condition was highly significant (t = 5.68,
p < .001). Thus, as Figure 1 demonstrates,
luck instructions produced greater improvement
than skill instructions only among low
self-esteem/certain subjects. But this improvement
was sufficient to make the luck instructions
seem more effective for low self-esteem
TABLE 2
ADJUSTED FINAL AND ADJUSTED CHANGE SCORES

Group                            Final    Change
Skill
  High self-esteem/certain       32.39     4.44
  High self-esteem/uncertain     32.90     4.95
  Low self-esteem/certain        28.60      .65
  Low self-esteem/uncertain      32.09     4.14
Luck
  High self-esteem/certain       32.31     4.36
  High self-esteem/uncertain     32.41     4.46
  Low self-esteem/certain        33.83     5.88
  Low self-esteem/uncertain      32.39     4.44
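
The change scores in Table 2 appear to be each adjusted final score minus a common first-half baseline. The sketch below assumes that baseline is the unweighted mean of the two first-half condition means reported earlier (29.0 for skill and 26.9 for luck, giving 27.95); this is an inference from the reported numbers, not something the text states, and the variable names are illustrative.

```python
# Adjusted final scores from Table 2, keyed by (condition, self-esteem, certainty).
ADJUSTED_FINAL = {
    ("skill", "high", "certain"): 32.39,
    ("skill", "high", "uncertain"): 32.90,
    ("skill", "low", "certain"): 28.60,
    ("skill", "low", "uncertain"): 32.09,
    ("luck", "high", "certain"): 32.31,
    ("luck", "high", "uncertain"): 32.41,
    ("luck", "low", "certain"): 33.83,
    ("luck", "low", "uncertain"): 32.39,
}

# Assumed common baseline: midpoint of the skill and luck first-half means.
BASELINE = (29.0 + 26.9) / 2


def adjusted_change(final, baseline=BASELINE):
    """Adjusted change = adjusted final score minus the common baseline."""
    return round(final - baseline, 2)


changes = {group: adjusted_change(score) for group, score in ADJUSTED_FINAL.items()}
```

Under this assumption the low self-esteem/certain group gains 0.65 cards per trial under skill instructions and 5.88 under luck instructions, the two extremes of the table.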




Fig. 1. Adjusted second-half scores: group means over 10 trials. (LSE = low self-esteem, HSE = high self-esteem.) [Figure not reproduced: mean cards per trial over 10 trials, plotted for the HSE/certain, HSE/uncertain, LSE/certain, and LSE/uncertain groups under skill and luck instructions.]


subjects, regardless of certainty, and for all 
certain subjects, regardless of self-esteem. 

The significant triple interaction (F = 4.88,
p < .05) was also due to the two low self-esteem/certain
groups. The degree of improvement
shown by high self-esteem groups was
essentially the same, whereas the low self-esteem
groups produced a significant interaction
(F for simple interaction = 6.97,
p = .01). Low self-esteem/certain subjects
either improved a great deal (luck) or showed
no improvement (skill) as a function of responsibility
for first-half success, while the
skill and luck low self-esteem/uncertain and
all high self-esteem groups improved significantly
(ts over 6.00, ps < .001) and to about
the same extent. Especially noteworthy was
the low self-esteem/uncertain/skill group, who
matched the improvement of high self-esteem
subjects and significantly exceeded the nonimproving
low self-esteem/certain/skill subjects
(t = 3.68, p < .01).

In summary, the pattern of actual second-half
improvement revealed four major findings:
(a) all high self-esteem groups showed
equivalent and significant improvement; (b)
low self-esteem/certain subjects who were responsible
for their first-half success failed to
show any improvement, significantly less
than any other group; (c) low self-esteem/certain
subjects who were not responsible for
their first-half success improved more than
any other group; and (d) low self-esteem/uncertain
subjects, even those responsible for
their first-half success, improved as much as
high self-esteem subjects. Taken as a whole,
these findings demonstrate that self-determined
success has a relatively counterproductive
effect on the person with chronic
stabilized low self-esteem.

Related findings of interest pertain to sub- 
jective estimates of second-half improvement 
(there was no second-half feedback) and 
reported feelings of uneasiness about first-half 
feedback. In both cases, the data further con- 
firm the low self-esteem/certain/skill subjects’ 
relative lack of enthusiasm for their first-half 
success. At the conclusion of the task, their
estimates of improvement were the lowest of
any group, while those of low self-esteem/uncertain/skill
subjects were the highest (t for
these extremes = 4.10, p < .001). Thus, in
judgments about self-produced performance, 
the low self-esteem/uncertain group insisted 
that they had succeeded, while low self- 
esteem/certain subjects were pessimistic about 
their second-half performance. In terms of 
uneasiness about their first-half success, low 
self-esteem subjects reported being relatively 
less relaxed about the feedback than high 
self-esteem subjects (F = 7.32, p < .01), with 
the low self-esteem/certain/skill group being 
the least relaxed of any group. 


DISCUSSION


The results show that persons for whom 
success is inconsistent (i.e., chronic low self- 
esteem subjects) will reject or minimize suc- 
cess only if they are certain of their low self- 
assessment and the success is self-determined. 
When low self-esteem/certain subjects suc- 
ceeded under skill conditions, success did not 
provide the impetus for the second-half per- 
formance that was evident in other groups. 
Their performance did not actually deteriorate 
following success, but they did fail to improve, 
and, in so doing, avoided further success. 

That is, since low self-esteem/certain subjects
whose success was fate-determined (i.e., due
to luck) showed a dramatic improvement from
the first to the second half of the task, the
low self-esteem/certain/skill group’s failure to
improve seems to indicate a suppression of
performance, a premature stabilizing of first-half
performance, rather than a lack of capacity.
Thus, when these subjects did not
improve when improvement was clearly acces- 
sible to them, they in effect avoided success. 
Moreover, since the low self-esteem/certain/ 
skill group was the only one failing to im- 
prove, their performance can be said to have 
deteriorated relative to the performance of 
the remaining three low self-esteem groups, 
thus providing evidence that low self-esteem/certain
subjects attempted to minimize self-determined
first-half success (i.e., success of
their own making). The significant improvement
shown by all low self-esteem groups
except the low self-esteem/certain/skill group 
also indicates that whether a consistency or 
a reinforcement rationale will predict chronic 
low self-esteem subjects’ response to success 
is jointly determined by the certainty of the 
self-evaluation and responsibility for success. 


The consistency principle seems to hold only
when success is self-produced and the chronic
low self-esteem person is sure of his self-appraisal.

The improvement of low self-esteem/uncertain/skill
subjects demonstrates that not all
low self-esteem subjects attempt to minimize
self-produced success, even though such a 
success is inconsistent with their chronic self-appraisal.
This favorable reaction toward an
unexpected self-generated success was apparently
a function of the uncertain nature of
the self-appraisal. Being uncertain may indicate
that the person doubts that his chronic
negative behavior stems from intrinsic, immutable
personal qualities. That is, he is uncertain
as to whether his chronic self-assessment
applies to what he actually is (i.e.,
underlying, finalized dispositional traits) or
merely reflects surface performance that has
persisted for an extended period but does not
arise from his basic nature. This state of
doubt and uncertainty in regard to the self-appraisal
may minimize consistency concerns,
leaving the success-deprived low self-esteem
person “hungry” for the self-produced success
that will validate his refusal to fully
internalize past failure tendencies.
This finding also helps clarify the results of 
previous self-esteem studies in which chronic
high self-esteem and low self-esteem subjects
have not reacted differently to success (see
Wylie, 1961). In the skill condition of the
present study, the low self-esteem/certain sub-
jects reacted much differently to success than 
the high self-esteem subjects, but the low 
self-esteem/uncertain subjects did not. The 
critical point here is that when the low self- 
esteem groups are combined, they do not differ 
significantly from the high self-esteem groups
combined. In short, if the certainty dimension 
had not been a variable, another equivocal 
high self-esteem versus low self-esteem result 
would have emerged. This suggests that when 
low self-esteem subjects are “lumped” to- 
gether without regard for the certainty of their 
self-appraisal, reward-seeking tendencies on 
the part of some more than cancel out con-
sistency tendencies on the part of others so
that overall, low self-esteem subjects fail to
differ from high self-esteem subjects. 

Perhaps the most important finding was the 
great improvement shown by the low self- 
esteem/certain subjects in the luck condition. 
Here first-half success gave great impetus to 
actual second-half performance, indicating 
that if success is not self-produced, even per- 
sons with deeply rooted chronic low self- 
esteem will enthusiastically embrace a suc- 
cessful outcome. Apparently, a success that 
does not implicate the person as causal agent 
(i.e., is not self-caused) is irrelevant and
immaterial to the person’s self-appraisal and 
therefore does not arouse consistency needs. 
The person is not responsible for the event, 
and hence it does not refer or pertain to his 
self-evaluation any more than having one’s 
unoccupied, properly parked car smashed in 
the rear reflects upon one’s driving ability. 
Here, self-consistency should not even be an 
issue by virtue of the nature of the success 
event itself (i.e., it has nothing to do with
the self). This, in effect, leaves low self-esteem 
persons free to react to success without regard 
to their self-appraisal and hence under no 
pressure to behave in a self-consistent manner. 
Perhaps under these circumstances only the 
rewarding aspects of success are relevant, and 
hence low self-esteem subjects in the present 
study reacted very favorably to success. 

The significance of this finding lies in its 
implications for at least interrupting the 
vicious cycle, whereby low self-esteem leads to 
avoidance and/or minimization of experiences 
that would disconfirm the validity of a nega-
tive self-assessment. Although a fate-deter- 
mined success should be irrelevant to the 
person’s self-appraisal, it is still possible that 
over time actually experiencing and accept- 
ing a “lucky” success will subtly draw the 
low self-esteem person toward a somewhat en- 
hanced view of himself (cf. Jones, 1964, pp. 
57-67). This might happen in the following 
manner. Even though a successful outcome is 
completely due to luck in an objective sense, 
a person may take partial credit for the suc- 
cess and thereby gently raise his self-esteem. 
After all, in most cases the person does play 
some small part in the determination of 
chance outcomes. For example, in order to 
win at roulette, dice, or coin flipping, a bet 
must be placed, dice rolled, or heads or tails 
called. Hence, the person is a contingency of 
sorts in the total process, whereby he acquires 
a chance success, and therefore he may be 
able to assume a small degree of responsi- 
bility for the success. More importantly, the 
vagueness of his contingency role should allow 
him to choose, within “small degree” reality 
limits (e.g., say 10%), the degree of respon- 
sibility he will assume, ranging from the low 
end of attributing the outcome completely to 
chance up to the maximum of 10% attributed 
to the self. This latter property of a chance 
success may not only free the confirmed 
chronic low self-esteem person from the con- 
straints of consistency needs, allowing him 
to accept the success, but may also produce 
a small increment in self-esteem. That is, 
assuming that minimal degrees of inconsist- 
ency do not activate consistency needs, the 
self-discretion aspect of a chance success 
would enable the low self-esteem person to 
claim a degree of personal responsibility for 
an esteem-enhancing event (i.e., success) that 
would fall short of arousing consistency con- 
cerns. A degree of self-attributed success that 
does not evoke consistency needs should be 
accepted by the low self-esteem person, and 
his self-esteem would rise accordingly. 

This partial credit for success notion pro- 
vides a basis for drawing the low self-esteem 
person subtly toward a positive self-evalua- 
tion. Once the low self-esteem person’s self- 
esteem was raised slightly by acceptance of 
partial credit for a chance success (e.g., 5%),
then subsequently an even greater degree of
credit for the success could be assumed 
without arousing consistency needs. That is, 
the person’s self-appraisal would be slightly 
more positive so slightly greater degrees of 
credit for success would be defined as accept- 
able. This suggests that the person would 
now assume credit for a degree of success 
that formerly aroused consistency needs, say 
10%. This would raise his self-esteem even 
higher, which would in turn free him to 
assume even greater degrees of credit for 
success. Once such a positive escalating cycle 
were initiated, the person could be exposed 
to a series of same-size successes in which the 
proportion of success objectively attributable 
to skill (i.e., the self) would gradually in-
crease. This would make for continued small 
increments in the degree of personal credit 
that could be taken for success without 
arousing consistency needs. Consequently, 
self-esteem should be gradually enhanced 
until eventually the person would possess a
positive self-evaluation. In this manner it may 
be possible to change negative self-appraisals 
into positive ones. 
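
The escalating cycle described above can be illustrated with a toy numerical sketch. Nothing in it comes from the study's data: the function name, the starting tolerance of 5%, and the two gain parameters are hypothetical choices made only to show how a credit tolerance that grows with self-esteem would produce steadily larger acceptable shares of credit.

```python
def simulate_credit_cycle(n_successes=4, initial_tolerance=0.05,
                          esteem_gain=1.0, tolerance_gain=1.0):
    """Toy model of the partial-credit cycle (all parameters are hypothetical).

    tolerance: largest share of a success the person can attribute to the
               self without arousing consistency needs.
    esteem:    change in self-esteem relative to the starting level.
    """
    esteem = 0.0
    tolerance = initial_tolerance
    steps = []
    for _ in range(n_successes):
        # The person claims as much credit as the current tolerance allows.
        credit = min(tolerance, 1.0)
        # Accepted credit raises self-esteem.
        esteem += esteem_gain * credit
        steps.append((round(credit, 4), round(esteem, 4)))
        # A more positive self-appraisal makes larger credit acceptable.
        tolerance = initial_tolerance + tolerance_gain * esteem
    return steps


if __name__ == "__main__":
    for credit, esteem in simulate_credit_cycle():
        print(f"credit taken: {credit:.2f}  esteem change: {esteem:.2f}")
```

With these assumed defaults the first success is credited at 5% and the second at 10%, matching the illustrative figures in the text; the later values depend entirely on the arbitrary gain parameters.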


Avoidance or Discouragement? 


It seemed possible that a discouragement 
or disappointment factor, rather than avoid- 
ance of success, produced the low second-half 
performance of the low self-esteem/certain/
skill subjects. Even though there was no trial- 
by-trial knowledge of results, these subjects 
could have become discouraged if they sensed 
that their performance on Trials 11 and 12 
(the first two trials of the second half) failed 
to match their inflated first-half performance. 
Moreover, it is plausible that the low self- 
esteem group would be particularly sensitive 
to a self-determined failure and therefore 
would exhibit the most discouragement. The 
evidence against such an explanation lies in 
an examination of the performance over trials: 
The pattern of the low self-esteem/certain 
group in the skill condition is similar to that 
of other groups, but consistently lower in 
absolute score. This includes the eleventh trial 
where discouragement was not a factor since 
this was the first trial following success feed-
back. Nonetheless, the low self-esteem/cer-
tain/skill group’s eleventh trial performance
fell below that of any other group. In addi-
tion, like the other groups, their peak per-
formance occurred in the middle of the second
half of the task (i.e., Trials 14 and 15). If
these subjects had been discouraged by their 
performance at the start of their second half, 
their scores should have dropped off on exactly 
the trials where they peaked. 

A final question arises concerning the mea- 
sure of certainty. It did provide a significant 
distinction among low self-esteem subjects, 
and it seemed to reflect doubt about their 
self-evaluations in general since uncertainty 
was expressed about endorsed positive items 
as often as about endorsed negative items.⁵
However, it is unclear as to whether the un-
certain subjects were uncertain people in gen-
eral (i.e., across many dimensions) or were
uncertain with respect only to their self-evalu-
ation. The present data do at least suggest
that our uncertainty measure was specific to
self-evaluation. The low self-esteem/uncertain
subjects in the skill condition responded very 
favorably to the positive feedback concerning 
their performance. In fact, this group’s sec- 
ond-half performance estimate was higher 
than that of any other group. Hence, these 
subjects, though uncertain about their nega- 
tive self-evaluation, were apparently not hesi- 
tant about believing the positive (counter- 
valent) feedback received from an external 
source, at least as evidenced by their confi- 
dence concerning second-half performance. It 
seems likely that if our low self-esteem/un- 
certain subjects had been generally uncertain, 
they would have been doubtful and disbeliev- 
ing of a positive evaluation from an outside 
source as well as being doubtful about their 
self-evaluation. Obviously, however, further 
validation of this measure is imperative be- 
fore substantive conclusions are warranted. 


⁵ It should be noted that even low self-esteem
subjects answered more self-esteem items positively 
than they did negatively. Hence, even though subjects 
were uncertain about positively endorsed items as 
frequently as they were about negatively endorsed 
items, a greater proportion of the negatively en- 
dorsed items were placed in the uncertain category. 
DELAY OF GRATIFICATION AS A FUNCTION OF RACE
OF THE EXPERIMENTER 1


BONNIE R. STRICKLAND 2 


Emory University 


Sixth-grade black and white students were offered one 45-rpm record as an 
immediate reward for completing a locus of control measure or three records 
if they would wait for the experimenter, either a black or a white male, to 
return 3 weeks later. About 80% of the white students chose the delayed, more 
valuable reward, regardless of the race of the experimenter. Thirty-three per-
cent of the black students chose the delayed reward from the white experi- 
menter, and 56% chose from the Negro experimenter. Black students were more 
likely to be assessed as external than whites. While no relationship was found 
between locus of control and delay behavior among blacks, internal white 
students chose more delayed rewards than external white students. Results 
are discussed in terms of the mistrust of the white experimenter on the part
of the black students.


Zytkoskee, Strickland, and Watson (1971), 
in a study with Negro and white ninth 
graders, reported that black adolescents were 
less likely to delay reinforcement than were 
matched white students. In this research, the 
experimenters were white men who appeared 
before the students either as a representative 
of the public school establishment or as a 
casual college student who also had to com- 
plete school assignments. While the status of 
the experimenters did not differentially affect 
the subjects’ delay behavior, it is interesting 
to speculate on the possible effect of the white 
race of the experimenters on the black and 
white students. It could well be that the black 
students did not trust either of the white 
experimenters to return with the delayed re- 
ward to the extent that the white students 
did. Consequently, an important question is 
unanswered. Does the race of the experimenter 
differentially affect the delay behavior of black 
and white children? 

Ample evidence is available demonstrating 
that black children respond differently to 
black and white examiners or teachers who
ask them to perform intelligence-related tasks
(Dreger & Miller, 1960, 1968; Katz, 1964;
Kennedy & Vega, 1965; Shuey, 1958; Trent,
1954), and it is only reasonable to assume 
that a black man standing before a predomi- 
nantly black class presents a different stimulus 
value than a white man. In this regard, it is 
also of interest to investigate the responses 
of white students to black and white trust 
agents. Thus, the present research extends the 
investigations of Zytkoskee et al. (1971) in
that experimenters offering rewards will be
both black and white, allowing the differential
effects, if any, of the race of the delay agent
to be examined.

While Zytkoskee et al. (1971) did not find 
a relationship between a belief in internal 
versus external control of reinforcement and 
delay behavior, they did find, as did Lefcourt 
and Ladwig (1965), that black subjects were 
more likely to be assessed as external than
their white counterparts. The present research


also includes a new measure of internal- 
external control in an effort to replicate these 
findings. 

Finally, the theoretical ancestry of delay 
behavior and internal belief is somewhat simi- 
lar within Rotter’s (1954) social learning 
theory. It can be argued that a belief system 
based on the perception of world events as 
occurring in a more or less orderly and pre- 
dictable manner is related to a generalized 
[...] occur actually happens. Thus, an additional
concern of the present research, particularly 
via the utilization of an improved assessment
instrument for locus of control, is the further 
clarification of the relationship between a 
belief in internal control of reinforcement 
and delay behavior. 


METHOD 
Subjects 


Three hundred sixth-grade students, ages 11-13
years, from two public elementary schools served as 
subjects. The black subjects were drawn from an 
urban school serving an inner-city district classified 
by the public school system as a poverty area. While 
integrated, the school was predominantly black. 
White subjects were drawn from a school located 
in a suburb immediately outside the urban area 
in which the black school was located and included 
children of every socioeconomic status. This school 
was integrated with a black population of about 
15%; however, only white students were used in 
the present study. The white experimenter tested 36
males and 48 females in the black school and 26 
males and 24 females in the white school. The black 
experimenter tested 53 males and 34 females in the 
black school and 43 males and 36 females in the 
white school. 


Procedure 


The assessment instrument for a belief in internal 
versus external control was a new 40-item yes-no 
scale designed by Nowicki and Strickland (1971). An 
illustrative item is “Most of the time, do you feel 
that you can change what might happen tomorrow
by what you do today?” Items answered in the
external direction are scored with a range from
1 to 40. In initial work with the scale, internal
consistencies ranged from .69 at the third-grade level
to .82 at the tenth-grade level (N = 1,200+). Test-
retest reliabilities ranged from .66 to .75 over a
1-month period. This scale is thought to be an
improvement over earlier measures of locus of control 
for children in that it is reliable, is objectively scored,
is not related to social desirability, assesses a gen-
eralized rather than specific belief in locus of control,
shows satisfactory internal consistency, and has good 
construct validity (Strickland & Nowicki, 1971). 
The Nowicki-Strickland Locus of Control scale 
was administered within the schools by either the 
black or white experimenter. Experimenters read the 
items to the students and asked them to mark their 
answers on the tests. Following the administration 
of the scale, each experimenter then told the subjects 
that they wished to reward them for assisting with 
the research. The experimenter said that he had
hoped to give each student several 45-rpm records 
as a reward, but that unfortunately he had not 
TABLE 1


NUMBERS OF SUBJECTS CHOOSING IMMEDIATE
VERSUS DELAYED REWARDS


Race of      |               | Immediate |               | Delayed |
experimenter | Subjects      | choice    | Total         | choice  | Total
Black        | Black males   | 23        | [illegible]   | 30      | 56
             | Black females | 15        |               | 19      |
             | White males   | 10        | 2[illegible]  | 33      | 79
             | White females | 6         |               | 30      |
White        | Black males   | 26        | [illegible]   | 10      | 33
             | Black females | 30        |               | 18      |
             | White males   | 7         | 20            | 19      | 80
             | White females | 3         |               | 21      |


brought enough records that day to give out as 
many as he wished. Subjects were then told that the 
experimenter could give each student one record at 
that time, but that if the student wished to wait, 
he, the experimenter, would return in 3 weeks and 
give the student three records. Students were then 
asked to indicate their choice, one record immedi- 
ately or three records 3 weeks hence, on the back 
of their tests. Students who chose the immediate 
reward were then given the one record. Students 
who opted for the delayed reward were given the 
three records 3 weeks later. The experimenters ap- 
peared in independent classes at the same time during 
the school day so that there was no opportunity for
the students to talk among themselves about the
different race of the experimenters. Both experi-
menters were advanced graduate students in educa- 
tional psychology and were dressed in casual clothes 
when they went to the school. Each practiced his 
experimental instructions with the other so that their 
presentations to the subjects would be as uniform 
as possible. 


RESULTS 


Table 1 presents the number of immediate 
and delay choices for subjects in each of the 
race-sex groups according to experimenter. 
Using this contingency table, a number of 
chi-squares were computed. There were no 
significant differences in responses between 
black males and females or between white 
males and females. Combining males and fe- 
males of both races, a chi-square of .44 was 
found, which is not significant. Consequently, 
it appears that in the present sample, the sex 
of the subject when controlled for race was
not related to delay behavior. It does appear
that white females are different from all other
subjects combined (χ² = 18.25, p < .001);
however, because of the small number of
subjects per cell, analysis of variance for pos-
sible interaction effects was not appropriate.

Considering the responses of the black stu- 
dents for each of the experimenters, a chi- 
square of 9.12 was found (p < .01), with
the black students more likely to delay for 
the black experimenter than for the white. 
Considering the responses of the white stu- 
dents across experimenters, a chi-square of .00 
was found, indicating that essentially the race 
of the experimenter did not differentially af- 
fect the immediate-delay choice responses of 
the white students. 
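
The chi-squares reported above can be checked with a short sketch. The cell counts are taken from the subject numbers given in the Method section and the n columns of the tables, pooled over sex; the helper name `pearson_chi_square_2x2` is hypothetical, and the uncorrected Pearson formula is an assumption, adopted here because it reproduces the reported values of .44, 9.12, and .00.

```python
def pearson_chi_square_2x2(a, b, c, d):
    """Pearson chi-square for the 2 x 2 table [[a, b], [c, d]].

    No continuity correction is applied (an assumption; the source does
    not state which formula was used).
    """
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Each call passes (immediate, delayed) counts for row 1, then row 2.

# Males versus females, both races and both experimenters combined.
chi_sex = pearson_chi_square_2x2(66, 92, 54, 88)

# Black students: black experimenter versus white experimenter.
chi_black_students = pearson_chi_square_2x2(38, 49, 56, 28)

# White students: black experimenter versus white experimenter.
chi_white_students = pearson_chi_square_2x2(16, 63, 10, 40)

if __name__ == "__main__":
    print(f"sex:            {chi_sex:.2f}")
    print(f"black students: {chi_black_students:.2f}")
    print(f"white students: {chi_white_students:.2f}")
```

Each statistic has one degree of freedom, so only the value for the black students exceeds the conventional .01 critical value of 6.63.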

Table 2 presents the means and standard 
deviations of the Nowicki-Strickland Locus of 
Control scale scores for each of the race-sex-
experimenter groups. Because of the wide 
range of numbers of subjects falling into each 
group (3-33), it was necessary to collapse 
across groups in order to meet the assumptions 
of analysis of variance which appeared to be 
the appropriate statistic in view of the pos- 
sibility of an interaction effect. Consequently,
t tests were computed to ascertain any dif- 
ferences occurring as a function of the sex of 
the subject or the race of the experimenter 
who administered the scale. No significant 
differences were found, and the data were 
then entered into a 2 × 2, that is, Choice
Behavior × Race, analysis of variance for
unequal Ns. 

An F of 6.22 (df = 1/296, p < .05) was 
found for choice behavior. An F of 81.3 
(df = 1/296, p < .001) was found for race, 
and an F of 5.83 (df = 1/296, p < .025) was
found for the interaction of choice behavior 
and race. For both immediate- and delayed- 
reward subjects, blacks were more external 


TABLE 2


MEANS AND STANDARD DEVIATIONS OF
LOCUS OF CONTROL SCORES


Race of      |               | Immediate choice      | Delayed choice
experimenter | Subjects      | n  | Mean   | SD      | n  | Mean  | SD
Black        | Black males   | 23 | 17.35  | 3.63    | 30 | 18.97 | 2.02
             | Black females | 15 | 19.13  | 4.45    | 19 | 18.47 | 3.78
             | White males   | 10 | 15.2   | 2.82    | 33 | 13.82 | 4.34
             | White females | 6  | 16.17  | 4.58    | 30 | 13.63 | 3.84
White        | Black males   | 26 | 19.84  | 3.13    | 10 | 20.4  | 2.97
             | Black females | 30 | [?]0.4 | 2.82    | 18 | 19.61 | 3.09
             | White males   | 7  | [?]43  | 3.91    | 19 | 11.95 | 5.10
             | White females | 3  | 18.67  | 2.52    | 21 | 13.90 | 4.73
than whites. By means of a Duncan multiple-
range test, it became clear that black subjects
were similar in their choice behavior regard- 
less of their placement on the Locus of Control 
scale. White subjects who chose delayed re- 
wards were more likely to be internal than 
white subjects who chose immediate rewards. 


DISCUSSION 


Clearly, the race of the experimenter who 
presents opportunities for immediate versus 
delayed and more valuable rewards to school 
children differentially affects the choice behav- 
ior of black children. In the present study,
white children did not seem to respond differ- 
ently when the experimenter was either black 
or white, but black youngsters were signifi- 
cantly less likely to choose the delayed reward 
when the agent giving them the choice was
white. Additionally, as has been found in the


past, the black subjects were significantly more 
likely to be assessed as holding a belief in 
external control of reinforcement than were 
white subjects (Battle & Rotter, 1963; 
Lefcourt & Ladwig, 1965; Zytkoskee et al.,
1971).

The finding that black children do not
choose delayed rewards from a white experi-
menter is not unexpected when one con-
siders that the delay situation may be a
paradigm of trust. If black youngsters have 
a history of broken promises from the white 
culture or have incorporated these attitudes 
from their general social climate, then they 
may well mistrust the white experimenter who 
says that he will return with more valuable 
rewards at a later time. It is difficult to sepa-
rate a trust factor from delay behavior, par- 
ticularly when Mahrer (1956) and Mischel 
(1963) have shown that preferences for de- 
layed rewards are affected by the individual’s 
experimentally manipulated promise-keeping 
history. 

Zytkoskee et al. (1971) controlled for 
socioeconomic status and with a white experi- 
menter found that ninth-grade black children 
were significantly more likely to choose im- 
mediate rewards than the white children. While 
the present samples are somewhat representa- 
tive of the current prevailing school popula- 
tions across the country, namely, a predomi-
nantly white suburban sample and a black
inner-city group, they were by no means con- 
trolled in terms of socioeconomic levels. Black 
subjects were drawn from an urban school 
located in a poverty area, and white students 
ranged across all socioeconomic levels. Conse- 
quently, no comparisons were made across 
schools since any overall black-white differ- 
ences might well be a function of socio- 
economic variables or differences occurring 
because of intrinsic characteristics of the 
independent schools and teachers. Clearly, 
continued research teasing out the relative 
contributions of social class placement and 
race or culture to delay behavior is necessary. 

In comparing responses within schools, how- 
ever, as a function of the race of the experi- 
menters, it does seem that the current findings
have a number of implications. If the black
lower-class student is mistrusting of the white
authority figure who stands before him, it 
seems imperative that he be given ample op- 
portunity to observe both black and white 
authority models to determine for himself the
trustworthiness of the white man. While his 
mistrust of the white authority figure may not 
be unexpected at this time, in view of the 
history of cultural oppression of the black 
minority, with continued integration of 
schools, and, hopefully, a movement toward a 
less prejudicial white society, the black school
child should have more opportunity to correct 
his perceptions of the white man if they are 
indeed inaccurate. The findings also suggest 
that at this time black figures available to 
black lower-class students will be invested 
with more trust than the white authority, and 
it would seem important that black children 
have continued access to black adult models.
An interesting extension of this research would
be to continue to investigate at what ages
delay differences occur as a function of race, 
particularly since Zytkoskee and McCandless 
(1969) did not find delay differences between 
black and white preschool children when pre- 
sented alternatives from a white experimenter. 

The finding of a significant relationship be- 
tween delay behavior and a belief in internal 
control, at least for the white subjects, is 
not surprising. If one believes that the 
events that happen to him are a result of his
own behavior and responsibilities, then his 
belief system is likely to encompass a con- 
ceptualization of the world as logical and 
orderly with clear behavior—reinforcement 
contingencies. On the other hand, if a person 
believes that his life events are beyond his 
personal control and understanding and per- 
haps occur as a result of luck, chance, or 
indifferent powers, then it is not unexpected 
that he might seize immediate rewards not 
knowing what may happen in the future. That 
this finding occurs in the present study while 
Zytkoskee et al. (1971) were not able to
relate delay behavior to internal belief may 
be a function of the wider socioeconomic 
range of white children tested and/or the use 
of an improved locus of control assessment 
instrument. That the finding of the relation- 
ship occurs for white children and not for 
lower-class blacks leads to some interesting 
speculation. The internal—external locus of 
control dimension is a generalized expectancy 
about behavior-contingent reinforcements and 
may not be tapping identical situational ex- 
pectancies that might be necessary to predict 
a specific behavior such as delay. Gurin, 
Gurin, Lao, and Beattie (1969), in a study 
with college students, found that while white 
students were similar in their answers to 
internal-external items differentiated as to 
personal versus institutional control, the black 
students were more likely to believe in exter- 
nal institutional control, while their belief in 
personal control was similar to whites. If the 
black sixth graders in the present study are 
also answering locus of control items in a 
different fashion than the white students, then 
the resulting delay—internal-belief relationship 
found for the whites would not be expected 
to hold for blacks. 

More extensive research with a new scale, 
particularly item analysis in regard to race 
and social class, is clearly warranted, as well 
as continued research on the correlates of 
internal—external beliefs and delay behavior 
for black and white children. Additionally, 
it is important to note continually the dif- 
ferential effects of the race of the experi- 
menter on black-white subject behaviors 
under investigation. 
INTRINSIC MOTIVATION, EXTRINSIC REINFORCEMENT,
AND INEQUITY 


EDWARD L. DECI


Management Research Center, University of Rochester 


If a person who is intrinsically motivated to perform an activity begins to 
receive external reinforcement for the activity, what will happen to his in- 
trinsic motivation? Previous studies and the present study indicate that money 
decreases intrinsic motivation, while verbal reinforcements tend to enhance 
intrinsic motivation. The beginning of a cognitive evaluation theory is dis- 
cussed, and an apparently discrepant prediction between this theory and in- 
equity theory is pointed out. It is argued, however, that the theories are not 
conceptually discrepant, and the present study gives support for this argu-
ment.


It is possible to distinguish between two 
broad classes of motivation to perform an ac- 
tivity: intrinsic motivation and extrinsic moti- 
vation. A person is intrinsically motivated if 
he performs an activity for no apparent re- 
ward except the activity itself (cf. Berlyne, 
1966; Hunt, 1965; White, 1959). Extrinsic 
motivation, on the other hand, refers to the 
performance of an activity because it leads to 
external rewards (e.g., status, approval, or 
passing grades). The question of interest in 
this study is whether there will be changes in 
a person’s intrinsic motivation for an activity 
when he receives external rewards for per- 
forming that activity. 

Deci (1971) reported that external rein- 
forcements do affect intrinsic motivation, and 
he suggested the initial elements of a cognitive 
evaluation theory to account for the changes 
in intrinsic motivation following an experience 
with extrinsic rewards. The theory focuses on 
a person’s cognitive evaluation of an activity 
and the reasons for his engaging in the activ- 
ity. It suggests that distinctions should be 
made among different kinds of external re- 
wards, since a person’s evaluation of different 
rewards may be different. In turn, this would 
lead to different effects on the person’s in-
trinsic motivation. 
For example, Deci (1971) stated that 


money is frequently used as a means of “buying” 
services which would probably not otherwise be 
rendered. Perhaps, then, the presence of money as 
an external reward suggests to the subjects that 
they “should probably not render this activity with- 
out pay,” that is, they should not be so intrinsically 
motivated to do the activity. 

This could lead the subjects to a process of cogni- 
tive reevaluation of the activity from one which is 
intrinsically motivated to one which is motivated by 
the anticipation of money [p. 107]. 


On the other hand, when verbal reinforce- 
ments are given as external rewards, these 
rewards may not be phenomenologically dis- 
tinguishable from the feelings of satisfaction 
that the subjects get from the activity, so the 
total positive value properties (Koch, 1956) 
associated with the activity have increased. 
Therefore, the tendency to perform the activ- 
ity in the absence of external rewards will be 
strengthened; that is, the person’s intrinsic 
motivation will be increased. 

The important difference in the effects of 
money and verbal reinforcements lies in a 
person’s perception of the locus of causality 
of his behavior (deCharms, 1968; Heider, 
1958). When he receives money, he could eas- 
ily come to accept the money as the reason 
for his behavior, but he is less likely to do 
this if the rewards are social approval. Hence, 
the theory posits that a person can come to 
perceive that his behavior is controlled by 
external rewards and that this will lead to a 
decrease in intrinsic motivation. However,
when he receives interpersonal rewards, he 
will not perceive them as controls of his be- 
havior, so they will strengthen his intrinsic 
motivation because they strengthen his sense 
of competence and self-determination. 


Inequity Theory 


Adams (1963, 1965) has presented a 
theory of inequity which might at first seem 
to be inconsistent with the predictions of the 
cognitive evaluation theory. Adams proposed 
that when a person is in an exchange relation- 
ship with another person, he will be con- 
cerned about the outcomes and inputs of each. 
Outcomes are all of the compensations a per- 
son gets, such as money, independence, per- 
sonal satisfaction, etc. Inputs are all of the 
things he gives to the situation, such as training,
intellect, effort, etc. Adams argues that
the person will evaluate his own ratio of out- 
comes to inputs and compare it to the other 
person’s ratio. If the two ratios are unequal, 
the person will feel inequity and will be moti- 
vated to reduce this inequity. Inequity will 
exist for a person both when he is overcom- 
pensated (i.e., when his ratio of outcomes to 
inputs is greater than the other’s) and when 
he is undercompensated (i.e., his ratio is less 
than the other’s). This discussion is con- 
cerned only with overcompensation. 
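
The comparison of ratios that Adams describes can be illustrated with a small sketch. The Python below is only an illustration and is not part of Adams's formulation: the `Party` class, the `equity_state` function, and the numeric scores are hypothetical, and the sketch assumes that outcomes and inputs can be placed on a common numeric scale.

```python
from dataclasses import dataclass


@dataclass
class Party:
    outcomes: float  # hypothetical score for compensations received
    inputs: float    # hypothetical score for what the person contributes

    def ratio(self) -> float:
        """Ratio of outcomes to inputs, the quantity that is compared."""
        return self.outcomes / self.inputs


def equity_state(person: Party, other: Party, tol: float = 1e-9) -> str:
    """Classify the person's position relative to the comparison other."""
    diff = person.ratio() - other.ratio()
    if abs(diff) <= tol:
        return "equity"
    return "overcompensated" if diff > 0 else "undercompensated"


other = Party(outcomes=2.0, inputs=1.0)

# The person receives twice the outcomes for the same inputs.
person = Party(outcomes=4.0, inputs=1.0)
state_before = equity_state(person, other)

# With outcomes fixed, raising inputs lowers the ratio.
person_after = Party(outcomes=4.0, inputs=2.0)
state_after = equity_state(person_after, other)

print(state_before, "->", state_after)
```

In this sketch the overcompensated person reaches equity by doubling his inputs while his outcomes stay fixed, which is the route to restoring equity that matters for the present discussion.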

When a person is overcompensated, he may
try to lower his own outcomes to restore
equity (e.g., Wood & Lawler, 1970). However,
if a person’s outcomes were fixed, another
way he could restore equity would be
to increase his own inputs, thereby also decreasing
his ratio of outcomes to inputs (e.g.,
Adams & Rosenbaum, 1962). A large number
of studies supporting these assertions were reviewed
by Lawler (1968), Pritchard (1969),
and Goodman and Friedman (1971).

Adams (1965) mentioned that the person
may also use an internal standard as his comparison
other. Pritchard (1969) elaborated
this point by defining internal standard as
“the amount of outcome Person perceives as
being commensurate with his own inputs,
without regard to any comparison person [p.
205].” He suggested that “feelings of inequity
arise first and foremost from the correspondence
between Person’s own inputs and outcomes.
If his inputs are greater than [his] outcomes,
he will experience inequity, which will
lead to feelings of dissatisfaction [p. 206].”
Similarly, if his outcomes are substantially
greater than his inputs, he would experience
inequity.

Now, imagine a subject who is intrinsically 
motivated to perform an activity and has 
received substantial overpayment. He might 
be expected to experience inequity because of 
his internal standard. If after the specified 
period ended he were allowed to continue to 
work for a short period for no pay, equity 
theory would predict that he would tend to 
continue working, since at the end of the 
payment period he had more outcomes than 
inputs and making additional inputs for no 
pay would help to restore equity. 

The cognitive evaluation theory prediction, 
however, is that he would be less likely to 
continue to work, because the experience with 
money made him dependent on the money 
and decreased his intrinsic motivation to per- 
form the task. It will now be proposed that 
these two theories are not conceptually dis- 
crepant. 

In the cognitive evaluation theory, it is 
suggested that when a person performs an 
intrinsically motivated task for money, his 
perception of the reason for performing the 
task shifts from “it is intrinsically motivated” 
to “it is motivated by the money.” Since the 
person is then performing for money (and 
therefore has less intrinsic motivation), the 
principle of inequity could certainly be rele- 
vant. In other words, in the example above, 
he would continue to perform after the money 
stopped if he were feeling inequitably overpaid;
however, he would not perform if he
felt equitably paid. In either case his in- 
trinsic motivation would have decreased, and 
he would have less intrinsic reason to perform, 
so the high performance predicted in the first 
case would occur because of inequity in his 
own mind, not because intrinsic motivation 
increased. This additional performance would 
last only until equity was restored. 

In a previous study by Deci (1971), several 
subjects were paid as much as $4 for about 20 
minutes of work on an interesting puzzle. It
was only at the time that the subjects were
actually handed the money that they ap- 
peared to feel “uneasy” and “inequitably 
overpaid.” During the session when the ex- 
perimenter said things like, “You have now 
earned $3,” or “You have now earned $4,” 
the subjects said nothing and did not appear 
to feel discomfort. However, when they were 
actually handed the money, they became very 
uneasy, and many of them said things like, 
“That’s a lot of money for what I did,” or “I 
feel funny taking all this money.” 

Now, imagine again a laboratory experi- 
ment such as the example given above of the 
overpaid subject. If the subject were actually 
given the money before he had the free time 
to continue working (i.e., the period with no 
pay), it seems likely that he would experi- 
ence inequity (since he has the money as an 
initiating stimulus), so he would be more 
likely to make additional inputs to restore 
equity. However, if he knew he was going to 
be paid, but payment was withheld until the 
end of the free time, he would not experience 
the same inequity (since he does not yet have 
the initiating stimulus, money), so he would 
be less likely to do more work. He is no longer 
as intrinsically motivated, and he apparently 
does not experience inequity from over- 
compensation, so he would be less likely to 
continue to make inputs. Thus, the timing of 
the payment may determine whether the 
change in performance is governed by the 
processes of inequity or cognitive reevalua- 
tion. 

Hypothesis I: When a person is rewarded
with money for performing an intrinsically
motivated activity, his intrinsic motivation
will decrease.

Hypothesis II: When a person
is rewarded with verbal reinforcements for
performing an intrinsically motivated activity,
his intrinsic motivation will increase.

Hypothesis III: However, when a person who
is performing an intrinsically motivated activity
feels inequitably overpaid, he will
increase his performance (i.e., make additional
inputs) to restore equity.

Two other questions are investigated in the 
present experiment. First, since money and 
verbal reinforcement appear to have opposite 
effects on intrinsic motivation (Deci, 1971), 
a cell containing both rewards will allow for
new empirical evidence. There is, however, no 
theoretical basis for making a prediction about 
the scores in this cell. 

Second, one might ask whether the effects 
of external rewards on intrinsic motivation 
are the same for males and females. Hence, 
half of the subjects assigned to each cell were 
males, and half were females so that this ques- 
tion could be investigated. 


METHOD 


The subjects for this experiment were 96 under- 
graduates at the University of Rochester who were 
randomly assigned to one of six conditions, with 8 
males and 8 females in each of the conditions.

Each subject participated for a 1-hour session dur- 
ing which he spent most of his time working on a 
puzzle called Soma. The puzzle is composed of 
seven different pieces, each of which is made to look 
like it consists of three or four 1-inch cubes. The 
pieces can be fitted together to form millions of 
configurations—only four of which were used for the 
experiment. 

During a session, the subject was asked to repro- 
duce the four configurations that had been drawn on 
paper for him. The time to complete each configura- 
tion was measured with a stopwatch, and if a sub- 
ject was unable to do a configuration within 10 min- 
utes, he was stopped and told how to do it. This let 
him know that all the configurations were possible. 

When a subject reported to the waiting room, he 
was met by the first experimenter who took him 
to the experimental room where he was seated at a 
table. The experimenter then left through a second 
door at the back of the experimental room so that 
he would be outside the room observing through a 
one-way window. The subject knew, of course, that 
the experimenter was observing him, and he commu- 
nicated with the experimenter through an intercom. 

On the table in front of the subject were the seven 
puzzle pieces—each with a number on it so that the 
experimenter could refer to it over the intercom. To 
the left of the subject was a stack of the four 
configurations that he was asked to reproduce. To 
his right were three other configurations. The top 
one of the three was a sample; the other two are 
discussed below. On another table to the subject's 
right were the microphone, the speaker, recent issues 
of three magazines (New Yorker, Time, and Play- 
boy), and an ashtray. 

When the experimenter got to his position behind 
the one-way window, he read the instructions to the 
subject. The subject was told that it was an experi- 
ment to study certain problem-solving concepts, and 
that he would be asked to solve four puzzle prob- 
lems. After the instructions were read, the experi- 
menter asked the subject to look at the sample to 
his right. He was told how it could be solved and 
was allowed 1 minute to manipulate the pieces and
reproduce it. The subject then worked on the four
puzzles in turn.

During the puzzle solving, some of the subjects 
were rewarded, and others were not. Rewards con- 
sisted both of verbal reinforcements and money. The 
independent manipulation consisted of rewarding (or 
not rewarding) each puzzle that was solved within 
the 10 minutes allotted to it. The manipulation is
explained in detail below. 

To obtain the dependent measure of motivation, 
the experimenter left his position for a period of 8 
minutes following the puzzle solving. The pretext
was as follows: When a subject had completed the 
four puzzles, the experimenter told him that he had 
completed all of the problem solving that he had to 
do, but there was one more thing that he would be 
asked to do, and that was to complete a short ques- 
tionnaire. Since it was an experiment in problem 
solving, the subject would be asked a few questions 
about the way he had solved the puzzles. However, 
there were four different sets of questions, only one of
which was the most appropriate for this subject, and
that would be determined by how he had done on
the puzzles. To select the appropriate set of questions,
data from the session would be fed into a
computer through a teletype. To do this, the experi- 
menter would have to leave for a short time, 5-10 
minutes. The subject was told that he could do any- 
thing that he liked during that time, but he was asked 
to stay in the room. The experimenter left his posi- 
tion and entered the experimental room through the 
back door and exited through the front door. He then 
climbed (noisily, so the subject could hear) a small 
set of steps outside the room and left the lab area 
through a door (noisily opened and closed) at the 
top of the stairs. 

Just after the first experimenter left the experi- 
mental room, he signaled to a second experimenter 
who went to the outside of the one-way window 
through a different door which the subject did not 
know existed.

Hence, the subject was alone in the room and was 
unaware that he was being observed, so he was free 
to work on the puzzles, read magazines, or do any- 
thing he liked. Interest was in the amount of time 
out of the 8 minutes that he spent working on the 
puzzles. It was reasoned that if he worked on the 
puzzles during this “free choice” time when he could 
do other things, then he must be intrinsically moti- 
vated to do the activity. The amount of time out 
of the 8 minutes that the subject spent working on 
the puzzle was determined by the second experi- 
menter who observed him through the one-way 
window and used a stopwatch to record the time. The 
second experimenter was blind to conditions and 
also to the hypotheses of the experiment. There is
no indication that the subjects suspected that they 
were being observed during this free choice period. 

Since any subject who was unable to do a con- 
figuration within the 10 minutes allowed was shown 
the solution, the possibility that the Zeigarnik (1927) 
effect would influence whether or not he worked on 
the puzzle in the 8-minute free choice period was
minimized.

The configurations that were in the pile to his
right under the sample were impossible ones to do.
This precluded the possibility that finishing
one of these configurations would be a
causal factor in determining whether or not a subject spent
more time working on the puzzles.

Then after 8 minutes, the first experimenter returned
to the room and asked the subjects to complete
the questionnaire.

The experimental design was a randomized three-factor
design using after-only scores. The first variable
was verbal reinforcement with two levels, reinforcement
or no reinforcement. Subjects in the
verbal reinforcement conditions were rewarded
with statements from the experimenter (e.g., “Good,
that’s the quickest that one has been done”) each
time they solved a puzzle. The same four statements
(or a subset of them if the subject did not
solve all four puzzles) were used for each subject.

The second variable was money. Subjects either
received no money, received money at the end of
the puzzle solving but before the 8-minute free
choice period, or received money after the free
choice period. A subject was given $1 in cash for
each puzzle that he solved. The third variable was
sex of the subject. The three variables were crossed,
thereby making 12 conditions, 6 for each sex.
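
The crossing of the three variables can be written out explicitly. The following Python sketch simply enumerates the cells of the design; the variable names and level labels are chosen here for illustration, and the figure of 8 subjects per cell is taken from the description of the subjects above.

```python
from itertools import product

# Level labels are illustrative wordings of the levels described in the text.
money_levels = ["no money", "money before", "money after"]
verbal_levels = ["no verbal reinforcement", "verbal reinforcement"]
sex_levels = ["female", "male"]

# Fully crossing the three variables gives one cell per combination.
cells = list(product(money_levels, verbal_levels, sex_levels))

SUBJECTS_PER_CELL = 8  # 8 males and 8 females in each of the six reward conditions
total_subjects = len(cells) * SUBJECTS_PER_CELL

for money, verbal, sex in cells:
    print(f"{money:13s} | {verbal:24s} | {sex}")
print(len(cells), "cells,", total_subjects, "subjects")
```

The count of 12 cells with 8 subjects each agrees with the 96 undergraduates reported above.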

The two money treatments (actual payment before 
the 8-minute free choice period and actual payment 
after that period) were exactly the same except for 
the point at which the cash actually changed hands. 
Both groups were told at the beginning of the experiment
that they would get $1 per puzzle solved,
and both were told how much money they had
earned after each puzzle they solved. But in the
actual-payment-before groups, they were handed the 
cash by the first experimenter as he walked through 
the room to leave for the 8-minute period, whereas 
in the actual-payment-after groups, the subjects were 
given the cash after the 8-minute free choice period. 

As mentioned in the introduction, it was felt that
if a subject actually received his money before the
8-minute free choice period he would experience
inequitable overcompensation, so he would work
longer than nonrewarded subjects. However, if he
did not receive money until after the 8-minute free
choice period, the money would initiate a process of
cognitive reevaluation, causing a decrease in intrinsic
motivation.

To summarize, each subject solved puzzles in one
of six conditions: (a) not rewarded, (b) rewarded
with money before the free choice period, (c) re- 
warded with money after the free choice period, or 
(d), (e), and (f) verbally rewarded in combination 
with one of the first three. The puzzles were thought 
to be intrinsically interesting (pilot testing substanti- 
ated this), so some subjects were performing both 
for external rewards and intrinsic motives. Subjects 
were then given 8 minutes of free choice time during 
which observations were made on the amount of 
time they spent working on the puzzles.


RESULTS


It was predicted: (a) that subjects who
were rewarded with money would be less intrinsically
motivated (i.e., the ones rewarded
at the end would spend less free choice time
working on the puzzle than the nonmoney
subjects), (b) that subjects verbally reinforced
would be more intrinsically motivated
to perform the activity than those who were
not verbally reinforced (i.e., they would spend
more free choice time working on the puzzles),
and (c) that subjects who actually received
money before the free choice period would
feel inequitably overpaid (i.e., $3 or $4 is too
much to get for about 30 minutes of working
on interesting puzzles), so although they
would be less intrinsically motivated than
subjects who got no money, they would work
on the puzzles for more of the free choice
time as a way of expending additional effort
on the task for which they were overpaid.
(d) There was also interest in the experimental
group that received both verbal and
monetary rewards, since the two would be,
according to the theory, “pulling” in opposite
directions on the person’s intrinsic motivation.
No prediction was made about this condition.
(e) Finally, since there was an equal
number of males and females in each cell,
comparisons were made between males and
females.

Table 1 presents the cell means for the 
amount of time that subjects in the 12 condi- 
tions spent working on the puzzles during the 
8-minute free choice period. The first vari- 
able was “money and timing,” with three 
levels. The prediction was that the group 
that actually received cash after the free 
choice period would spend the least amount of 
time on the puzzles, the subjects who got no
money would spend more time, and the subjects
who were given money before the free
choice period would spend the greatest amount
of time working on the puzzles. The second
variable was verbal reinforcement, and the 
prediction was that subjects who were verbally 
reinforced would spend more time than non- 
reinforced subjects. 

TABLE 1

MEAN NUMBER OF SECONDS SPENT BY SUBJECTS
ON PUZZLES IN THE 8-MINUTE FREE
CHOICE PERIOD

                 No verbal reinforcement    Verbal reinforcement
Condition        Females     Males          Females     Males
Money after      151.6       65.6           240.4       219.9
No money         292.4       124.4          142.5       197.8
Money before     346.0       248.0          384.4       392.9


Of the 12 cells, all but 2 were ordered
properly to support the predictions perfectly.
For males, the “money-after–verbal-reinforcement”
cell should have been less than the
“no-money–verbal-reinforcement” cell. However,
this reversal is very small. Also, for females,
the “no-money–verbal-reinforcement”
cell should have been greater than the
“no-money–no-verbal-reinforcement” cell.

The three hypotheses were tested using a 
3 × 2 × 2 (Money and Timing × Verbal Reinforcements
× Sex) analysis of variance on
the amount of time that subjects spent work- 
ing on the puzzles during the 8-minute free 
choice period. These results are presented in 
Table 2. 

Variable A with three levels (money after,
no money, and money before) is highly significant
(p < .005), and the ordering of the
three levels was as predicted, thereby giving
strong support to Hypotheses I and III.
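
The F ratios reported in Table 2 can be recomputed from the mean squares, since in a between-subjects analysis of variance of this kind each F is the mean square for the effect divided by the error mean square. The Python sketch below carries out that division; the dictionary keys are labels chosen here, and the mean squares are copied from Table 2.

```python
# Mean squares copied from Table 2; the keys are labels chosen for this sketch.
MS_ERROR = 41474.38  # error mean square, 84 df

mean_squares = {
    "A (money and timing)": 288304.13,
    "B (verbal)": 81550.04,
    "C (sex)": 63551.04,
    "A x B": 57706.79,
    "A x C": 289.54,
    "B x C": 104148.38,
    "A x B x C": 13398.88,
}

# Each F ratio is the effect mean square over the error mean square.
f_ratios = {source: round(ms / MS_ERROR, 2) for source, ms in mean_squares.items()}

for source, f in f_ratios.items():
    print(f"{source:22s} F = {f:.2f}")
```

The recomputed values agree with the F column of Table 2 (6.95 for money and timing, 1.97 for verbal reinforcement, 2.51 for the Verbal Reinforcement × Sex interaction).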

Variable B (verbal reinforcement) is in
the predicted direction, although it does not
reach significance (p = .16, and .08 if directionality
is considered). In order to gain
further understanding of the effects of verbal
reinforcements, male and female subjects
were considered separately. The results are
rather interesting and show a marked difference
between the two groups, as shown in
Table 3.


TABLE 2

ANALYSIS OF VARIANCE SUMMARY TABLE ON AMOUNT
OF FREE CHOICE TIME SPENT BY ALL
SUBJECTS WORKING ON PUZZLES

Source                 df    MS           F
Money & timing (A)     2     288304.13    6.95***
Verbal (B)             1     81550.04     1.97*
Sex (C)                1     63551.04     1.53
A × B                  2     57706.79     1.39
A × C                  2     289.54       .01
B × C                  1     104148.38    2.51**
A × B × C              2     13398.88     .32
Error                  84    41474.38

* p < .16.
** p < .13.
*** p < .005.


TABLE 3

MEAN NUMBER OF SECONDS SPENT WORKING ON THE
PUZZLES DURING 8-MINUTE FREE CHOICE PERIODS
(SCORES ARE COLLAPSED ACROSS THE THREE
CONDITIONS ON THE MONEY VARIABLE)

Group         No verbal reinforcement    Verbal reinforcement    Difference
Females       263.33                     255.75                  −7.58
Males         146.00                     269.41                  123.41a
Difference    117.33*                    13.67

a This difference is tested using an analysis of variance; see
Table 4.
* p < .07, two-tailed test.

Males who received no verbal reinforcement 
spent less free choice time working on the 
puzzles than did females (p < .07, two-tailed).
Then, verbal reinforcements increased
the males’ intrinsic motivation but left the 
females’ unchanged. In other words, Hypothe- 
ses I and III about money were upheld for 
males and females, but Hypothesis II was 
supported only for males. The analysis of 
variance summary in Table 4 tests the three 
hypotheses for males only. Clearly, all three 
hypotheses are substantiated. 

Variable C (sex of the subject) shows no
significant main effect (Table 2); however,
the Sex × Verbal Reinforcement interaction
approaches significance (p = .13), as would
be expected from the above analysis.

TABLE 4

ANALYSIS OF VARIANCE SUMMARY FOR NUMBER OF
SECONDS SPENT BY MALES WORKING ON THE
PUZZLES DURING THE 8-MINUTE FREE
CHOICE PERIODS

Source        df    MS          F
Money (A)     2     152822.9    4.126*
Verbal (B)    1     185008.3    4.995*
A × B         2     7827.3      .211
Error         42    37041.7

* p < .05.


TABLE 5

MEAN NUMBER OF SECONDS SPENT WORKING ON
THE PUZZLES DURING THE 8-MINUTE
FREE CHOICE PERIODS

No verbal reinforcement, no money      Verbal reinforcement, money after
208.4 seconds                          230.1 seconds


The final question of interest relates to
the group that received both verbal reinforcements
and money after the free choice period.
As mentioned, when money (given after the
free choice period) and verbal reinforcements
are administered separately, they affect intrinsic
motivation in opposite directions;
hence, there is no theoretical basis for making
a prediction. As shown in Table 5, there is
no significant difference between the control
group (no money, no verbal reinforcements) 
and the group that received both. While the 
presence of monetary rewards may serve to 
change a person’s evaluation of causality to 
that of external rewards, the verbal reinforce- 
ments tend to strengthen his own sense of 
competence and satisfaction with the activity.
Hence, when both are present, they seem to 
counteract each other, at least over the short 
run. It seems possible, however, that pro- 
longed experience with the money rewards 
would tend to decrease intrinsic motivation 
in spite of the verbal reinforcements. 
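
The two means in Table 5 can be recovered from Table 1. Assuming equal cell sizes (8 subjects per cell, as described under Method), each is the simple average of the female and male cell means for the condition in question. The Python sketch below performs that averaging; the variable names are chosen here for illustration, and the cell means are copied from Table 1.

```python
# Cell means (seconds) copied from Table 1; names are chosen for this sketch.
control_cells = {"females": 292.4, "males": 124.4}   # no money, no verbal
both_cells = {"females": 240.4, "males": 219.9}      # money after, verbal


def mean(values):
    """Unweighted mean, appropriate here because the cells are the same size."""
    values = list(values)
    return sum(values) / len(values)


control_mean = mean(control_cells.values())
both_mean = mean(both_cells.values())

print(f"control: {control_mean:.2f} s, both rewards: {both_mean:.2f} s")
print(f"difference: {both_mean - control_mean:.2f} s")
```

The averages come to about 208.4 and 230.15 seconds, in line with the 208.4 and 230.1 seconds shown in Table 5, a difference of roughly 22 seconds out of the 480-second period.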


Discussion 


The results of this experiment give strong 
support to the cognitive evaluation theory 
of the effects of external rewards on intrinsic 
motivation, although the hypothesis that 
verbal reinforcement increases intrinsic moti- 
vation received support for males only. Ap- 
parently, the effects of verbal reinforcements 
on intrinsic motivation are more complicated 
than originally hypothesized. There are at 
least two aspects to any external reward, a 
“controlling” aspect and an “information” or
“feedback” aspect. The controlling aspect
leads to a decrease in intrinsic motivation by
changing the perceived locus of causality, 
while the feedback aspect leads to an increase 
in intrinsic motivation by increasing the per- 
son’s sense of competence and self-determina- 
tion. When money is given “as an external 
reward,” the controlling aspect is clearly the 
strongest and leads to a decrease in intrinsic 
motivation. When verbal reinforcements are 
given, it was hypothesized that the verbal 
rewards would not be phenomenologically distinguishable
from the internal satisfaction;
hence, there would be an increase in intrinsic
motivation. In other words, it was suggested 
that the feedback aspect of the reward would 
be more prominent than the controlling as- 
pect. However, when one looks closely at 
the “no-money—verbal-reinforcement-females” 
cell, which was not in accord with the pre- 
dictions, it appears that there may also be a 
strong controlling aspect to verbal reinforce- 
ments under some circumstances. Let us 
look carefully at the relation between the 
experimenter and the subject in order to 
try to understand the sex differences in this 
experiment. 

The experimenter in this experiment was
a very attractive and personable male graduate
student. His interaction with the females
at the beginning of the experiment and his
communication with them during the experiment
may have served as a reinforcement
similar to the verbal reinforcements. Their
liking of the situation, due partly to their
interaction with the experimenter, may have
been a reward which, like the verbal reinforcement,
was cognitively indistinguishable
from the internal feelings of satisfaction that
they got from engaging in an intrinsically
interesting activity. Hence, the females who
received no specific verbal reinforcement still
experienced positive interpersonal reinforcement
which served to increase their intrinsic
motivation as much as the other females
who received verbal reinforcement and the
males who received verbal reinforcement (see
Table 3).

The males, on the other hand, showed a
significant difference between the verbal-reinforcement
and no-verbal-reinforcement conditions.
This would be expected, since males
would probably not find the mere interaction
with a male experimenter as rewarding.
Hence, the specific verbal reinforcements
would be expected to make a difference in
their intrinsic motivation, which in fact they did
(p < .05).

If this interpretation were correct, it would
give support to the cognitive evaluation idea,
since it would be a case of positively affective
interpersonal reinforcement (of which verbal
reinforcement is one example) strengthening
intrinsic motivation.


Now, consider the unexpected decrease in
intrinsic motivation in the no-money–verbal-reinforcement
cell for females. Although the
difference between the no-money–no-verbal-reinforcement
and the no-money–verbal-reinforcement
cells (149.9 seconds) is not
significant (p < .24), it is still substantial and
deserves some attention. It is of course possible
that this nonsignificant decrease is
merely a chance occurrence, but it is interesting
to speculate about the possible reasons
for the decrease.

As mentioned above, the attractive, personable
male graduate student giving positive
verbal rewards may have represented a very
powerful reward for female subjects. These
strong rewards, therefore, may (contrary to 
prediction) have been phenomenologically 
distinguishable from the internal satisfaction 
that the subjects received from the task itself. 
Hence, we could expect the controlling aspect 
of the rewards to have overpowered the feed- 
back aspect, leading to a decrease in intrinsic 
motivation due to a change in perception of 
locus of causality. 

This may mean that the strength (and
therefore the subject’s perception) of verbal
reinforcements determines their effects on intrinsic
motivation. Specifically, it is being suggested
that there may be a kind of inverted-U
relationship between the strength of verbal
reinforcements and intrinsic motivation. As
verbal reinforcements increase, intrinsic motivation
increases up to a point and then begins
to decrease. Of course, this is merely speculative,
and there are not really any data to
support it, but it may be responsible for the
unexpected decrease in the females’ intrinsic
motivation following verbal reinforcements. It
will now be important to conduct research on
the amount of verbal reinforcement and the
sex differences in subjects and experimenter
in order to understand fully the effects of
positive feedback and verbal reinforcement on
intrinsic motivation.

The general findings of this study and the
Deci (1971) studies suggest that one who is
interested in developing and enhancing intrinsic
motivation in children, employees, students,
etc., should not concentrate on external-control
systems such as monetary rewards,
which are linked directly to performance, but, 
rather, he should concentrate on structuring 
situations that are intrinsically interesting and 
then be interpersonally supportive and re- 
warding toward the persons in the situation. 
While large payments can lead to increased 
performance due to feelings of inequity, these 
payments will, however, be making the people 
dependent on the money, thereby decreasing 
their intrinsic motivation. 


ROLE OF CHOICE AND OUTCOME ON FEELINGS OF
SUCCESS AND ESTIMATES OF ABILITY 1


JAMES E. R. LUGINBUHL 2


North Carolina State University


An experiment was conducted to investigate one possible determinant of feelings
of success and failure. It was hypothesized that having a choice of tasks,
as opposed to receiving a task randomly, would lower one’s feelings of success
and competence, irrespective of whether one succeeded or failed at the task. A
secondary hypothesis stated that one’s feeling of “originship” would also be
reduced. The first hypothesis received strong confirmation, while the second
must still be regarded as tentative.


1 This study is based on a doctoral dissertation
submitted to the University of North Carolina at
Chapel Hill in partial fulfillment of the requirements
for the PhD. The author is indebted to his dissertation
chairman, John Schopler, for guidance and
inspiration. Appreciation is also extended to Robert
Frey for statistical assistance.

2 Requests for reprints should be sent to James E.
R. Luginbuhl, Department of Psychology, School of
Education, North Carolina State University, Raleigh,
North Carolina 27607.


The variable of choice has been investigated
in numerous contexts. In dissonance research,
considerable attention has focused on the role
of choice in counterattitudinal advocacy studies
(Bramel, 1969) and on the role played by
choice in eliciting results consistent with
reinforcement versus dissonance frameworks
(e.g., Linder, Cooper, & Jones, 1967; Steiner,
1970).

In the area of attribution theory, choice also
plays an important role. Jones and Harris
(1967) found that attributions made about a
person who expressed an unpopular or an
atypical opinion were affected by whether or
not the observer thought a prior choice had
been possible. More extreme attributions were
made when the observer thought the person
expressing the opinion had chosen to do so, as
opposed to being forced to do so.

These investigations highlight the fact that
choice has been linked with the attribution of
intent and has been shown to be an important
variable in the assignment of responsibility.
The observer, or the actor, asks the question,
“Was this particular behavior performed voluntarily
or was it, instead, required?” The
answer to this question provides the basic
datum from which dispositional and intentional
states are inferred.

This leads to the reasonable hypothesis that 
having a choice among alternatives might 
result in a feeling of being a cause, of the in- 
dividual feeling himself to be the source re- 
sponsible for a particular outcome. deCharms 
(1968) uses the term “origin” to describe the 
person who feels that he is the personal cause 
of an event and the term “originship” to de- 
scribe that feeling. Research by deCharms and 
by Kuperman (cited in deCharms, 1968) in- 
dicated that persons who built a model sub- 
stantially on their own valued it more highly 
than those who built a model for which every 
move was specified in advance. 

However, having a choice may affect one’s 
feelings of success and competence (and thus 
originship) in another manner. A consequence 
of having a choice may be a feeling of having 
influenced your fate, of exercising control over 
your outcomes. If, for example, a person suc- 
ceeds at a problem which he chose from a 
number of problems, his feeling of success may 
be blunted by the knowledge that he influ- 
enced the situation so as to make success more 
probable. Success at a problem for which no 
choice was possible, however, can only be 
attributed to one’s own ability, since selec- 
tion of the task was not under one’s control. 
Similarly, failure at a problem one has chosen 
may result in greater feelings of inadequacy 
than failure at a nonchosen problem—people 
generally do not choose to do something at 
which they expect to fail. 

The expectation is, then, that the oppor- 
tunity to choose a task may lower the ap- 
praisal of one’s own ability, regardless of the 
performance on the task. Although the net 
effect of having a choice should be the same 
for both success and failure, the attributional 
processes would appear to differ. When one 
succeeds, the desire may be to attribute the 
success, internally, to task-relevant abilities, 
but having had a prior choice makes this 
attribution more difficult. When one fails, the 
desire may be to attribute the failure ex- 
ternally—to blame it on factors beyond one’s 
control—and again, a prior choice makes this 
attribution more difficult. 

There would seem to be two components in 
making a choice, which are intertwined psy- 
chologically but are separable experimentally. 
The first deals with what is chosen. In a situ- 
ation where a person feels some evaluation 
apprehension, he will probably choose the task 
from a group of tasks on which he expects to 
perform the best. This might be called the 
“outcome maximization” component. The sec- 
ond component deals with the act of choosing 
itself and might be called the “outcome in- 
fluence” component. Due to the fact that he 
has selected from among alternatives, the 
person who chooses may feel that he has in- 
fluenced the final outcome, irrespective of 
which alternative he selected. Since these are 
two components of the same act, making a 
choice, it is expected that their effects will be 
similar, that is, to reduce overall estimates of 
ability. Success on an easy problem is not as 
rewarding as success on a difficult problem. 
Likewise, having brought one’s influence to 
bear on the success-producing situation may 
cause one to feel that success was a function, 
not so much of ability, but of having influ- 
enced the situation in one’s favor. 

An experiment was designed in which sub- 
jects experienced one of two outcomes (suc- 
cess or failure) and received a task in one of 
three ways. Some chose from among three 
tasks, while others thought that they had been 
randomly assigned to one of three tasks. Those 
who received a task “randomly” actually re- 
ceived either their first preference or their 
last preference. 

In addition to the expectation that sub- 
jects who succeeded would feel more success- 
ful and competent than those who failed, 
there were two major hypotheses: (a) Hav- 
ing a choice of tasks should reduce one’s esti- 
mate of his ability, and (b) receiving one’s 
most preferred task (randomly) should reduce 
one’s estimate of his ability. 

Thus, holding success and failure constant 
across treatment conditions, it is proposed 
that self-attributed ability will be greatest 
following performance of a randomly assigned 
difficult task, next greatest following perform- 
ance of a randomly assigned easy task, and 
least following performance of a 
chosen (presumably easy) task. 


METHOD 
Subjects 


Subjects were 68 male undergraduate students 
enrolled in introductory psychology at the University 
of North Carolina at Chapel Hill. Of this original 
sample, 1 was eliminated due to experimenter error, 
2 for succeeding at their task in the failure condi- 
tion, and 5 for incorrectly filling out their question- 
naire. This left 60 subjects, 10 in each cell. 


Procedure 


Subjects were scheduled in pairs but were run 
singly if the other member of the pair failed to 
arrive. Once seated at separate tables, each subject 
read an introduction describing a competence test 
(presented as a combination aptitude and IQ test) 
which was being developed at several universities. 
This particular university had been entrusted with 
developing measures of complex analytic skill, and 
the subject was being asked to help in the final 
validation of two of these measures. The first con- 
sisted of three subtests described as (a) a number 
problem, (b) a verbal reasoning problem, and (c) a 
concept-formation problem. After reading brief 
descriptions of the three problems, each subject was 
asked to rate the three problems in terms of how 
well he thought he could do on each of them. Then, 
some subjects chose which problem they would solve, 
while others were “randomly” assigned one of the 
three problems. 

Subjects handed in their work after 15 minutes 
and were given preplanned information as to whether 
they had done well or poorly, after which they 
completed several dependent measures. Next, the 
subjects read preliminary instructions for the second 
measure of analytic skill. The experiment was termi- 
nated when the subject answered two dependent 
measures at the end of these preliminary instructions. 


Independent Variables 


Problem selection. After reading the descriptions 
and rating the three problems, one group of subjects 
was asked to choose which problem they wanted to 
work on, and they subsequently received that 
problem. This group constituted the choice condition. 

Two other groups were told that they would be 
randomly assigned a problem. The experimenter held 
three folders, each ostensibly containing one of the 
three problems, in front of the subject. He then 
tossed a die and gave the subject the folder whose 
number corresponded to that on the die. For neither 
group of subjects, however, did the folders contain 
all three problems. The experimenter had been able 
to quickly scan the subject’s ratings before appearing 
with the folders. Thus, for one group, the folders all 
contained the problem that they thought they would 
do the best on (high random), while, for the other 
group, the folders each contained the problem on 
which they thought they would do the worst (low 
random). 

To restate the selection manipulation, one-third of 
the subjects chose their problem, while the other 
two-thirds believed that they had been randomly 
assigned a problem. In fact, half of the latter group 
received the problem they expected to do the best 
on, while the other half received the one on which 
they felt they would do the worst. 
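
The selection manipulation can be summarized procedurally. The following Python sketch is illustrative only: the function name, the condition labels, the example ratings, and the rule that maps a six-sided die onto three folders are all assumptions, since the report does not specify them. It shows why the die toss has no bearing on the problem received in the two "random" conditions.

```python
from typing import Dict, Optional


def assign_problem(ratings: Dict[str, int], condition: str,
                   die_roll: int = 1, chosen: Optional[str] = None) -> str:
    """Return the problem a subject ends up working on.

    `ratings` maps each problem to the subject's own rating of how well
    he expects to do on it (higher = better). All names are hypothetical.
    """
    if condition == "choice":
        if chosen not in ratings:
            raise ValueError("a choice subject must pick one of the rated problems")
        return chosen
    if condition == "high_random":
        target = max(ratings, key=ratings.get)  # problem expected to go best
    elif condition == "low_random":
        target = min(ratings, key=ratings.get)  # problem expected to go worst
    else:
        raise ValueError(f"unknown condition: {condition}")
    # All three folders hold the same problem, so the die cannot matter.
    folders = [target] * 3
    return folders[(die_roll - 1) % 3]  # assumed die-to-folder mapping


example_ratings = {"number": 8, "verbal": 5, "concept": 3}
```

Calling `assign_problem(example_ratings, "high_random", die_roll=d)` for any `d` from 1 to 6 returns the same problem, which is the point of the deception.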

Performance. Accompanying each problem was a 
set of 10 true-false questions, and the subjects were 
told that their main objective in taking the test was 
to answer the true-false questions correctly. Solving 
the problem would help, but the real test was 
whether or not they could correctly answer the 10 
questions.3 When their material was collected at the 
end of 15 minutes, the experimenter marked nine of 
the answers correct for half of the subjects (success) 
and only five correct for the other half (failure). 
Both groups received information that the average 
performance on their problem had been seven cor- 
rect, and that 2 points above or below that score 
was significantly better or worse than average. It 
was also stressed, however, that they were simply 
being given their scores for their own information, 
and that they were not being tested themselves (they 
were helping to validate only a small portion of the 
competence test). 

In the time allotted it was virtually impossible to 
solve the problems. For those subjects in the failure 
condition, this was desirable. However, the problems 
were made easier in the success condition so that do- 
ing well on the true-false questions would not ap- 
pear too inconsistent with their performance on the 
problem. 


Dependent Variables 


After being advised of their performance, the sub- 
jects completed a questionnaire containing items 


3 True-false questions were used to make the 
manipulation of success and failure more believable. 
If the subjects’ only task had been to work on a 
problem, it would have been difficult to convince 
those who solved it that they had done poorly and 
those who made no headway that they had done 
well. Making the evaluation of their performance 
contingent upon their correctly answering ambigu- 
ous true-false questions substantially eliminated 
this problem. 
regarding how successful they felt, how confident 
they were of their analytic skill, and how they felt 
they might do on the entire competence test. It was 
also ascertained whether the subjects thought they 
had had enough time and how clear the instructions 
were. 

An attempt was made to obtain a behavioral mea- 
sure of the subjects’ confidence in their ability by 
telling them that there was to be a future investiga- 
tion of complex analytic skill unrelated to the present 
investigation, and that volunteers were needed who 
would donate 3 hours of their time and subject 
themselves to detailed questioning by a psychologist. 
Subjects were asked to sign their names on a form 
sheet and indicate whether they could definitely 
or only possibly donate the required time. If they 
felt that they would be unable to participate at all, 
they were asked not to sign their name. 

Next, the subjects were given the preliminary in- 
structions for the second measure of analytic skill. 
These stated that in order to understand analytic 
ability more fully, it was necessary to investigate 
how people responded to clues or suggestions on how 
to solve problems—how they incorporated this addi- 
tional information into their problem-solving strat- 
egy. They were told that they would work on a 
problem related to, although not identical with, the 
first problem. Furthermore, they were to have the 
opportunity to choose from among four methods of 
solving the problem, ranging from Variation 1, which 
contained no clues, to Variation 4, which contained 
“5 or 6” clues. They were also asked to briefly ex- 
plain the reason for their choice. Upon obtaining 
this final measure, the experiment was terminated, its 
purpose was explained, and all questions were fully 
answered. 


The Problems 


The number problem consisted of a 4 × 4 “magic 
squares” problem, with 16 squares numbered 1-16. 
The object was to rearrange the numbered squares 
so that each row, each column, and the two diagonals 
summed to 34. The verbal reasoning problem and the 
concept-formation problem were, in fact, the same. 
It was stated that there were five different colored 
houses, each occupied by men of different nationali- 
ties, with different pets, drinks, and color of hair. A 
series of statements was then presented, such as, “The 
man in the green house lives next to the man with 
the fox.” The objective was to ascertain who owned 
the zebra and who drank water. Pretesting indicated 
that this problem sufficiently resembled the descrip- 
tions of both a verbal reasoning and a concept- 
formation problem that it was used in both cases. 
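
The success criterion for the number problem can be stated exactly. The sketch below is an illustration rather than part of the original materials: the function names are hypothetical, and the solved arrangement shown is a well-known 4 × 4 magic square, not necessarily one that any subject produced. For an n × n square holding the numbers 1 through n², every line must sum to n(n² + 1)/2, which is 34 when n = 4.

```python
from typing import List


def magic_constant(n: int) -> int:
    """Line sum required of an n x n square holding 1..n*n."""
    return n * (n * n + 1) // 2


def is_magic_square(grid: List[List[int]]) -> bool:
    """True if every row, every column, and both diagonals sum to the constant."""
    n = len(grid)
    if any(len(row) != n for row in grid):
        return False
    # The squares must be exactly the numbers 1..n*n, each used once.
    if sorted(v for row in grid for v in row) != list(range(1, n * n + 1)):
        return False
    target = magic_constant(n)
    rows_ok = all(sum(row) == target for row in grid)
    cols_ok = all(sum(grid[r][c] for r in range(n)) == target for c in range(n))
    diag_ok = sum(grid[i][i] for i in range(n)) == target
    anti_ok = sum(grid[i][n - 1 - i] for i in range(n)) == target
    return rows_ok and cols_ok and diag_ok and anti_ok


# Starting layout assumed for illustration: squares in natural order.
unsolved = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]

# One classical solution.
solved = [[16, 3, 2, 13], [5, 10, 11, 8], [9, 6, 7, 12], [4, 15, 14, 1]]
```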


RESULTS 


The means for the performance and selec- 
tion variables are presented in Table 1, and 
the results of a multivariate analysis of vari- 
ance are presented in Table 2. The multi- 
variate F for the performance manipulation 


TABLE 1 


RELEVANT MEANS FOR THE PERFORMANCE 
AND METHOD VARIABLES 


               |    Performance    |             Selection 
Variable       | Success | Failure | Choice | High random | Low random 
Success        |  8.600  |  3.133  |  5.20  |    6.00     |    6.40 
Analytic skill |  7.967  |  5.000  |  5.65  |    6.85     |    6.95 
Competence     |  8.400  |  5.533  |  6.40  |    7.05     |    7.45 
Variation      |  1.933  |  2.433  |  2.15  |    2.05     |    2.35 
Reason         |  6.567  |  4.700  |  5.70  |    6.05     |    5.10 
Importance     |  7.433  |  6.667  |  6.65  |    8.10     |    6.40 


was significant. Subjects were asked to rate, 
on an 11-point scale, how successful they 
thought they were in answering the true-false 
questions. The mean for the subjects in the 
success condition was 8.60, while the mean 
for those in the failure condition was 3.13— 
a difference significant beyond the .001 level. 
Manipulation of performance was therefore 
successfully accomplished. 
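
As a consistency check on the reported figures, the between-groups mean square for the performance effect on the success rating can be recomputed from the two condition means. The sketch below assumes equal groups of 30 subjects per performance condition (60 subjects, 10 per cell, three selection cells per condition); the function name is hypothetical. The error mean square is not legible in the scan, so the value computed here is only what the listed mean square (448.266) and F (110.430) jointly imply.

```python
from typing import Sequence


def between_groups_ms(means: Sequence[float], n_per_group: int) -> float:
    """Between-groups mean square for equal-sized groups."""
    grand_mean = sum(means) / len(means)
    sum_of_squares = n_per_group * sum((m - grand_mean) ** 2 for m in means)
    return sum_of_squares / (len(means) - 1)


# Success-rating means for the success and failure conditions (Table 1).
ms_performance = between_groups_ms([8.600, 3.133], n_per_group=30)

# Error mean square implied by the reported MS and F for this effect.
implied_error_ms = 448.266 / 110.430
```

The recomputed value agrees with the listed mean square to within the rounding of the means.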

There were several other variables on which 
the performance manipulation had the in- 
tended effect. Subjects were asked how con- 
fident they were in their analytic skill and 


TABLE 2 


SUMMARY TABLE OF THE MULTIVARIATE ANALYSIS OF 
VARIANCE OF THE RELEVANT 
DEPENDENT MEASURES 


                  |       Success        |   Analytic skill 
Source            |   MS    |     F      |   MS    |    F 
Performance (A)a  | 448.266 | 110.430*** | 132.015 | 43.074 
Selection (B)     |   7.407 |     -      |  10.467 |  3.145** 
A × B             |   4.267 |     -      |   1.267 |    - 

                  |      Competence      |     Variation 
Source            |   MS    |     F      |   MS    |    F 
A                 | 123.266 |  51.781*** |   3.750 |  5.127** 
B                 |   5.617 |   2.633*   |    .467 |    - 
A × B             |    .117 |     -      |   1.400 |    - 

                  |        Reason 
Source            |   MS    |     F 
A                 |  52.266 |   8.532 
B                 | [illegible] |  - 
A × B             | [illegible] 

a Multivariate F significant at p < .001. 
* p < .09, df = 1/54. 
** p < .05, df = [illegible]. 
*** p < .01, df = 1/54. 
how well they thought they might do if they 
took the entire competence test. The success 
subjects felt significantly more confident of 
their analytic skill (p < .001), and they felt 
that they would do significantly better on the 
overall competence test (p < .001) than did 
the failure subjects. 

In addition, there was a significant tend- 
ency (p < .028) for the success subjects to 
choose a variation with fewer clues (i.e., to 
indicate a desire for originship) than the 
failure subjects. The reasons given for 
choosing a particular variation were coded on 
a 9-point scale along a dimension anchored 
at one end by feelings of challenge and 
originship and at the other end by a desire 
for expediency in quickly obtaining an an- 
swer to the problem. Performance had a sig- 
nificant effect (p < .005) on the reasons 
given, with success subjects giving more 
originlike reasons and failure subjects ad- 
vancing more nonoriginlike reasons. The 
performance manipulation had a significant 
effect on all of the intended measures but did 
not affect how important the subjects thought 
it was to do well on the problems. This indi- 
cates that the subjects selectively responded 
to success or failure experiences and did not 
allow their reactions to spread into unrelated 
areas. 

The results of the selection manipulation 
may now be reported. Crucial to Hypothesis a 
was the assumption that in the present situa- 
tion, the subjects would choose to work on 
the problem that they considered the easiest. 
All subjects rank ordered the three problems 
in terms of perceived difficulty, and, of the 
20 subjects who were given a choice, 16 
(80%) chose to work on the problem on 
which they thought they would do the best. 
The remaining 4 chose the problem of inter- 
mediate difficulty. Thus, the assumption was 
strongly supported. 

Although the multivariate F for the selec- 
tion manipulation was not significant, trends 
can be noted with respect to three dependent 
variables. There was a main effect of selec- 
tion on confidence in analytic skill (p < .04), 
as well as trends in the predicted direction 
on competence (p < .09) and on feelings of 
success (p < .17). Inspection of the cell 
means reveals that in all cases feelings of 
general adequacy were lowest in the choice 
condition, next lowest in the high random 
condition, and highest in the low random 
condition. This same pattern held when the 
results were broken down into success and 
failure groups (see Table 3), with the excep- 
tion of one slight and nonsignificant reversal 
on analytic skill within the success condition. 

A second multivariate analysis was under- 
taken in which special contrasts were per- 
formed between various combinations of the 
selection variable, the two most important 
being those between the choice and high 
random and between the high random and 
low random conditions. While the multivariate 
F was nonsignificant, there was a significant 
univariate F for analytic skill on the contrast 
between the choice and the high random 
conditions, suggesting that having a choice 
lowered one’s confidence in his analytic skill 
more than receiving his first preference ran- 
domly. There were no other significant uni- 
variate Fs. 

Two trend analyses on the three dependent 
measures related to an overall estimate of 
ability (success, analytic skill, and compe- 
tence) were carried out over the three levels 
of the selection variable (choice, high random, 
and low random). Trend Analysis A used a 
metric of equally spaced intervals (3, 2, 1) 
based on the original hypothesis of the equal 
importance of choosing versus randomly re- 
ceiving one’s first preference. Since the one 
significant special contrast suggested that the 
effect of choice might be greater than that 
of receiving one’s first preference, a post hoc 
trend analysis, B, was performed, using a 
stretched metric of 4, 2, 1. 
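
The difference between the two metrics can be made concrete by applying each to the selection means in Table 1. The sketch below is illustrative and rests on an assumption: the report gives only the raw metrics, so the contrast weights used here are obtained by subtracting each metric's mean, which is the usual way to turn a metric into a contrast. The function and variable names are hypothetical, and no F ratios are computed because the error terms are not available.

```python
from typing import Dict, List, Sequence, Tuple

# Selection means from Table 1, ordered choice, high random, low random.
SELECTION_MEANS: Dict[str, Tuple[float, float, float]] = {
    "success": (5.20, 6.00, 6.40),
    "analytic_skill": (5.65, 6.85, 6.95),
    "competence": (6.40, 7.05, 7.45),
}

METRIC_A = (3, 2, 1)  # equally spaced intervals
METRIC_B = (4, 2, 1)  # stretched metric, extra weight on choice


def centered(metric: Sequence[float]) -> List[float]:
    """Contrast weights: the metric with its mean removed, so they sum to zero."""
    mean = sum(metric) / len(metric)
    return [m - mean for m in metric]


def contrast(means: Sequence[float], metric: Sequence[float]) -> float:
    """Weighted sum of condition means under the centered metric."""
    return sum(w * m for w, m in zip(centered(metric), means))


contrasts_a = {name: contrast(m, METRIC_A) for name, m in SELECTION_MEANS.items()}
contrasts_b = {name: contrast(m, METRIC_B) for name, m in SELECTION_MEANS.items()}
```

Under both metrics every contrast is negative, meaning that ratings fall as the metric value rises from low random to choice. The stretched metric simply gives the choice condition more leverage.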
The results of these two trend analyses are 
presented in Table 4, where it is seen that 
the multivariate Fs for the two analyses are 
quite similar (multivariate F for Trend Analy- 
sis A = 2.514, p < .069; multivariate F for 
Trend Analysis B = 2.702, p < .055). While 
the fit employing the stretched metric was the 
more accurate of the two, the difference in fit 
between the two metrics was slight and did 
not approach significance. These marginally 
significant trend analyses uphold the previous 
findings that general ratings of ability were 
TABLE 3 


RELEVANT MEANS FOR THE SELECTION VARIABLE 
WITHIN THE SUCCESS AND FAILURE CONDITIONS 


Condition | Variable         | Choice | High random | Low random 
Success   | Success          |  7.4   |    9.0      |    9.4 
Success   | Analytic ability |  7.1   |    8.6      |    8.2 
Success   | Competence       |  7.9   |    8.5      |    8.8 
Failure   | Success          |  3.0   |    3.0      |    3.4 
Failure   | Analytic ability |  4.2   |    5.1      |    5.7 
Failure   | Competence       |  4.9   |    5.6      |    6.1 
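
The cell means in Table 3 can be checked against the marginal means in Table 1, because every cell contains 10 subjects and each marginal is therefore a simple average of cells. The sketch below takes the two low random entries for analytic ability as 8.2 (success) and 5.7 (failure), the values implied by the Table 1 marginals; this reading is an assumption, since the scan is damaged at those positions. The names are hypothetical.

```python
from typing import Dict, Tuple

# (performance condition, variable) -> means for choice, high random, low random.
CELL_MEANS: Dict[Tuple[str, str], Tuple[float, float, float]] = {
    ("success", "success"): (7.4, 9.0, 9.4),
    ("success", "analytic"): (7.1, 8.6, 8.2),   # 8.2 assumed from Table 1
    ("success", "competence"): (7.9, 8.5, 8.8),
    ("failure", "success"): (3.0, 3.0, 3.4),
    ("failure", "analytic"): (4.2, 5.1, 5.7),   # 5.7 assumed from Table 1
    ("failure", "competence"): (4.9, 5.6, 6.1),
}


def performance_marginal(condition: str, variable: str) -> float:
    """Mean over the three selection cells within one performance condition."""
    cells = CELL_MEANS[(condition, variable)]
    return sum(cells) / len(cells)


def selection_marginals(variable: str) -> Tuple[float, ...]:
    """Mean over success and failure cells for each selection condition."""
    success_cells = CELL_MEANS[("success", variable)]
    failure_cells = CELL_MEANS[("failure", variable)]
    return tuple((s + f) / 2 for s, f in zip(success_cells, failure_cells))
```

With these entries the marginals reproduce Table 1 to rounding, and the single reversal mentioned in the text appears where expected: analytic ability in the success condition is 8.6 for high random but 8.2 for low random.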


highest in the low random condition, inter- 
mediate in the high random condition, and 
lowest in the choice condition. 


An interesting pattern emerges when one 
considers the variation that individual sub- 
jects chose with the reasons they advanced 
for choosing it. As indicated previously, the 
reasons were categorized along a 9-point scale, 
ranging from originship to expediency. There 
was no significant difference between the 
number of success versus failure subjects who 
chose an originlike variation, or in the num- 
ber of success or failure subjects who chose 
a nonoriginlike variation. However, almost 
75% of the success subjects gave originlike 
reasons for origin choices, while originlike 
reasons were advanced by only 53% of the 
failure subjects. On the other hand, only about 
17% of the success subjects gave nonorigin 
reasons for a nonorigin choice, while about 
72% of the failure subjects gave nonorigin 
reasons. Thus, success subjects tended to ad- 
vance originlike reasons for their choice of 
variation, regardless of the nature of the vari- 
ation they chose; failure subjects tended to 
always advance nonoriginlike reasons. 


TABLE 4 


TREND ANALYSES FOR THE SELECTION VARIABLE OVER 
SUCCESS, ANALYTIC SKILL, AND COMPETENCE 


               | Trend Analysis Aa | Trend Analysis Bb 
Source         |    F    |    p    |    F    |    p 
Success        |  3.547  |  .065   |  3.679  |  .060 
Analytic skill |  5.514  |  .023   |  6.364  |  .015 
Competence     |  5.168  |  .027   |  5.251  |  .026 


a Multivariate F = 2.514, p < .069; metric 
b Multivariate F = 2.702, p < .055; metric 
The other behavioral measure, that of ask- 
ing subjects whether they would be able to 
donate 3 hours for future testing, was un- 
successful. Two factors probably contributed 
to the failure of this measure. First, the 
experiment was conducted toward the end of 
the semester, and many subjects specifically 
stated that they did not have the time to 
contribute extra hours. Second, the middle 
category, “Possibly I will have time to par- 
ticipate,” appeared to appeal both to those 
subjects who wanted to participate and who 
hoped they would find the time, as well as to 
those who had no desire to participate but 
who felt compelled to at least sign their 
names, while hoping they would never be 
called. 


Discussion 


The predictions were generally well sup- 
ported. The experience of success resulted in 
greater feelings of ability and in a greater 
desire to be an origin than did the experience 
of failure. Furthermore, the self-attributional 
analysis of the subjects’ perceptions of their 
ability was supported. Those subjects who 
saw themselves as influencing the selection of 
their task (those who had a choice) did not 
feel as successful as those who saw them- 
selves as exerting no influence. In addition, 
those subjects who received the task that they 
expected to do the best on did not feel as 
successful or competent as those who received 
a task that they expected to do poorly on. 

Since the performance manipulation had 
the predicted effect on all dependent variables, 
as did the selection manipulation on the mea- 
sures regarding feelings of adequacy, the 
question arises as to why the selection ma- 
nipulation failed to have any influence on the 
measures dealing with originship (variation 
and reason). When success was directly ma- 
nipulated, the more successful and competent 
a subject felt, the more like an origin he 
acted. However, when increased feelings of 
success were indirectly brought about (by 
means of the selection manipulation), this 
relationship disappeared. One explanation 
may lie in the fact that the difference in feel- 
ings of success and competence for the success 
and failure subjects was much greater than 
that between the choice, high random, and 
low random subjects. Had the selection vari- 
able exerted a more powerful effect, its rela- 
tionship to the variation chosen might have 
been similar to that of the performance 
variable. 

Another factor could involve the different 
nature of the two manipulations. The per- 
formance manipulation bore no clear relation- 
ship to the act of being an origin. On the 
other hand, manipulation of selection resulted 
in some subjects being origins right at the 
beginning of the experiment, while specifi- 
cally preventing others from being so. The 
prediction that subjects who had a choice 
would act least like origins later in the experi- 
ment may have been offset by their having 
been origins in the beginning (by making a 
choice) and wanting to remain so. Subjects 
who were “randomly” assigned to a problem, 
especially those who subsequently “failed,” 
may not have felt too badly about failing, 
but they also may not have had any strong 
desire to invite failure a second time by in- 
sisting on trying to solve the second problem 
with no assistance. Thus, the early “origin- 
relevant” experiences of the subjects may 
have confounded the effect of the selection 
variable. 

The first choice of some subjects differed 
greatly from their last choice, while for other 
subjects there was virtually no difference be- 
tween their first and last choices. A more 
potent manipulation of the subjects’ expecta- 
tions could be achieved by experimentally 
varying performance expectations on three 
different tasks. Then subjects who succeeded 
on their most preferred alternative would 
clearly be aware that they should have suc- 
ceeded anyway, while those who succeeded on 
their least preferred alternative would know 
that they had succeeded where the probability 
was high that they would fail. The reverse 
would hold in the case of failure.4 

4 A problem with such a manipulation, however, 
is that as the discrepancy between expectation and 
outcome increases, the validity of the outcome is 
more likely to be questioned by the subject. Where 
subjective probability of success is very low, subse- 
quent success may be attributed to luck. Where sub- 
jective probability of success is very high, subse- 
quent failure may bring into question the honesty of 
the experimenter. 


An issue left unresolved by the present ex- 
periment concerns the relationship between 
feelings of satisfaction and competence and 
an internal versus external self-attribution for 
the outcome. One would expect that an in- 
crease in feelings of success and competence 
would indicate a greater internal attribution 
for one’s performance. The present experiment 
provides no data relevant to this issue. If, 
however, one classifies outcomes in terms of 
those that were expected versus those that 
were unexpected (e.g., success on a diffi- 
cult problem is unexpected; failure on a 
difficult problem is expected), data from 
other experiments can be brought to bear 
on this question. Weiner and Kukla (1970) 
found that an observer attributed anoth- 
er’s unexpected success and failure inter- 
nally, while attributing expected success 
or failure externally. On the other hand, 
Feather and Simon (1971) and Feather 
(1969) found that one’s own unexpected suc- 
cess and failure were attributed more exter- 
nally than were expected success and failure. 
An added complexity is that subjects in 
Feather’s (1969) experiment who were ini- 
tially less confident in their ability to do well 
were the most satisfied if they succeeded, 
whereas the ones who were most confident 
were the least satisfied if they failed. In other 
words, unexpected success led to the greatest 
satisfaction and unexpected failure to the 
least satisfaction. Intuitively, one would ex- 
pect satisfaction and internal attribution to 
be positively related. 

The results of this investigation do, how- 
ever, highlight the sharp contrast between the 
prevalent belief that a person should be able 
to choose the behavior in which he wishes 
to engage and the results of following that 
policy. Subjects who had a choice tended to 
feel less successful and competent, irrespec- 
tive of whether they succeeded or failed, than 
subjects who had no choice. 

A practical implication of these findings 
comes to mind. Assuming that it is desirable 
to foster feelings of success and competence, 
how is this best accomplished? In the case 
of children, for example, the results suggest 
that one should not permit the child always 
to have his own way, to choose what he 
thinks will be the easiest or most pleasant task. 
But, in addition, the parents should order 
tasks according to difficulty for the child and 
then assign tasks of moderate difficulty 
which they feel the child can master with 
some effort. 
INFORMATION PROCESSING, SITUATION SPECIFICITY, AND 
THE GENERALITY OF RISK-TAKING BEHAVIOR 1 


PAUL SLOVIC 2 


Oregon Research Institute 


An effort was made to construct two structurally similar risk-taking tasks in 
order to evaluate intertask consistency of individual differences. Only the mode 
of response differed between tasks. In one task, subjects chose their preferred 
bet within each of a number of pairs of bets. In the other, they set selling 
prices for these same bets. A measure of the subject’s preference for “long-shot” 
gambles was obtained from each response. Reliable individual differences were 
found for each measure. However, the intermeasure correlation was relatively 
low considering the high degree of similarity between tasks. It is argued that 
the two response modes triggered different methods of processing information 
about probabilities and payoffs in a way that perturbed individual differences 
and reduced intertask consistency. Information-processing considerations may 
be one important component of the situation specificity prevalent in risk-taking 
behavior. These results imply that high correlations are unlikely between risk- 
taking measures in structurally different settings or between risk taking and 
other behaviors. 


Kogan and Wallach (1967) posed a ques- 
tion fundamental to our understanding of 
risk-taking behavior: “How important are 
individual consistencies relative to situational 
constraints when it comes to predicting some- 
one’s inclination for risk or conservatism? 
[p. 208].” 

Slovic (1962) found little correlation 
among several different kinds of risk-taking 
measures administered to the same subjects. 
This result implied that most of the variation 
among individuals was situationally deter- 
mined, and it casts doubt upon the existence 
of a general trait of risk-taking propensity. 
Since then, this failure to find transsituational 
generality for individual differences in risk 
taking has been replicated in studies by 
Brichácek (1968), Flanders (1970), Good- 
man (1970), Greene (1963, 1964), Higbee 
(1971), Johnson (1963), Weinstein (1969), 
and Weinstein and Martin (1969). 

Some investigators, such as Cohen and 
Christianson (1970), have reacted to this 
negative experimental evidence with disbe- 
lief, arguing that it flies in the face of per- 
sonal experience that attests to the coherence 
of individual behavior. Others have criticized 
these studies in more specific ways in an at- 
tempt to explain why they failed to find the 
expected levels of generality. One criticism is 
that they have failed to include a proper 
sampling of risk-taking behaviors. However, 
the studies cited above have employed a re- 
markably thorough assortment of risk-taking 
tasks in problem-solving, athletic, social, voca- 
tional, and pure gambling situations. Only 
those tasks highly similar in structure and 
involving the same sorts of payoffs (e.g., all 
financial, all social, etc.) have shown any 
generality (e.g., Slakter, 1969), and, as simi- 
larity decreases, these intertask correlations 
rapidly decrease (Kogan & Wallach, 1964; 
Weinstein, 1969). 

Researchers such as Flanders (1970), Ko- 
gan and Wallach (1964), and Slovic (1964) 
have hypothesized that more generality would 
exist among tasks which provided real, rather 
than hypothetical, payoffs. Higbee (1971) has 
proposed just the opposite—that imagined or 
fantasized risk taking may be more general 
than actual behavior. To date, neither of 
these hypotheses has received empirical sup- 
port. 

Slovic (1964) hypothesized that the sub- 
jective aspects of risk taking might explain 
the lack of generality among observed be- 
haviors for tasks in which subjective proba- 
bilities and subjective values (utilities) vary 
widely between individuals. That is, persons 
may act consistently with regard to their sub- 
jective perceptions of the risks involved, even 
though their overt behavior appears to be 
variable. If this hypothesis were valid, re- 
searchers would have to scale subjective prob- 
abilities and utilities in order to allow the 
orderliness of risk-taking behavior maximum 
opportunity to emerge. Pitz and Reinhold 
(1968) and Weinstein (1969) did just this, 
but neither study found much transsituational 
generality even among subjective risk-taking 
levels. 

While there are only a few exceptions to 
this negative trend, they are worth noting. 
Maehr and Videbeck (1968) obtained a bi- 
serial correlation of .62 between subjects’ bet 
preferences and their scores on a risk-taking 
questionnaire. Kogan and Wallach (1964) 
found little generality across a wide variety 
of risk-taking measures for their sample as a 
whole. However, they found that certain sub- 
groups of individuals, specifically those high 
on defensiveness and anxiety, reacted to di- 
verse tasks in terms of their risk-taking impli- 
cations and thus showed a substantial degree 
of transsituational generality. In addition, 
women showed more consistency than men. 
In essence, anxiety, defensiveness, and sex 
acted as moderators of the consistency of 
interindividual differences in risk taking. Simi- 
larly, Weinstein and Lewinsohn (1968) found 
little relationship between two measures of 
predecisional information seeking across their 
entire sample, but they did find consistency 
within subgroups of “repressors” and “sensitizers.”
The strategy of searching for moderator
effects thus holds some promise. However,
since correlations are obtained within
each of several subgroups, the likelihood of 
capitalizing upon chance increases. Replica- 
tion is advisable, and, thus far, there have 
been few attempts to replicate these results. 


In summary, the majority of evidence ar- 
gues against the existence of risk-taking pro- 
pensity as a generalized characteristic of indi- 
viduals. Situational determinants seem much 
more important than organismic character- 
istics. Yet, we have little knowledge about 
the nature of these situational effects beyond 
the simple realization that as intersituation 
similarity decreases, generality likewise de- 
creases. The purpose of the present study is 
to demonstrate how a rather subtle situational 
characteristic of risk-taking tasks can alter a 
person’s information-processing strategies in 
a way that attenuates individual consistencies 
in behavior. 

The rationale behind the present experi- 
ment stems from the results of previous stud- 
ies by Slovic and Lichtenstein (1968) and 
Lichtenstein and Slovic (1971) showing the 
influence of information-processing considera- 
tions on risk taking. Slovic and Lichtenstein 
found that changes in response mode had a 
strong influence on the manner in which an 
individual weights probabilities and payoffs 
when evaluating a gamble. They found that 
choices and ratings were determined primarily 
by the gamble’s probabilities. However, when 
subjects were asked to evaluate a bet in terms 
of the maximum amount they would pay to 
play it (or the minimum for which they would 
sell a ticket to play it), their responses were 
most influenced by the amounts to be won and 
lost. Specifically, when subjects found a bet 
attractive, their buying and selling prices 
correlated predominantly with the amount to 
win, and when they disliked a bet, the 
amount to lose was the primary determiner of 
their responses. It was argued that these 
differences between ratings and choices on the 
one hand and price responses on the other 
demonstrated the influence of information- 
processing considerations on the method by 
which a gamble is judged. When setting a 
price, the subjects were evaluating the gamble 
in monetary units, and this requirement ap- 
parently led them to attend more to payoffs 
than they did when making choices and rat- 
ings. 

This result suggested that it might be possi- 
ble to construct a pair of gambles such that an 
individual would choose one of them but set a 
higher selling price for the other. For example,
consider the pair consisting of Bet P (.90 to
win $4 and .10 to lose $2) and Bet $ (.30 to
win $16 and .70 to lose $2). Although both
bets have equal expected values, Bet P has a 
much higher probability of winning, while 
Bet $ offers a much larger winning payoff. If 
choices tend to be determined by probabilities, 
while selling prices are most influenced by the 
amount to win, one would expect that subjects 
might choose Bet P over Bet $ but set a 
higher price on Bet $. This hypothesis was
tested by Lichtenstein and Slovic (1971),
using many pairs of P bets and $ bets, and it
was strongly confirmed. 
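
The claim that the two example bets have equal expected values can be checked with a few lines of arithmetic. The sketch below is only an illustration; the function name `expected_value` and the convention of passing the loss as a positive amount are assumptions made here, not part of the original study.

```python
def expected_value(p_win, win, p_lose, lose):
    """Expected value of a two-outcome gamble; `lose` is the amount lost, given as a positive number."""
    return p_win * win - p_lose * lose

# Bet P: .90 to win $4 and .10 to lose $2
bet_p = expected_value(0.90, 4, 0.10, 2)
# Bet $: .30 to win $16 and .70 to lose $2
bet_dollar = expected_value(0.30, 16, 0.70, 2)

# Rounded to cents, both bets are worth $3.40 in expectation.
print(round(bet_p, 2), round(bet_dollar, 2))
```

Because the expected values match, any systematic preference for one bet over the other must come from how probabilities and payoffs are weighted, not from a difference in average return.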

Evidence from these experiments indicates 
that an individual who is deciding on a sell- 
ing price for a bet he sees as favorable starts 
with the amount to win and adjusts it down- 
ward to take the other attributes of the bet 
into account. The amount to win translates 
directly into a price. However, the probabili- 
ties of winning and losing are more difficult to 
translate into monetary units. In trying to 
adjust his price to take this other information 
into account, the individual is not very precise.
Some persons simply subtract a constant
(or a variable amount that is uncorrelated 
with the probability and amount to lose) 
from the amount to win. Others multiply the 
amount to win by the probability of win- 
ning, entirely disregarding the amount to lose. 
The P bets offered high probabilities of win- 
ning, so their modest winning payoffs needed 
to be adjusted downward only slightly to take
the other factors into account when setting a
selling price. In contrast, the $ bets offered
only a modest chance to win. Therefore, subjects
should have made a sizable downward
adjustment of the large amount to win when
setting prices for these bets. They generally 
failed to do so and thus ended up stating 
much higher prices for $ bets than for P bets 
of comparable expected value. This, in turn, 
led their selling prices to differ from their 
choices. In choice, each attribute of the bet 
can be directly compared with the same 
attribute of the other bet. There is no natural 
starting point, and one may use any of a 
number of strategies to determine his choice. 
Quite often, the bet with the higher probabil- 
ity of winning is selected over the bet with 
the higher winning payoff. 
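
The two pricing strategies described above (subtracting a constant from the amount to win, or multiplying the amount to win by the probability of winning) can be illustrated with the example bets from the text. This is a rough sketch under stated assumptions: the function names are invented for illustration, and the $1 constant is hypothetical.

```python
def price_subtract_constant(win, constant):
    """Anchor on the amount to win, then subtract a fixed amount."""
    return win - constant

def price_scale_by_probability(win, p_win):
    """Anchor on the amount to win, then multiply by the probability of winning,
    ignoring the amount to lose."""
    return win * p_win

# Example bets from the text: Bet P (.90 to win $4) and Bet $ (.30 to win $16).
# The $1 constant is a hypothetical value chosen only for illustration.
p_prices = (price_subtract_constant(4, 1), price_scale_by_probability(4, 0.90))
dollar_prices = (price_subtract_constant(16, 1), price_scale_by_probability(16, 0.30))

print("Bet P prices:", p_prices)
print("Bet $ prices:", dollar_prices)
```

Under either strategy the $ bet receives the higher price even though the two bets have the same expected value, which is the pattern the anchoring-and-adjustment account predicts.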

In the present study, the Lichtenstein and
Slovic (1971) experiment was replicated with
an emphasis on examining the consistency of
individual differences in risk preference 
across choice and selling-price response modes. 
Each subject was asked to choose among a 
number of pairs of P bets and $ bets. He also 
set selling prices for each of these bets. Sub- 
jects were then scored on an index of pref- 
erence for $ bets (long shots) over the P bets 
(sure things). Each subject had two scores:
one based on his choices, the other on his 
selling prices. These scores were correlated 
across subjects to determine the generality of 
this particular risk-taking characteristic. 

Since each subject evaluated the same bets 
in two different ways, differential subjective 
probabilities and utilities could not bias the 
results. The choice and selling-price responses 
should have been determined only by the 
underlying subjective attractiveness of the 
gambles. Structural and motivational similar- 
ity between the tasks was very great. There- 
fore, this study can be viewed as an attempt 
to produce a high degree of transsituational 
generality. However, we know that the differ- 
ence between the two response modes has a 
profound effect on the strategies by which 
subjects process information. To the extent 
that all subjects are not equally influenced by 
these processing effects, these effects may in- 
terfere with any consistent individual differ- 
ences in risk-taking dispositions. The magni- 
tude of the obtained correlation between the 
two response conditions should indicate the 
extent to which this particular situational 
factor can perturb a stable individual differ- 
ence characteristic. 


METHOD 


The subjects were 92 college students, 44 men and 
48 women. Each subject chose among 38 pairs of P 
bets and $ bets, and also gave his minimum selling 
price for each of the 76 gambles employed in the 38 
pairs. One-half of the subjects gave selling prices 
after making their choices (Order A), while the other 
half gave selling prices prior to making choices (Or- 
der B). Prior to each task, the subjects made 10 
choices or gave 10 selling prices among practice bets. 
Results from the practice bets were excluded from 
analysis.

Considerable attention was given to maximizing 
motivation and minimizing indifference or carelessness.
The subjects were run in small groups and
were given lengthy and careful instructions. A number
of bets were actually played at the end of the
session, and subjects were paid their winnings. 


Stimuli 


The bets within each of the 38 pairs were equated 
in expected value. Each P bet had a winning probability
from 24/36 to 33/36; the probability of winning
for the $ bet ranged from 3/36 to 15/36. The
probabilities were expressed in thirty-sixths because
a roulette wheel was used to play the bets. The $
bet always had a larger amount to win than the P 
bet. The win typically exceeded the loss in a given 
bet. 

The bets were chosen in an attempt to represent 
a variety of P bets and $ bets. Payoffs were ex- 
pressed in points; the winning amount ranged from 
10 points to 1,000 points; the losing amount, from 
10 points to 370 points. The expected values ranged
from 10 points to 300 points. Table 1 presents the 
first six pairs in the choice condition to illustrate 
their characteristics. 


Instructions 


For the choice task, each subject was simply asked 
to choose, from each pair, the bet that he preferred 
to play. After each choice, the subjects indicated how 
strongly they preferred their chosen bet by marking 
one of four lines on their answer sheet; the first line 
was labeled “slight” preference, and the fourth was 
labeled “very strong” preference. The instructions
suggested that the two intermediate lines might be
labeled “moderate” and “strong.”

The instructions for the selling-price response were 
more involved. The subject was told to imagine that
he owned a ticket to play the bet and was asked to 
name a minimum selling price such that he would 
be indifferent to playing the bet or receiving the 
selling price. All of the persuasions discussed by 
Becker, DeGroot, and Marschak (1964) were used
to convince the subject that it was in his best 
interest to state, as his selling price, exactly what 
that bet was worth to him—no more and no less. 
Specifically, the subject was told that the experi- 
menter would choose a counteroffer, against which to 
compare the subject’s price, by spinning the roulette 
wheel and entering the number so obtained in a 
conversion table specially designed for each bet. The 
conversion table was a list of the 36 roulette num- 
bers with a counteroffer associated with each num- 
ber. If the counteroffer were equal to, or greater 
than, the subject’s stated bid, the experimenter 
would buy the bet from the subject, paying him the 
amount of the counteroffer. If the counteroffer were 
smaller than the subject’s price, no sale would be 
made, and the subject would play the bet. The 
counteroffer tables were constructed on the basis of 
previous bids for similar bets, with a range chosen 
to include most of the anticipated bids. The values
of the counteroffers can influence the expected value
of the game as a whole, but they do not affect the
optimal response strategy, a fact that was pointed
out to the subjects.


TABLE 1
A SUBSET OF THE BET PAIRS USED IN THE EXPERIMENT

Pair   P bet              $ bet
1      30/36 win 250       9/36 win 980
        6/36 lose 230     27/36 lose 100
2      33/36 win 210       9/36 win 1000
        3/36 lose 30      27/36 lose 80
3      33/36 win 40        9/36 win 240
        3/36 lose 80      27/36 lose 40
4      27/36 win 30        3/36 win 900
        9/36 lose 10      33/36 lose 60
5      27/36 win 200       3/36 win 800
        9/36 lose 370     33/36 lose 10
6      27/36 win 60       15/36 win 460
        9/36 lose 20      21/36 lose 260

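
The statement that the bets within each pair were equated in expected value can be verified for the six pairs listed in Table 1. The following is a minimal sketch; the data layout and the function name `expected_value` are assumptions made for illustration, and exact fractions are used so that the equality check is not affected by rounding.

```python
from fractions import Fraction

# Each bet: (win chances out of 36, points won, lose chances out of 36, points lost).
# Values are transcribed from Table 1; each pair is (P bet, $ bet).
pairs = {
    1: ((30, 250, 6, 230), (9, 980, 27, 100)),
    2: ((33, 210, 3, 30), (9, 1000, 27, 80)),
    3: ((33, 40, 3, 80), (9, 240, 27, 40)),
    4: ((27, 30, 9, 10), (3, 900, 33, 60)),
    5: ((27, 200, 9, 370), (3, 800, 33, 10)),
    6: ((27, 60, 9, 20), (15, 460, 21, 260)),
}

def expected_value(bet):
    """Expected value in points of a bet played on a 36-number roulette wheel."""
    n_win, win, n_lose, lose = bet
    return Fraction(n_win * win - n_lose * lose, 36)

for number, (p_bet, dollar_bet) in pairs.items():
    ev_p, ev_dollar = expected_value(p_bet), expected_value(dollar_bet)
    print(number, float(ev_p), float(ev_dollar), ev_p == ev_dollar)
```

For every listed pair the two expected values agree exactly, so differences in choices or prices within a pair cannot be attributed to differences in expected value.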


Further discussion of the technique emphasized two 
points: (a) The strategy that the subject should 
follow in order to maximize his gain from the game 
was always to name as his price exactly what he 
thought the bet was worth to him. (b) A good test
of whether the subject’s price was right for him was
to ask himself whether he would rather play the bet
or get that price by selling it. The price was right
when the subject was indifferent between these two
events.

The procedures described above were designed to 
make it unwise for the subject to state a selling price 
that was higher or lower than the bet’s subjective 
worth. This fact was explained to him as follows: 


For example, if you think a bet is worth 100 
points, then you shouldn’t care whether you play 
the bet or get 100 points for sure. Suppose you 
asked 100 points for this bet, your best estimate 
of its true worth to you. Now, if the experimenter’s 
offer is 100 points or more, he will pay you his 
offer. If less, you must keep the bet and play it. By 
naming a selling price of exactly 100 points, you 
will thus either play the bet (worth 100 points) or 
be paid 100 points or more. But suppose you were
greedy and bid a selling price of 120 points. Now,
if the experimenter’s offer is 120 points or more, 
you will be paid his offer—but that would be the 
same as if you had only bid 100 points. Also, if
his offer is 100 points or less, you will still keep 
and play the bet. That is the same as before, too. 
But if the offer is between 100 and 119 points, 
you will have to keep your bet (worth only 100 
points), whereas you could have been paid off at 
the offered price had you bid the true 100-point 
value of the bet. 

Naming a price lower than your best estimate 
of the true value of the bet will also work against
you. Here’s how: Suppose you only asked 80
points for this bet. Now, if the experimenter offers 
100 points or above, you will get his offered price 
—just as if you had bid 100 points, so that’s the 
same. Similarly, if he offers less than 80 points, 
you keep the bet; that’s the same, too. But if he 
offers between 80 and 99 points, then you must 
sell your 100-point bet for his offer, and thus get 
less for it than you think it’s worth. Therefore, 
your best strategy, and the one we want you to 
use, is to name a selling price exactly equal to 
your best estimate of what each bet is worth for 
you. Remember, as before, some of these bets will 
really be offered and then either sold or played. 
Your earnings will depend on the outcome, so 
name your selling price wisely!
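
The logic of these instructions can be illustrated numerically. The sketch below is not the experimenters' procedure in detail: the counteroffer table (36 equally likely offers from 50 to 225 points) is hypothetical, as is the function name `expected_outcome`. It only applies the stated rule that the bet is sold at the counteroffer when the counteroffer equals or exceeds the stated price and is otherwise kept and played.

```python
def expected_outcome(bid, worth, counteroffers):
    """Average result of naming `bid` when each counteroffer is equally likely.

    If the counteroffer is at least the bid, the bet is sold for the counteroffer;
    otherwise the subject keeps the bet, valued here at its subjective worth.
    """
    results = [offer if offer >= bid else worth for offer in counteroffers]
    return sum(results) / len(results)

# Hypothetical counteroffer table: one offer per roulette number, 50, 55, ..., 225 points.
counteroffers = [50 + 5 * i for i in range(36)]
worth = 100  # the subject's own estimate of what the bet is worth

for bid in (80, 100, 120):
    print(bid, round(expected_outcome(bid, worth, counteroffers), 2))
```

With this table, naming the true worth of 100 points yields a higher average outcome than naming either 80 or 120, in line with the argument given to the subjects.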

Playing the bets was deferred until the subject had
made all his choices and set all of his selling prices. 
Points earned were converted to money such that 
the minimum win for the 2-hour experiment was 80¢ 
(even for subjects who had a net loss of points), 
while the maximum win was $8. The concept of 
converting points to money, and the minimum and 
maximum win, was explained to the subjects at the 
beginning of the experiment. However, the actual
conversion curve was not revealed until the experiment
was over.


RESULTS 


Two indexes of preference for the $ bet 
over the P bet were computed for each sub- 
ject. The first index was the mean strength of 
preference rating, across the 38 pairs of bets. 
Preferences for the $ bet were coded as posi- 
tive numbers ranging from 1 to 4, and pref- 
erences for the P bet were coded as negative 
numbers. Therefore, a person with consistently
strong preferences for the P bet would
have a high negative score on this index. The
second index was the mean selling price for
the 38 $ bets minus the mean selling price for
the 38 P bets.


TABLE 2
RELIABILITIES AND INTERTASK CORRELATIONS

Order   Sex             n    Choice        Selling-price   Intertask
                             reliability   reliability     correlation
A       Male            22   .91           .97             .64
A       Female          24   .86           .99             .30
B       Male            22   .89           .97             .55
B       Female          24   .92           .96             [illegible]
A       Male & female   46   .90           .98             .40
B       Male & female   46   .90           .96             [illegible]
A & B   Male            44   .90           .97             .59
A & B   Female          48   .89           .98             .40
A & B   Male & female   92   .90           .97             .46

Note. Order A was choice first and selling prices second.
Order B was the opposite.

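
The two preference indexes defined in the text can be sketched as follows. The function names and the responses for a single subject are made up for illustration (only four pairs are shown, whereas the study used 38); the coding follows the text, with positive ratings for a chosen $ bet and negative ratings for a chosen P bet.

```python
from statistics import mean

def choice_index(ratings):
    """Mean signed strength of preference over all pairs.

    Each rating is +1..+4 when the $ bet was chosen and -1..-4 when the P bet was chosen.
    """
    return mean(ratings)

def price_index(dollar_prices, p_prices):
    """Mean selling price of the $ bets minus mean selling price of the P bets."""
    return mean(dollar_prices) - mean(p_prices)

# Hypothetical responses for one subject on four pairs.
ratings = [-3, -2, 1, -4]
dollar_prices = [120, 90, 60, 150]
p_prices = [100, 80, 50, 110]

print(choice_index(ratings))
print(price_index(dollar_prices, p_prices))
```

This hypothetical subject has a negative choice index (a preference for P bets) but a positive selling-price index (higher prices for $ bets), which is the kind of inconsistency between response modes that the study examines.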

To assess the reliability of each of these 
measures of preference, the same two indexes 
were computed separately for the bets in the 
19 odd-numbered pairs and the 19 even-num- 
bered pairs. The choice index based on odd- 
numbered pairs was then correlated, across 
subjects, with the choice index based on even- 
numbered pairs. Similarly, the preference index
derived from selling prices for bets in
odd-numbered pairs was correlated with the
comparable index based on prices for bets in
even-numbered pairs. These split-half relia- 
bilities were then adjusted via the Spearman- 
Brown formula to estimate the reliability of 
the full-length measures. 
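
The Spearman-Brown adjustment mentioned above predicts the reliability of a full-length measure from the correlation between its two halves. A minimal sketch follows; the function name and the split-half value of .80 are illustrative assumptions, not figures from the study.

```python
def spearman_brown(r_half, factor=2):
    """Predicted reliability when a measure is lengthened by `factor`.

    With factor=2 this is the usual split-half correction: 2r / (1 + r).
    """
    return factor * r_half / (1 + (factor - 1) * r_half)

# Hypothetical split-half correlation of .80 between odd- and even-numbered pairs.
print(round(spearman_brown(0.80), 3))  # 0.889
```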

The results of these analyses are shown in 
Table 2. They indicate that both risk-taking 
indexes were highly reliable. It is also evident 
that some moderate degree of generality was 
present, although the magnitudes of the inter- 
task correlations were considerably less than 
the reliabilities of the individual measures 
would permit. 
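
One way to make the comparison with the reliabilities concrete is the classical test theory result that the correlation between two measures cannot exceed the square root of the product of their reliabilities. The sketch below applies that standard formula to the total-sample values reported in Table 2 (.90, .97, and .46); the function names are assumptions, and the calculation is offered as an illustration, not as an analysis reported by the author.

```python
from math import sqrt

def max_observable_r(rel_x, rel_y):
    """Upper bound on the correlation between two measures, given their reliabilities."""
    return sqrt(rel_x * rel_y)

def disattenuated_r(r_xy, rel_x, rel_y):
    """Observed correlation corrected for unreliability in both measures."""
    return r_xy / sqrt(rel_x * rel_y)

# Total-sample values from Table 2: choice reliability .90,
# selling-price reliability .97, intertask correlation .46.
ceiling = max_observable_r(0.90, 0.97)
corrected = disattenuated_r(0.46, 0.90, 0.97)

print(round(ceiling, 2), round(corrected, 2))
```

On these figures the reliabilities would allow an intertask correlation of roughly .93, and correcting the observed .46 for unreliability raises it only to about .49, so measurement error accounts for very little of the shortfall.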

Female subjects did not show greater inter- 
task generality than males, contrary to the 
findings of Kogan and Wallach (1964). Why
generality should be relatively low among females
in the Order A condition is not clear,
and the result may be simply a chance fluctuation.

The difference between the high within-task 
reliabilities and the moderate cross-task cor- 
relations can be viewed as evidence for the 
perturbation of individual consistencies by 
the information-processing factors described 
earlier. The scatter plot displayed in Figure 1 
shows this in more detail. First, it is apparent
that the response mode exerted a global in- 
fluence upon preference for the $ bet in a 
direction consistent with previous research; 
about 70% of the scores on the choice index 
were negative, indicating a general preference 
for the P bet, yet $ bets were given higher 
mean selling prices by about 80% of the subjects.
As expected, there were many subjects
whose choices indicated a preference for P
bets but whose selling prices were greater for
$ bets. Also as expected, there were few of
the reverse inconsistencies, where a subject 
chose $ bets, yet stated higher selling prices 
for P bets. What the scatter plot shows, more- 
over, and what is most important here, is 
that, for any given level of preference based 
on the choice index, there was considerable 
variation in the level of the selling-price index. 
Some subjects gave selling prices consistent 
with their choice preferences; others did not. 
The end result was a marked reduction in 
intertask consistency. 


DISCUSSION


In this experiment, an effort was made to 
create two structurally parallel measures of 
risk-taking propensity, in order to produce a 
high degree of intertask consistency. The 
choice and selling-price indexes both assessed 
preference for long-shot gambles over gambles 
with a very high probability of success. The 
same gambles were used in both conditions, 
insuring that subjective probabilities and utili- 
ties would be equivalent across the two tasks. 
The subjects were carefully instructed and 
motivated, and reliable individual differences 
were found in each task. The correlations be- 
tween the two risk-taking indexes were high 
enough to provide definite evidence for inter- 
task generality. However, in light of the ef- 
forts that were made in order to increase gen- 
erality, the modest size of these correlations 
must be viewed more as support for the im- 
portance of situational factors than support 
for the notion of risk-taking propensity as a 
stable trait. The fact that a simple change in 
response mode can create so much inconsis- 
tency among individuals’ relative standings in 
the group implies that high correlations be- 
tween risk-taking measures in structurally 
different settings or between risk taking and 
other behaviors are unlikely to be found. 

FIG. 1. Relationship between choice and selling-price
indexes across the total sample of subjects
(r = .46).


The situational specificity within the domain
of risk taking, and the disruptive influence
of response mode shown in the present
study, parallel the findings from other behavioral
realms (e.g., see Acker & McReynolds,
1967; Sermat, 1970). An extensive review of
research on the stability of individual traits is
provided by Mischel (1968), who surveyed
the literature dealing with cognitive, attitudinal,
and personality characteristics. The latter
included attitudes toward authority and
peers, moral behavior, despondency, aggression,
tolerance for ambiguity, and rigidity.
Mischel summarized his findings as follows:


First, behavior depends on stimulus situations and 
is specific to the situation: response patterns even 
in highly similar situations often fail to be strongly 
related. Individuals show far less cross-situational 
consistency in their behavior than has been assumed 
by trait-state theories. The more dissimilar the evoking
situations, the less likely they are to lead to . . .
consistent responses from the same individual. Even 
seemingly trivial situational differences may reduce 
correlations to zero. Response consistency tends to 
be greatest within the same response mode. . . . In- 
traindividual consistency is reduced drastically when 
dissimilar response modes are employed [p. 177]. 


Mischel and others have argued that a 
viable approach to personality must bridge 
the gap between the principles of experi- 
mental psychology and the problems that 
confront the personality psychologist. The 
present study can be viewed as a step in this 
direction.


ATTRIBUTION OF ABILITY TO OTHERS ON SKILL AND
CHANCE TASKS AS A FUNCTION OF LEVEL OF RISK¹


JERALD M. JELLISON²
University of Southern California

JOHN RISKIND
Yale University

LORRAINE BROLL
University of Wisconsin


An experiment was conducted to test the hypothesis that persons who take high 
risks would be perceived as higher in ability than a person who takes low risks 
on a task involving skill but that there would be no difference in perceived 
ability on chance tasks as a function of level of risk. Subjects learned that 
another had either made high or low bets on his performance on a task. For 
half of the subjects, the task was portrayed as one in which ability could 
determine success; for the other subjects, chance was the factor that could 
determine success on the task. The ability of the other was then rated. The 
results support the prediction and are discussed in terms of the social com- 
parison of abilities interpretation of risk taking. 


In an attempt to explain the risky shift 
phenomenon, Jellison and Riskind (1970) 
predicted and found that people attribute 
greater ability to another the greater the level 
of risk he selects. It was reasoned that people 
assume that another chooses a level of risk 
that is comparable to his abilities, and that 
greater ability is required for success at high- 
risk tasks than at low-risk tasks. Therefore, 
the choice of high risk is taken as an indica- 
tion of high ability. 

Under some conditions it might be unlikely 
that taking differential risk at a task would 
be taken as an indication of differential abil- 
ity. Once the difficulty of a task is far in ex- 
cess of anyone’s ability, then taking an even 
higher risk would probably not be perceived 
as an indication of greater ability. For exam- 
ple, on a chance task, the choice of different 
levels of risk would probably not be taken as 
an indication of differential ability, since even 
a low risk on a chance task is beyond a normal 
person’s ability. The present experiment was 
conducted to test the hypothesis that on a 
skill task the higher the level of risk another 
selects, the greater will be his perceived
ability, but, on a chance task, attributed ability
will be the same regardless of the level
of risk chosen.


1 The research reported here was conducted while
the third author was supported by a National Science
Foundation undergraduate research fellowship.

2 Requests for reprints should be sent to Jerald
M. Jellison, Department of Psychology, University
of Southern California, University Park, Los Angeles,
California 90007.


METHOD


Overview


College students formed impressions of a bettor on 
the basis of his betting preferences on an angle- 
matching task. To half of the students it was ex- 
plained that the task was only moderately difficult 
and that skill was the determining factor of success. 
The remaining students were told the task was ex- 
ceedingly difficult and that chance was the factor 
determining success. Both in the chance and skill 
conditions, betting sheets were designed to create two 
levels of the bettor’s style of performance: high risk 
or low risk. Subjects learned the difficulty of the 
task by matching the same practice angles on which 
the bettor had supposedly practiced. After learning 
of the task difficulty and reading the bettor’s risk 
preferences, the subjects rated the personal char- 
acteristics of the bettor. 


Subjects 


The subjects were 52 male and 32 female students 
in introductory psychology. Subjects were run in 
mixed-sex groups ranging in size from 2 to 8. Each 
group was randomly assigned to the chance or skill 
condition, and within each group, subjects were 
randomly assigned to either the high or low bet 
condition. 


Procedure 


After the subjects for a session had assembled, the 
experimenter explained that the study concerned the 
process of forming first impressions. Subjects were to
form an impression of another person on the basis of 
the person’s betting behavior on an angle-matching
task. The subjects were told that all of the bettors
were male and that they had responded to an ad 
for paid subjects placed by a local research institute. 
Subjects were told that the bettors had known that 
the experiment would last for 2 hours and that half 
of the applicants to the betting study had left on 
learning this. It was reasoned that subjects would
not believe that bettors motivated by curiosity alone 
would devote 2 hours to the betting experiment, and 
that those who participated valued the monetary 
payment. The experimenter explained that the par- 
ticipants in the betting study had been given $5 
with which to bet, and their payment was the por- 
tion of this remaining after betting plus or minus 
their winnings. The experimenter explained that the 
bettors had been given the option of participating 
in another experiment which guaranteed a $5 pay- 
ment. The information was included so that subjects 
would not believe that the bettors were too highly 
motivated by a need for money. If bettors had been
too highly motivated for money, then the low risk 
taker might have been seen as wise because he did 
not risk losing much of his payment. Thus, the ex- 
perimenter stressed that the bettors had been moti- 
vated by both curiosity and money. 

The angles were drawn on 12.7 × 12.7 centimeter
index cards and were arranged so that there were 10 
angles from which the subjects were to choose 1 
angle that matched a standard angle. For the skill 
condition, the 10 angles on each trial were recog- 
nizably different from the standard, and only 1 angle 
matched the standard angle. For the chance condi- 
tion, the 10 angles were virtually the same, and it 
was apparent that it was almost impossible for a 
person to consistently determine which angle matched 
the standard angle. 

Subjects became acquainted with the information 
that the task was a chance task or a skill task by 
taking the same pretest supposedly taken by the 
bettors. Subjects were told to read the original pre- 
test instructions and were asked to attempt to match 
the same three practice angles supposedly tried by 
the bettors. These angles were either moderately 
difficult or exceedingly difficult for the average per- 
son. The instructions included a figure indicating 
the percentage of the first 100 subjects run by the 
institute who had matched the angles correctly. For 
the skill condition the three percentages centered 
around 50%, and for the chance condition the three 
percentages centered around 10% or a priori chance 
probability. 

After attempting the three trials, each subject was 
given a sheet that supposedly belonged to one of 
the bettors. The sheet contained the instructions 
that the bettor was to indicate how much he wanted 
to bet on each of the 10 angles before the 10 trials 
began. For each trial he could bet 0¢, 1¢, 5¢, 10¢,
15¢, 20¢, or 25¢. The odds were always 2:1. Each
of the 10 trials was numbered, and beside each
number was a blank space for indicating his bet 
and a figure indicating the percentage of the first 
100 people who had matched the angle correctly. In 
the skill condition these figures were 60%, 45%,
38%, 51%, 69%, 47%, 55%, 39%, 53%, and [illegible].
In the chance condition the figures were 15%,
9%, 12%, 11%, 6%, 10%, 13%, 8%, and 16%.

Level of risk was manipulated by the amount of
money bet. In the high bet condition, the [illegible]
wrote in the following bets in cents: 25, 15, [illegible],
25, 20, 25, 25, 15, and 25. In the low bet condition,
the following bets, written in cents, were: 5, 1, [illegible],
5, 0, 1, 5, 1, and 1.

After the subjects had studied the sheet containing
the bettor’s bets, the experimenter distributed an
adjective rating form on which they were to indicate
their impression of the bettor. The rating form
contained the following adjectives: capable, cautious,
clever, competent, creative, dumb, ingenious,
innovative, insightful, unintelligent, and wise. Subjects
were to indicate how well each adjective applied
to the bettor by circling a number on the scale
below each adjective, which ranged from 0 (extremely
inappropriate) to 20 (extremely appropriate). 

When the subjects completed the rating form, they 
were asked to write a few sentences describing 
their reactions to the study and any ideas they had
about the study. None of the comments indicated 
suspicion. 


RESULTS 


A check on the manipulation of level of
risk was provided by the ratings on the adjec-
tive cautious. Each subject’s score was the
same as the scale value circled. The means
for cautious for the four experimental condi-
tions were: high bet-skill (X̄ = 6.00); high
bet-chance (X̄ = 4.81); low bet-skill (X̄ =
16.48); and low bet-chance (X̄ = 17.24) on a
scale from 0 (extremely inappropriate) to 20
(extremely appropriate). An analysis of vari-
ance indicated that only the main effect for
the amount of the bet was significant (F =
127.81, df = 1/80, p < .001). The manipula-
tion of level of risk seems to have been quite
successful.

Attributed ability, the dependent variable,
was measured by the average of the means of
the ratings for the adjectives capable, clever,
competent, creative, dumb, ingenious, inno-
vative, insightful, unintelligent, and wise.
Scores for the favorable adjectives were the 
same as the subject’s rating; scores for the 
unfavorable adjectives were obtained by sub- 
tracting the subject’s rating from 20. 
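
The scoring rule just described can be sketched in Python. This is only an illustration under stated assumptions: the function name `ability_score` and the sample ratings are hypothetical, and the sketch takes a simple mean over the ten ability adjectives for one subject.

```python
# Adjectives named in the text; "cautious" is excluded because it served
# as the manipulation check, not as part of the ability score.
FAVORABLE = ["capable", "clever", "competent", "creative",
             "ingenious", "innovative", "insightful", "wise"]
UNFAVORABLE = ["dumb", "unintelligent"]
SCALE_MAX = 20  # ratings run from 0 to 20


def ability_score(ratings):
    """Mean ability score for one subject (hypothetical helper).

    Favorable adjectives count as rated; unfavorable adjectives are
    reverse scored by subtracting the rating from 20.
    """
    scores = [ratings[adj] for adj in FAVORABLE]
    scores += [SCALE_MAX - ratings[adj] for adj in UNFAVORABLE]
    return sum(scores) / len(scores)


# Hypothetical ratings for a single subject, for illustration only.
example = {adj: 12 for adj in FAVORABLE}
example.update({"dumb": 4, "unintelligent": 6})
print(ability_score(example))
```

With this rule, a low rating on dumb or unintelligent raises the ability score, so larger numbers always mean greater attributed ability.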

The means of the ability scores for the ex- 
perimental conditions are presented in Table 
1. It was predicted that taking differential 
risk, the amount that was bet, would affect 
attributed ability on a skill task but not on a 




TABLE 1
MEANS OF ABILITY RATINGS

                  Amount of bet
Type of task      High      Low
Skill             12.0      7.3
Chance            12.1     10.9


Note.—Each cell mean is based on ratings for 21 subjects. 
The larger the number, the greater the attributed ability. 


chance task. An analysis of variance indi-
cated that the predicted Type of Task ×
Amount of Bet interaction was significant (F
= 5.48, df = 1/80, p < .05). t tests indicated
that the other was perceived as higher in abil- 
ity in the skill-high-bet condition than in the 
skill-low-bet condition (t = 4.72, df = 40, p 
< .001). However, in the chance conditions, 
there was no significant difference in the rat- 
ings of the other as a function of the amount 
of the bet (t = 1.24, df = 40, ns). When the
other took high risks, there was no difference 
in attributed ability as a function of the type 
of task. However, when the other’s level of 
risk was low, greater ability was attributed to 
the other on the chance task than in the skill 
task (t = 3.45, df = 40, p < .01).

Examination of Table 1 shows that the 
other was rated higher in ability in the high 
bet conditions than in the low bet conditions. 
An analysis of variance indicated that the 
main effect for the amount of money bet was 
significant (F = 16.03, df = 1/80, p < .01). 
Somewhat surprisingly, the other was rated 
significantly higher in ability (F = 6.38, df = 
1/80, p < .05) in the chance conditions than 
in the skill conditions. 
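
The pattern behind these tests can be read directly from the cell means in Table 1. The following Python sketch is an illustration only, not a reanalysis: it uses just the four reported means, treats the cells as equal in size (21 subjects each, per the table note), and the names `means` and `marginal_mean` are hypothetical.

```python
# Cell means from Table 1, keyed by (type of task, amount of bet).
means = {
    ("skill", "high"): 12.0,
    ("skill", "low"): 7.3,
    ("chance", "high"): 12.1,
    ("chance", "low"): 10.9,
}


def marginal_mean(factor_index, level):
    """Unweighted mean of the cells at one level of one factor.

    factor_index 0 selects type of task, 1 selects amount of bet.
    An unweighted mean is appropriate here because cell sizes are equal.
    """
    cells = [m for key, m in means.items() if key[factor_index] == level]
    return sum(cells) / len(cells)


# Simple effect of bet size within each type of task.
bet_effect_skill = means[("skill", "high")] - means[("skill", "low")]
bet_effect_chance = means[("chance", "high")] - means[("chance", "low")]

# Interaction contrast: how much larger the bet effect is on the skill task.
interaction_contrast = bet_effect_skill - bet_effect_chance

print(round(bet_effect_skill, 2), round(bet_effect_chance, 2),
      round(interaction_contrast, 2))
print(round(marginal_mean(1, "high"), 2), round(marginal_mean(1, "low"), 2))
print(round(marginal_mean(0, "skill"), 2), round(marginal_mean(0, "chance"), 2))
```

The difference between the two simple effects is the quantity tested by the interaction, while the marginal means correspond to the two main effects reported above.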


Discussion 


This experiment was designed to test the 
hypothesis that individuals choosing high risk 
would be perceived as higher in ability than 
individuals choosing low risk in a skill task, 
but in a chance task, individuals choosing 
high risk would not be perceived as any higher 
in ability than individuals choosing low risk. 
The above prediction, which is in line with 
the social comparison of abilities interpreta- 
tion of risk taking behavior, was clearly sup- 
ported. These results can be interpreted in 
terms of the relevance of differential ability
to performance in skill tasks but not in chance 
tasks. In a skill task, if people have no other 
reason to doubt the rationality of an indi- 
vidual’s choice of high risk, people will at- 
tribute the individual’s choice of high risk to 
the individual’s perception that he is high in 
ability. In a chance task, people would doubt 
the rationality of any individual who chooses 
high risks. Outcomes in a chance task are be- 
yond the individual’s control, and an indi- 
vidual who chooses high risks would not be 
perceived as any higher in ability than an 
individual who chooses low risks on a chance 
task. 

The finding that, in general, the stimulus 
person was perceived as higher in ability when 
he bet a large amount than when he bet a 
small amount can be considered as a con- 
ceptual replication of the previous finding that 
the level of risk selected is directly related to 
perceived ability. 

The finding that, in general, the other was 
rated higher in ability in the chance conditions 
than in the skill conditions was not predicted 
and was somewhat unexpected. Upon reflec- 
tion, this finding may not be surprising. While 
taking differential risk on a chance task is not 
perceived as an indication of differences in 
ability, the fact that the individual is willing 
to choose to at least try on the chance task 
may be interpreted as an indication of high 
ability. That is, a chance task represents a 
high-risk situation because the difficulty of 
the task exceeds the average individual’s abil- 
ity. In such a high-risk situation, a person of 
only moderate or low ability would probably 
be expected to choose not to take any action 
that would involve costs. In the present ex- 
perimental situation, the stimulus person was 
assigned to the chance task, but, in a sense, he 
did choose to take significant action. He 
could have avoided taking any significant 
action by choosing to bet nothing, 0¢, on each 
trial. The fact that even the low-risk person 
was almost always willing to choose to bet 
some money on the chance task may have 
been interpreted as an indication of ability. 
Thus, it seems that choosing to attempt any
risk on a chance task will be perceived
as an indication of relatively high ability, but
the magnitude of the risk taken, in terms of
the costs associated with failure, will not sig- 
nificantly alter the amount of attributed abil- 
ity. 

The present study was designed so that 
when the other bet a large amount, he would 
be perceived as taking a high risk since he 
was almost always betting the maximum al- 
lowed. In the context of the present situation, 
betting the maximum of 25¢ is actually not a 
very high risk. It is quite possible that if the 
amount bet were substantially increased, then 
the ratings of ability would increase in the 
skill task condition but would remain the 
same in the chance task condition. Also, the 
nature of the task, regardless of whether it 
involves chance or skill, probably contains 
inherent limits on the amount of ability that 
will be inferred. In this regard, it may be 
that on some tasks, persons take high risks in 
order to be perceived as high in ability and 
on other tasks in order to avoid the attribu- 
tion of low ability. 

It has been suggested (Dion, Baron, & Mil- 
ler, 1970) that high risk is perceived as an 
indication of positive qualities in general, and 
ability is just one of these qualities. A thor- 
ough reading of the first experiment by Jelli- 
son and Riskind (1970) showing the direct 
relation between risk and perceived ability 
makes such an interpretation seem very un- 
likely. In that experiment, risk was varied 
over a wide range, and there was no difference
in the ratings of the characteristic honest, 
which, according to Anderson (1968), is 
about the most positive characteristic an in- 
dividual can possess. Further, there was no 
difference in the ratings for likable, which
would seem to be an excellent indicator of
positive characteristics. Wallach, Kogan, and
Burt (1968) also failed to find any evidence
that high risk takers are liked more than low
risk takers. The present findings and those
previously cited make the assumption of the 
universal value of risk in Brown’s (1965) 
value hypothesis seem questionable. 


DEMAND CHARACTERISTICS OF THE MODELING
EXPERIMENT:

ALTRUISM AS A FUNCTION OF AGE AND AGGRESSION¹


JOAN E. GRUSEC²


University of Toronto 


¹ This investigation was supported by Grant 116
from the Ontario Mental Health Foundation. The
author wishes to thank officials of the North York
Parks and Recreation Department, their playground
supervisors, and the principal and teachers of Trans-
figuration School, Toronto, and of the Institute of
Child Study at the University of Toronto for their
generous cooperation. Appreciation is due to Valerie
Hunt, who acted as the experimenter, and to Nancy
Satterberg, Karen Seabrook, [name illegible], and
Norman Greenberg, who acted as models.

² [...] E. Grusec, Department of Psychology, University
of Toronto, Toronto 181, Canada.


Explanations of the phenomena of imitation
and identification have been abundant.
Freud (1925) proposed that the young boy
identifies with his father in order to reduce
anxiety over anticipated punishment for his
incestuous wishes toward his mother and, at
the same time, to vicariously gain her affection.
Mowrer (1950) and Sears (1957) suggested
that children imitate their caretakers
because this matching of behavior has intrinsic
reinforcement value. Maccoby (1959)
hypothesized that children attend more closely
to the behavior of powerful models in order
to maximize their own rewards, thus learning
more about that behavior and hence reproducing
more of it in an appropriate stimulus
situation. Lately, Gewirtz and Stingle (1968)
have proposed that observers imitate because
they have previously been reinforced for
matching their behavior to that of a model.
Many of the innumerable laboratory demonstrations
during the last decade that individuals
will, in fact, fashion their behavior
after that of someone they have observed have
been explained by these theories, or variants
thereof. Lately, however, it has been suggested
that subjects in many studies of imitation
may be imitating observed behaviors not
for any of these relatively complex reasons,
but simply in response to the demand characteristics
of the experimental situation. Thus,
for the child placed in an unusual situation,
the model’s behavior may provide information
about the appropriateness of a given act
(Aronfreed, 1968; Flanders, 1968; Krebs,
1970). This information need not be mediated
by actual performance of the behavior, but
presumably could just as easily be provided
by nonsocial cues or verbal instruction. Thus,
a subject in a modeling study need not actually
observe someone perform the behavior
of interest; a statement about what ought to
be done in the situation should be sufficient.
A test of this hypothesis was undertaken
by Grusec and Skubiski (1970). They exposed
8- and 10-year-old subjects to an adult
who either performed the behavior of interest
(in this case, giving half of his winnings
from a game to poor children) or who merely
verbalized what he thought the appropriate
behavior was (to donate half of what he won
to charity). An attempt was made to keep the
amount of information conveyed to the sub- 
jects in each condition constant. In the per- 
formance condition the model shared five 
times, while in the verbalization condition, he 
stated, in five various ways, that one ought 
to share. Grusec and Skubiski found that the 
subjects in the performance condition shared, 
while those in the verbalization condition, 
with the exception of girls to whom the model 
had previously been warm and friendly, 
shared virtually not at all. Similarly, Bryan 
and Walbek (1970), using 8- to 10-year- 
olds, found that moral exhortations about the 
general desirability of sharing were ineffective 
in producing that behavior. It seems evident, 
then, that at least with regard to the behav- 
ior of sharing by children aged 8-10, and 
with the exception of nurtured girls, actual 
demonstration of that behavior by a model, 
rather than just verbal references to it, is 
necessary for it to be displayed. The reason 
for this is not obvious. Performance may pro- 
duce sharing, while verbalization or moral 
exhortation do not since only in the former 
situation do subjects see that people who give 
up rewards do not necessarily feel unhappy 
and uncomfortable afterwards. Perhaps per- 
formance leads to disinhibition of the tend- 
ency not to engage in novel behaviors in a 
strange situation, while verbalization still ne- 
cessitates too much initiative on an observer’s 
part in translating words into actions. Again, 
subjects who observe a model share would 
have to label themselves as selfish by com- 
parison with a generous model if they did 
not also share. In a verbalization condition 
the model would not have established himself 
as generous, and so nonsharers would suffer 
much less in comparison with him. These 
are probably only a few of the possible 
explanations. 

In order to gain some idea of why per- 
formance and verbalization were not equally 
effective, and thereby just what function a 
model does serve in imitation, the present 
studies were devised to, hopefully, illustrate 
conditions under which performance and verbalization
would be equally effective. The first
study compares the effects of performance and 
verbalization in a sharing situation in which 
subjects of different ages were employed. 
Numerous studies have shown that sharing 
increases with age (Handlon & Gross, 1959; 
Ugurel-Semin, 1952). If a belief about the 
virtue of sharing, or a “norm of social respon- 
sibility” (Berkowitz & Daniels, 1963) has 
been well-learned, we might expect that simple
statements about appropriate behavior should
suffice to remind the subjects of that norm
and induce them to share. On the other hand, 
for subjects for whom the norm is not well- 
established, verbalization, for the kinds of 
reasons offered above, might not be sufficient 
to produce sharing. Thus, young subjects 
might not realize that self-sacrifice is virtuous 
and approved of, and that one is supposed to 
feel good after he has engaged in it, even if 
he is not discovered in the act. 

In an initial experiment, then, to compare 
the relative effectiveness of performance and 
verbalization with younger and older children, 
7- and 11-year-old boys and girls³ saw a
model either share some of his winnings from 
a game with poor children or merely verbalize 
his expectations that it was appropriate to 
share in that situation. Then, the subjects 
played the game alone, and the amount of 
sharing they did was observed through a one-
way mirror. Subjects were also asked to recall
as much as they could of the model’s behavior 
in the two conditions in order to see if the 
model’s behavior might be better remembered 
in the performance than the verbalization 
condition and thus if differences in acquisition 
might account for any observed differences in 
sharing. In addition to the performance and
verbalization groups, a control group of
11-year-olds was run to see how much sharing
they would do in the absence of a model. 
Previous data from the Grusec and Skubiski 
(1970) study indicated that younger children 
would not share at all under these conditions. 


³ Seven-year-olds were chosen because they were
the youngest subjects who could reliably understand
the rules of the game employed by Grusec and
Skubiski (1970), and 11-year-olds were chosen be-
cause they were the oldest subjects available, and
older than Grusec and Skubiski’s subjects.




It seemed possible, however, that an experi- 
menter’s permissive reference to sharing might 
elicit that behavior in older subjects. 

Sharing is a behavior whose performance 
has no attraction in and of itself. Giving up 
to others material rewards that one might 
have kept for oneself necessitates self- 
denial, especially when no external reinforce- 
ment for this is offered. Not only does altru- 
istic behavior appear unnatural in the sense 
that it runs counter to learning theory principles,
but it also is dissonant with psychoanalytic theory,
which maintains that children are basically
selfish and id driven. It is
not too surprising, then, that something more 
than a verbal indication of its appropriate- 
ness would be necessary to induce some 
subjects, at least, to share. 

But what of a behavior whose performance 
is not aversive, and may even be attractive? 
If subjects actually enjoy doing something, 
then verbalization of its appropriateness 
might be sufficient to ensure its occurrence. 
This hypothesis was tested in a second study, 
with aggression chosen as the relevant behav- 
ior. Certainly aggression involves no self- 
denial, and something about its production— 
perhaps the strong proprioceptive feedback it 
provides—may be pleasurable. Psychoanalytic 
theory suggests that aggression is one of man’s 
basic drives, and Lorenz (1963) talks of 
man’s instinctive aggressiveness. Even re- 
search inspired by social learning theory may 
be interpreted to suggest that aggressiveness 
has some intrinsic attractiveness. Bandura and 
Huston (1961) found that aggressive re- 
sponses were readily imitated by young chil- 
dren regardless of a model’s social character- 
istics, while the reproduction of neutral be- 
haviors depended more greatly upon those 
characteristics. 

In an experiment to test this second hy- 
pothesis that performance and verbalization 
would be equally effective in producing ag- 
gression, children either observed a model who 
engaged in a series of aggressive behaviors or 
one who stated that it was appropriate to 
perform these same behaviors. A third group 
of children did not see or hear a model, but 
were placed in a situation with the same 
physical materials as the first two groups in 
order to control for the amount of aggression
these materials themselves might elicit.


EXPERIMENT I 
Method 


Subjects 


The subjects were 20 boys and 20 girls aged 7 
years, and 30 boys and 30 girls aged 11 years. 
Twelve boys and 12 girls from the 7-year-old group 
and 18 boys and 18 girls from the 11-year-old group 
were attending summer playgrounds run by the 
North York Recreation Department, while the rest 
of the subjects came from a Metropolitan Toronto 
separate school and participated in the experiment 
approximately 3 months later. 


Design 


A 2 × 2 × 2 factorial design was employed, with
the model’s method of presentation—performance or 
verbalization—of sharing behavior, age of subjects, 
and sex of subjects being the factors of interest. 
Forty 7-year-old and forty 11-year-old boys and 
girls were randomly assigned to either a performance 
or verbalization group, and twenty 11-year-olds were 
assigned to a control group which did not observe a 
model. The same proportion of boys and girls from 
the playground and from the separate school were in 
each group. A male and female adult served as 
models, with each child seeing a same-sexed model. 
The same female experimenter was used with all 
subjects, with a different set of models being 
employed during the summer and the fall. 
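
The layout of this design can be enumerated in a short Python sketch. It is illustrative only. The figure of 10 children per experimental cell is an assumption inferred from the reported totals (80 children over 8 cells) and from the degrees of freedom reported in the Results; the even split of the control group by sex is likewise inferred from the subject counts. All variable names are hypothetical.

```python
from itertools import product

METHODS = ("performance", "verbalization")
AGES = (7, 11)
SEXES = ("boy", "girl")

# Assumption: the 80 experimental children were spread evenly over the
# 2 x 2 x 2 = 8 cells, giving 10 per cell.
PER_CELL = 10

experimental = {cell: PER_CELL for cell in product(METHODS, AGES, SEXES)}

# Twenty 11-year-olds saw no model; assumed here to be 10 boys and 10 girls.
control = {("control", 11, sex): 10 for sex in SEXES}

total = sum(experimental.values()) + sum(control.values())
eleven_year_olds = sum(
    n for (_, age, _), n in {**experimental, **control}.items() if age == 11
)

print(len(experimental), total, eleven_year_olds)
```

Under these assumptions the totals match the Subjects section: 100 children in all, of whom 60 are 11-year-olds.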


Procedure 


Performance and verbalization manipulations. Chil- 
dren were brought individually by the experimenter 
to a mobile laboratory parked in the school yard, 
supposedly to test a bowling game designed for use 
by both children and adults. As they entered the 
trailer, the experimenter introduced the child to an 
adult who, she said, was also going to play the 
game. The game apparatus was a miniature bowling 
alley, bounded at the far end by a vertical shield. 
Eight jewel lights, labeled with numbers ranging 
from 10 to 80, were mounted in three staggered rows 
on the front of the shield. The model and subjects 
were told that the game involved skill and that 
when a ball was rolled into one of the holes (which 
they could not see) behind the shield, the correspond- 
ing light would come on. In fact, scores were preset 
from a control panel located in the adjoining ob- 
servation room. Thus, the model and child could 
receive predetermined scores in all conditions. The 
players were told that when they bowled winning 
scores of 70 or 80, they would receive two marbles 
from a dispenser attached to the game. These marbles
could be traded for prizes, and the more marbles 
they won, the better the prize they would get. 
Players were told to knock on the door at the end 
of their game so that the experimenter could then 
give them a prize in exchange for their marbles. The
model was instructed to play first. 

Just as she was leaving, the experimenter informed 
the players, almost as an afterthought, that marbles 
were being collected for toys for poor children. She 
indicated a red bowl which held a large number of 
marbles on a table across the room from the game 
and behind which was a picture of a child clothed
in rags. The experimenter stated that, if they wanted, 
players could give some of the marbles they won 
to the poor children, but that they did not have to 
if they did not want to. 

1. Performance: As soon as the experimenter left, 
the model played his game. He bowled 20 trials, 
winning on 5 of these trials and donating one of 
every two marbles he won to the poor children. On
winning trials the model would gather his two
marbles from the dispenser, cross in front of the
subject to the poor children’s bowl, deposit a marble 
in it, and say, “One for the poor children.” The other 
marble he put in a blue bowl provided for players 
beside the game, saying, “And one for me.” When he 
had finished playing, the model knocked on the door 
and was taken by the experimenter to another room 
to get his prize. 

2. Verbalization: When the experimenter left, the 
model stood before the game, musing slowly to himself
and taking care not to look at the child. He said:


Well, I guess they expect us to share our marbles 
with those poor children. Probably that’s what 
one had better do. I guess if I gave one out of 
every two marbles I win to the poor children, that 
would be fair. So that means whenever I get a 
score of 70 or 80, I would get two marbles. I’d
put one in the red dish for the poor children
[the model looked at the bowl] and one in the
blue dish for me. If I got any other score, I
wouldn’t win anything, so I really ought to hope
I get a lot of 70s and 80s. Then there would be a
chance to share some of the marbles with the
poor children.


As in the Grusec and Skubiski (1970) study, an 
attempt was made in this speech to equate the 
amount of information that the subjects received 
about sharing with the information they received in 
the performance condition. In the performance con- 
dition the subjects saw the model share five times, 
while in the verbalization condition that number of 
references were made to sharing. 

Immediately after this speech, before the model 
had a chance to begin his game, the experimenter 
returned to explain that the model’s boss had called 
on the phone and would like to talk to him. The 
model left the room, followed by the experimenter 
who had instructed the child to go ahead and play. 

Measurement of internalization and acquisition. 
The subject was left alone to bowl 40 trials, during 
which he won 20 marbles. The number of marbles 
he donated to the poor children’s bowl was observed 
through a one-way mirror by the experimenter and 
recorded. A check on the number of marbles donated
was also made at the end of the experiment, after
the subject had left, by counting the number of
marbles in the poor children’s bowl and subtracting
from this the number in it before the child began
to play. In every case, there was complete agree-
ment between these two numbers. When the child
had finished bowling, the experimenter returned and
announced that the model had had to go back to
work and so would not be seeing the child again.
She also stated that it would be possible for the
subject to win more marbles, and hence an even
better prize, if he could remember everything the
model had done when he bowled (performance con-
dition) or had said just before he was going to bowl
(verbalization condition). The subject was then given
a marble for every item of behavior correctly re-
called. Finally, the child was given a prize in ex-
change for his marbles and was asked not to discuss
the events in the trailer with any other children. 
Prizes were not distributed until all children at a 
given playground or from a given class in the school 
had been to the trailer. 
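
The cross-check on the donation measure described above (observed count versus bowl count after minus bowl count before) can be sketched as follows. This is a minimal illustration, not the experimenter's actual record keeping; the function names and the sample counts are hypothetical.

```python
def donated_by_bowl_count(count_before, count_after):
    """Marbles donated, inferred from the bowl contents.

    count_before: marbles in the poor children's bowl before the child played.
    count_after:  marbles in the bowl after the child had left.
    """
    return count_after - count_before


def counts_agree(observed_donations, count_before, count_after):
    """True when the observed tally matches the bowl-count difference."""
    return observed_donations == donated_by_bowl_count(count_before, count_after)


# Hypothetical session: the observer tallied 6 donated marbles, and the
# bowl went from 50 to 56 marbles.
print(donated_by_bowl_count(50, 56))
print(counts_agree(6, 50, 56))
```

Two independent counts of the same quantity guard against observer error through the one-way mirror; the text reports that they agreed in every case.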

Control groups. Children in the control groups 
were treated identically to those in the experimental 
groups except that they were not exposed to a 
model. They were brought to the trailer, were in- 
structed about the game and the poor children’s 
bowl, and were then left alone to bowl 40 trials.
Only 11-year-olds were run in a control condition. 


Results 


The mean number of marbles given to the
poor children in each performance and ver-
balization condition is presented in Table 1.
Analysis of variance of these data yielded a
significant main effect of method of presenta-
tion (F = 5.38, df = 1/72, p < .05) and an
interaction between method of presentation,
age, and sex (F = 4.13, p < .05). All other F
values were less than 1. Further analysis of
the terms involved in the three-way inter-
action showed that younger boys shared more
in the performance than in the verbalization
condition (t = 2.51, df = 18, p < .02, one-
tailed). None of the other comparisons be-
tween performance and verbalization reached

TABLE 1
MEAN NUMBER OF MARBLES SHARED IN EACH CONDITION

                      Boys               Girls
Condition         7 yr.   11 yr.     7 yr.   11 yr.
Performance        7.4     6.6        6.0     7.7
Verbalization      3.0     6.4        5.7     4.5
Control             -      1.0         -      2.2

statistical significance (t < 1 for older boys
and younger girls, and t = 1.83 for older
girls). There were no differences between per-
formance groups (t < 1 in every case).
Younger boys shared less in the verbalization
condition than did older boys (t = 1.94,
p < .05, one-tailed), while there was no dif-
ference between younger and older girls
(t < 1). It appears, then, that observation of
an altruistic model was necessary to produce 
sharing in younger boys. The amount of 
sharing they did in the verbalization condi- 
tion did not differ significantly from zero 
(t= 1.96, df = 9, p < .10, two-tailed; a two- 
tailed test was used here since no difference 
was predicted). Younger girls and all older 
children were equally affected by modeled 
behavior and verbal description of appropriate 
behavior. 

Finally, the mean number of marbles 
shared by 11-year-olds in the control groups 
is also reported in Table 1. The amount of 
sharing in the control groups did not differ 
significantly from zero (¢ = 1.25 for boys, and 
t = 1.10 for girls).

The number of children in the performance 
condition who were able to recall that the 
model had shared half of the marbles he had 
won were: younger boys, 7; younger girls, 9; 
older boys, 9; older girls, 10. In the verbaliza- 
tion condition these acquisition scores were: 
younger boys, 6; younger girls, 9; older boys, 
10; older girls, 10. The acquisition scores of 
interest in light of differences in sharing—the 
recall scores of younger boys in the perform- 
ance and verbalization conditions—clearly did 
not differ from each other. Thus, although 
7-year-old boys in the performance condition 
shared more than did 7-year-old boys in the 
verbalization condition, they did not recall 
the model’s sharing behavior any better. 


Discussion 


These data do not completely support the 
hypothesis the study was designed to test. In 
accord with prediction, performance and ver- 
balization were equally effective in producing 
sharing in older children. They were, contrary 
to prediction, also equally effective in pro- 
ducing sharing in 7-year-old girls. Verbaliza- 
tion did not, however, lead to any significant 
amount of sharing in 7-year-old boys. 
Recall the findings of the Grusec and
Skubiski (1970) study. Again, girls exposed
to an adult who verbalized appropriate be-
havior shared as much as children in a per-
formance condition, provided that the adult
was warm and rewarding. Extrapolating
from the findings of the present study,
then, it would appear that the model’s non-
nurturance in the Grusec and Skubiski ex-
periment may have served to suppress the
effectiveness of verbalization rather than
nurturance facilitating it.

The results of the present study might be 
explained in terms of 11-year-old boys and 
girls and 7-year-old girls having better inter- 
nalized a norm of social responsibility than 
7-year-old boys. There is, however, no evi- 
dence for sex differences in the speed with 
which the norm of sharing is acquired. Thus,
the following speculations about the data are 
offered. The first is that young girls are more 
concerned with behaving in ways that would 
be approved of by adults than are young 
boys. Although boys may respond to deeds, 
as they did in the performance condition, 
they may feel less pressure to conform to 
mere words, as appeared to be the case in 
the verbalization condition. Adult females do 
show greater concern with displaying socially 
desirable behavior and are more conforming 
in their behavior (Crutchfield, 1955). Pre- 
sumably, this is motivated in part, at least, 
by the importance of social approval—real or 
inferred—for them. If this differential impor- 
tance of social approval for males and females 
has already developed by the age of 7, then 
we might expect that girls in the verbaliza- 
tion condition who had been told that sharing 
is appropriate behavior might have been more 
highly motivated to do the socially acceptable 
thing than were boys. In the Grusec and 
Skubiski (1970) study, nonnurtured girls 
may not have felt inclined to share when 
reminders came from someone who was cool 
and aloof, and who might therefore be con- 
sidered a less credible source of information 
about the kind of behavior that people gen- 
erally like. It is also possible that for indi- 
viduals who desire social approval, more 
pleasure is derived from conforming to the 
suggestions of someone who is liked, and whose 
approval thereby has value, than of someone
who is not particularly liked. This explana- 
tion, of course, does not account for the shar- 
ing of both sexes in the 11-year-old groups. 
Here, one might resort to the original hypothe- 
sis that the norm of social responsibility has 
been fairly well internalized by the age of 11, 
that verbalization of appropriate behavior 
makes that norm more salient, and that such 
additional motivators as a desire for inferred 
social approval are no longer necessary for 
the production of altruistic behavior. 

Another explanation of the present find-
ings⁴ may lie in the fact that girls seem,
until about the age of 10, to be more verbally 
fluent than boys (Maccoby, 1966). If this 
means that 7-year-old girls are also more 
capable of comprehending the meaning of 
speeches and translating these speeches into 
action than are 7-year-old boys, this would 
account for the differences in effectiveness of 
verbalization. By the age of 11, boys have 
caught up to girls in verbal skills, and thus
no differences in the effectiveness of verbaliza- 
tion should be apparent, as indeed they were 
not, This explanation is incomplete when the 
Grusec and Skubiski (1970) data are con- 
sidered. Staub (1971) recently suggested that 
adult nurturance may decrease tension and 
fear of punishment for initiating novel action. 
Perhaps nonnurtured girls felt ill at ease 
and were therefore reluctant to translate the 
model’s words into action, even though they 
had the cognitive ability to do so. 

One additional finding of this first experi- 
ment should be noted. Even by the age of 11, 
an experimenter’s permissive reference to 
sharing did not produce altruistic behavior 
since older children in the control group did 
not share at all. Apparently, even at this age, 
sharing has to be made more salient at least 
by verbal reminder. 


EXPERIMENT II 
Method 


Subjects 


Subjects were 27 boys and 27 girls, aged 8 and 9
years. Fifteen boys and 15 girls were attending 
playgrounds organized by the North York Recrea- 
tion Department and participated in this second 


4 The author is indebted to Gordon Finley for 
suggesting this explanation. 
experiment during the same period of time as Experiment I
was being run. The rest of the subjects
came from the Institute of Child Study at the 
University of Toronto and were run approximately 
24 months later. 


Design 


A 3 × 2 factorial design was employed. Boys and
girls either saw a model perform a series of aggressive
responses, heard a model state that these were appropriate
responses to perform, or were not exposed to a
model. Five subjects in each group were from the
summer playground, and four were from the Insti- 
tute of Child Study. The same models and experi- 
menter served in the study as had been employed 
in Experiment I, with the same change in models 
between summer and fall. 


Procedure 


Performance, verbalization, and control group manipulations.
Children from the playground were
brought individually by the experimenter to the 
mobile laboratory in the school yard. Those at the 
Institute were run in a research room in the school 
itself. This room was approximately the same size 
as the experimental room in the trailer and was also 
equipped with a one-way mirror. Subjects were told
that the experimenter was from a toy company and 
wanted to find out about the kinds of toys with 
which children and adults prefer to play. As they 
entered the experimental room, the experimenter 
introduced the child to an adult who was also going 
to take part in the study. Located in the room were 
the following items: a panda bear, a mallet, five 
balloons, two safety pins, a book, a small doll, a bow 
and several arrows, a plastic boat, a plastic bottle, 
and a rope. In addition to these items, which were 
used as props in the display of aggression, a coloring 
book, crayons, and a puzzle were included so as 
not to make the experimental situation entirely 
demanding of aggression. After the introductions, 
the experimenter excused herself, saying she had 
a few things to do in the other room, and instructed 
the model to play first since he had been waiting 
the longest. The subject was told to play when the 
model had finished. 

1. Performance: As soon as the experimenter left 
the room, the model proceeded to play. He punched 
the doll and then kicked it across the room, shot
arrows at the panda, whipped it with the rope, and 
stabbed it in the stomach with the mallet. Then 
he pounded the bottle on the table, jumped on the 
boat, ripped a page from the book, and broke two 
balloons. At this point the experimenter knocked on 
the door, entered the room, and instructed the 
model to come into the next room. The subject 
was told he could now play with the toys. 

2. Verbalization: When the experimenter left, the 
model, musing slowly to himself and taking care not
to look at the child, said: 


If one were to make up games with these toys
one ought to punch the baby doll and kick it
across the room. Then one ought to shoot arrows
at the panda and whip it with that rope and stab 
it in the stomach with the mallet. Then one ought 
to pound the bottle on the table, stomp on the 
boat, rip a page from the book, and break a 
couple of those balloons. 


Immediately after the speech, before the model had 
a chance to begin playing with the toys, the experi- 
menter knocked and entered the room, explaining that 
the model’s boss had called on the phone and would 
like to talk to him. The model left the room, and
the experimenter then told the child to go ahead and 
play with the toys. At this point, the experimenter, 
too, left the room. 

3. Control: Each subject in this condition was 
brought to the experimental room, was told the 
experimenter wanted to find out what kinds of toys 
children prefer to play with, and was instructed to 
play with the toys. No model was present. The 
subject was then left alone by the experimenter. 

Measurement of internalization and acquisition. 
The child was observed through a one-way mirror 
for 10 minutes, and a complete written description 
was made of his activities by both models. Analysis 
of these descriptions later revealed that the recorders 
were in agreement 98% of the time. At the end of 
10 minutes, the experimenter returned, announcing 
that the model had had to return to work and so 
would not be seeing the child again. She also stated 
that it would be possible for the child to win a 
prize if he could remember everything that the 
model had done when he had played with the toys 
(performance condition) or had said before he 
started to play with the toys (verbalization condi- 
tion). Approval and encouragement were given for 
every item correctly recalled, and the child was 
occasionally reminded of the prize that could be his. 
Subjects in the control condition, of course, could 
not be tested for acquisition. Finally, all subjects 
were given a prize and asked not to discuss events 
that had just taken place with any other children. 
Again, as in the first experiment, prizes were not 
distributed until children from a given playground 
or class had taken part in the study. 

When the subject left, any props that showed
signs of wear were replaced before the next subject 
was run. 
Results


The mean number of aggressive responses 
similar to those displayed by the model that 
were exhibited by children in the various 
conditions is presented in Table 2. Analysis 
of variance of these data yielded a main 
effect of method of presentation (F = 11.01, 
df = 2/48, p < .01). The effect of sex approached
significance (F = 3.65, p < .10),
with boys tending to display more aggression
than girls. The interaction between method
of presentation and sex was not significant
(F = 1.88). Further analysis of the significant
main effect yielded a t of 2.00 (df = 34,
p < .06, two-tailed) for the comparison between
performance and verbalization groups
and a t of 2.66 (p < .02) for that between
verbalization and control groups. Thus, chil- 
dren in the performance conditions were more 
aggressive than those in the verbalization 
conditions, with this difference falling just 
short of an accepted level of statistical signifi- 
cance. Subjects in the verbalization conditions 
clearly displayed more aggression than those 
in the control conditions. 

Additional aggressive responses performed 
by subjects beyond those displayed by the 
model are also reported in Table 2. Analysis 
of variance of these data revealed no main 
effects of either method of presentation 
(F = 2.00) or sex (F <1), nor was there 
a significant interaction between the two
(F = 1.48). Thus, the experimental variables 
did not appear to have any effect on the sub- 
jects’ willingness to display novel aggression. 

Finally, the number of aggressive responses 
similar to the model’s that were made by 
subjects in the control condition did not differ 
from zero (t < 1 for both boys and girls).
Novel aggressive responses did differ from 


TABLE 2
MEAN NUMBER OF BEHAVIORS DISPLAYED AND RECALLED BY SUBJECTS IN ALL CONDITIONS

                                                      Boys                                    Girls
Variable                               Performance  Verbalization  Control   Performance  Verbalization  Control
Model’s behaviors displayed                3.33          2.22        .11         1.89          1.00        .44
Novel aggressive behaviors displayed       4.44          5.56       1.22         5.11          1.56       1.89
Model’s behaviors recalled                 7.00          5.56        -           6.67          4.11        -
zero for boys (t = 2.39, df = 8, p < .05)
but not for girls (t = 1.05).

The mean number of the model’s behaviors 
recalled in the performance and verbalization
conditions is also reported in Table 2. Analysis
of variance of these data produced a significant
main effect of method of presentation
(F = 15.32, df = 1/32, p < .001). The effect
of sex was not statistically significant (F =
3.03), nor was the interaction between method
of presentation and sex (F = 1.18). Thus,
children in the performance condition were 
able to recall more of the model’s behavior 
than were children in the verbalization condi- 
tion. 

The reason for this difference in recall can- 
not, of course, be clear from the present 
study. The actual performance of aggression 
may have been more salient than a verbal 
description of the same behaviors. If this 
were so, then the marginally significant dif- 
ference in display of aggressive behaviors be- 
tween the performance and verbalization 
groups might have resulted from the fact that 
subjects in the performance group knew more 
possible behaviors that they might exhibit. On 
the other hand, if performance subjects were 
more aggressive than verbalization subjects 
for some other reason, acquisition differences 
might have occurred because performance sub- 
jects had had more practice at recalling the 
model’s behaviors. In order to assess these 
possibilities, an additional study was carried 
out in which subjects were asked to recall the 
model’s acts or words immediately after he 
left the experimental room. 


EXPERIMENT III


Method 
Subjects 
Subjects were 10 boys and 10 girls, aged 8 and 9 
years, from the same Metropolitan Toronto separate 
school as those children employed in Experiment I.
This study was run immediately upon completion 
of Experiment I. 


Design 

A 2 × 2 factorial design was employed. Boys and
girls either saw a model perform aggressive responses 
or state that it was appropriate to do so. The models 
who had served in Experiment II with the Institute
children also were employed in this study. The same 
female was the experimenter again, and all subjects 
were run in the mobile laboratory. 
Procedure


The procedure was identical to that for the per- 
formance and verbalization groups of Experiment II 
except that subjects were not allowed to play with 
the toys. As soon as the model left the room, the 
experimenter returned and asked them to recall all 
the model had done or said. Again, material and 
social incentives were offered for all accurate recall. 


Results 


The mean number of items of the model’s 
behavior recalled by boys were: performance, 
7.00; verbalization, 5.00. For girls they were: 
performance, 6.60; verbalization, 6.20. Analysis
of variance of these data yielded an F of
7.20 for method of presentation (df = 1/16,
p < .05), an F of < 1 for sex, and an F of
3.20 for the interaction between method of
presentation and sex (p < .10). It is evident,
then, that more of the aggressive behaviors 
were recalled by subjects in the performance 
condition than by those in the verbalization 
condition. 


Discussion 


From these last two experiments, it appears 
that when 8- and 9-year-old children observe 
an adult actually perform a series of aggres- 
sive behaviors, they show a decided tendency 
(p < .06) to reproduce more of these behav- 
iors than when they merely hear the adult 
state that one ought to be aggressive in a 
particular way. This latter condition, however, 
clearly did facilitate the display of aggres- 
sion, for it led to greater production of the 
adult’s behaviors than when no performing or 
verbalizing model was present. In this latter
way, sharing and aggression are certainly dif- 
ferent, at least for boys. While verbalization 
elicited a fair amount of aggression from 8- 
and 9-year-old boys, relative to a control 
group, it elicited no sharing at all in the same
age group (Grusec & Skubiski, 1970). 

The effects of performing or verbalizing 
aggressive behaviors seemed to be quite spe- 
cific to the model’s behaviors. There was no 
difference between the two experimental 
groups and the control group in their display 
of novel aggression. Thus, the presence of a 
performing or verbalizing model apparently 
did not convey to the subjects that they were 
freer to aggress in any way than if they had 
not seen or heard the model but only in the
ways indicated by the model. 

The difference between the effects of per- 
formance and verbalization seems at first in- 
spection accountable for by differences in the 
amount learned by the two groups. Watching 
the performance of a relatively complex set 
of aggressive behaviors apparently produced 
better learning than just hearing them de- 
scribed. This difference presumably was trans- 
lated into the observer’s behavior. Children 
in the verbalization condition simply could 
not remember as many things to do as could 
the children in the performance condition. (It 
will be recalled, of course, that acquisition 
differences could not account for the obtained 
results of Experiment I—here the behaviors 
displayed by the model were fewer and less 
complex.) Thus, one could suggest that if 
modeled aggressive behaviors were simple, and 
therefore easily learned, performance and 
verbalization would be equally effective. One 
finding, however, that was nearly statistically 
significant, casts some doubt on this as a 
complete explanation. This was the Method of 
Presentation × Sex interaction of Experiment
III, which yielded an F with an associated p 
value of less than .10. If one were to compare 
the mean recall scores for performance and 
verbalization, the difference for boys is quite 
significant (t = 3.17, df = 8, p < .01), while
the comparison for girls is not (t < 1).
Boys remembered more performed than 
verbalized material, while girls tended to re- 
member both equally well. For boys, then, 
differences in acquisition do seem to account 
for the differential effectiveness of perform- 
ance and verbalization. For girls, the picture is 
less clear. Repeatedly, it has been found that 
girls display less aggressive behavior than 
boys (e.g., Bandura, 1965; Sears, 1951; Ter- 
man & Miles, 1936). Presumably, this is at 
least partly because our society discourages 
physical aggression in girls while encouraging 
it, to a degree, in boys (Kagan, 1964; Mis- 
chel, 1966). If girls with their greater verbal 
facility (Maccoby, 1966) were able to learn 
verbalized material as well as performed ma- 
terial, their greater willingness to display ag- 
gression in performance conditions may have 
reflected a fear of the consequences of aggression.
Seeing a model perform aggression unpunished
certainly would be more indicative
to a female subject that she could be aggressive
with impunity. A model who merely verbalized
aggression could in no way give her
this assurance. (Note that the model was 
always a female. Had it been a male, of 
course, girls might have decided the behavior 
was appropriate for him but not for them.) 
The fact that aggression was greater in the 
verbalization than control condition, with no 
tendency toward an interaction between sex 
and method of presentation, does indicate, of 
course, that verbalization had some effect on 
the display of aggression for girls. At any rate, 
the behavior of girls in this situation—as it 
was also in the sharing situation—seems some- 
what more ambiguous than that of boys and 
must be the object of further inquiry. 
POSITIVENESS, AND REVEALINGNESS ON
SELF-PRESENTATION * 
It was predicted that ingratiators would match the self-presentation behavior
of their targets. Generally, this was found to be the case; ingratiators were
more positive to a bragging rather than a modest other, and they were more
revealing to a revealing other. Both ingratiators and controls responded to a
highly revealing other with increased positiveness. Implications for the
psychology of ingratiation and the structuring of interactions are discussed.


An individual defines himself for others in
part through his verbal self-presentations
(Goffman, 1955). Therefore, distorted presen-
tations may be useful in at least some situa-
tions, a fact of which salesmen and job appli-
cants seem particularly well aware. Jones
(1964) has focused on self-presentations as
behavioral weapons that an individual can
employ in order to misrepresent himself to
another, thereby inclining that other to re-
spond to him as the presenter desires. It has
been shown that ingratiators project more
positive self-images than subjects not given
instructions to ingratiate (Gergen, 1965;
Jones, Gergen, & Davis, 1962).

This attention to the exploitative aspects of
self-presentations can be compared with an
approach that emphasizes the cooperative as-
pects. Goffman (1959, 1963) argues that self-
presentations are influenced by normative fea-
tures of the situation; proper self-presentation
facilitates role enaction and consequently en-
hances smooth social interaction. Goffman’s
implicit argument is that the nature of a
given self-presentation is more strongly influ-
enced by normative and other features of the
interaction than by personal motivation. Some
research has shown that the positiveness of


1 This research was supported in part by National
Science Foundation Grant GS-1946 to the senior
author. The article is based on a senior honors thesis
submitted by the junior author to the Amherst Col-
lege Psychology Department.

2 Requests for reprints should be sent to David J.
Schneider, who is now at the Department of Psy-
chology, Brandeis University, Waltham, Massachu-
setts 02154.

3 Now at Cornell University.


self-presentation is responsive to situational 
changes even in the absence of explicit ingrati- 
ation motivation (Gergen & Taylor, 1969; 
Gergen & Wishnov, 1965; Schneider, Eustis, 
Manzolati, Miller, & Gordon, 1971). 

Another dimension of self-presentation that
may have both ingratiation and structuring
functions is self-disclosure. Goffman (1963)
has analyzed this aspect of self-presentations,
and there is evidence that self-revealing behavior
is both a function of personality
(Jourard, 1964) and the behavior of the other 
in the situation (Altman & Haythorn, 1965; 
Taylor, 1968; Worthy, Gary, & Kahn, 1969). 

Both ingratiators and those responding more 
casually to the normative features of a social 
situation may respond to the behavior of the 
other person, in particular, to his self-presen- 
tation. The self-presentation of the other pro- 
vides information about what that person 
regards as appropriate and permissible behavior
in the situation, and it defines that
other in a socially significant way. That social 
definition in turn both determines how worthy 
the other is of future attention and by com- 
parison how one’s self will be defined in the 
situation. Thus, for example, the braggart in 
effect says that bragging is appropriate, he is 
worthy of further attention, and that, to the 
extent the other is not bragging, he has lower 
status than the braggart. There has been some 
research attention to how one person’s self- 
presentation affects another’s. 

Gergen and Wishnov (1965) found that
subjects were more positive to a bragging than 
to a modest target, but it was unclear whether 
this occurred because of a status threat posed
by the bragging target or because subjects 
wanted to ingratiate with such a desirable 
person. Worthy et al. (1969) showed that 
revealing behavior from a target was recipro- 
cated, but given that disclosure was also high 
to liked as opposed to disliked others, ingrati- 
ation motives cannot be ruled out as a factor. 
One of our concerns then was whether the 
matching of targets’ self-presentations by 
subjects would be intensified by explicit in- 
gratiation motivation. 

Byrne (1969) has summarized ample evi- 
dence to support the fact that we like those 
who are similar to us. Jones and Jones (1964) 
have indicated that conformity to the target’s 
opinion is a good ingratiation tactic, and 
Jones, Gergen, Gumpert, and Thibaut (1965) 
have shown that ingratiating self-presenta- 
tions match the values stressed by the target. 
Therefore, it seems that making one’s self out 
to be as similar as possible to one’s target 
(“matching” his behaviors) serves an ingra- 
tiator well, and we expected to obtain this 
effect from our subjects. 

The overall design of the study was a
2 × 2 × 2 factorial (ingratiation-control versus
high and low target positiveness versus
high and low target revealingness) with both 
the positiveness and revealingness of subjects’ 
self-presentations being examined. Three spe- 
cific hypotheses were advanced. 

First, we expected ingratiators to be more 
positive about themselves than controls; this 
effect has been found so consistently that we 
would be surprised if it did not appear in the 
present context (Jones, 1964).

Second, we predicted that ingratiators 
would be more positive to high positive tar- 
gets than to low positive targets and more 
revealing to high revealing targets than to 
low revealing targets; these effects should be 
stronger for ingratiators than controls. 

Third, we hypothesized that ingratiators 
would be less self-revealing than would con- 
trols. Since bragging seems so integral a part 
of the ingratiators’ strategy, an ingratiator 
who is intimate with his target runs a very 
strong risk of being discovered a braggart. 
Therefore, ingratiators should studiously avoid 
any intimacy with their targets. 
METHOD
Subjects 


One hundred and one male undergraduates from 
the University of Massachusetts were run in the 
eight experimental conditions. Five of the subjects 
were dropped from the final analysis due to sus- 
picion or procedural difficulties, leaving 12 subjects 
per cell for the various analyses. Subjects were 
scheduled in groups of 4 (but several groups contained
only 3 subjects because of scheduling problems)
and were paid $1.50 each for their participation.
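
The subject counts reported here can be checked against the 2 × 2 × 2 design with a few lines of arithmetic. The sketch below is only an illustration: the factor names and level labels are our own shorthand, not terms fixed by the article.

```python
from itertools import product

# Hypothetical labels for the three two-level factors of the design.
factors = {
    "instruction": ("ingratiation", "control"),
    "target_positiveness": ("high", "low"),
    "target_revealingness": ("high", "low"),
}

# Every combination of factor levels is one experimental condition (cell).
cells = list(product(*factors.values()))

subjects_run = 101      # reported number of subjects run
subjects_dropped = 5    # reported number dropped from the analysis

# Subjects retained, split evenly over the cells as the article reports.
per_cell, remainder = divmod(subjects_run - subjects_dropped, len(cells))
print(len(cells), per_cell, remainder)
```

The retained subjects divide evenly over the eight cells, which agrees with the reported 12 subjects per cell.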


Procedure 


Each member of each group of subjects first com- 
pleted two forms, one measuring his general self- 
positiveness and the other measuring his general 
willingness to be intimate with others. After describ- 
ing the experiment as “one of a series of researches 
into the dynamics of interpersonal communication,” 
the experimenter told the group of subjects that 
this particular study was concerned with “how vari- 
ous kinds and amounts of personal information ex- 
changed between two strangers might influence the 
accuracy with which they were able to evaluate each 
other.” 

Subjects were then told that they were each paired 
with another student who was waiting for them 
down the hall and that they and their partners 
would be exchanging some personal information and 
then trying to evaluate each other. The order of 
events was described as follows: 


First you will each receive some personal infor- 
mation about your partners—information in the 
form of the scores he made on a number of per- 
sonality tests. Second, you will each send a writ- 
ten self-descriptive communication to your partner. 
Third, you and your partner will each evaluate 
the other on the basis of whatever information 
you then have about each other. Finally, we will 
have the two of you meet for a brief interaction,
after which you can change your evaluation if
you want to. 


After making sure that the subjects understood 
this order of events, the experimenter described the 
“personality score information” that each would
receive and then passed out 3 stapled pages of 
“information” to each subject. (This material con- 
tained the target positiveness and target revealingness 
manipulations.) After studying this material for several
minutes, the subjects were given a second set of
positiveness and revealingness forms to fill out, and 
they were told that this set of forms would be sent 
to their partners, who would then be trying to eval- 
uate them on the basis of how they had filled out the 
“communication” forms. Before the subjects filled out 
these forms, one-half of the groups were given an 
ingratiation instruction. One subject in each group
of four was given one of the four targets; when
only three subjects were run, one of the targets was
not given to any subject; ingratiators and controls
were never run in the same group. After filling out 
the second set of positiveness and revealingness 
forms, the subjects were asked to evaluate their part- 
ners on another form (which contained several de- 
pendent measures). Finally the subjects were de- 
briefed, paid, and dismissed. 


Manipulations 


Target positiveness and target revealingness were 
manipulated by giving the subjects certain kinds of 
personality score information about their “partners.” 
The first of the 3 pages of such information given 
each subject consisted of a Cutick General Self- 
Evaluation Questionnaire (Diggory, 1966). This was 
prominently scored in red pencil and described to 
some extent by the experimenter as measuring “how 
positive about himself a person usually is.” The 
partners either scored above the 90th percentile (high 
target positiveness) or below the 30th percentile 
(low target positiveness). 

The second page consisted of a “Manzolati Self- 
Disclosure Scale” (invented for the occasion to look 
as though it might measure self-disclosure). This 
too was scored in thick red pencil and described at 
some length by the experimenter as a measure of 
the partner’s general willingness to be intimate with 
other people. Partners again scored either above the 
90th percentile (high target revealingness) or below 
the 30th percentile (low target revealingness).

Subjects were also given a third page of informa- 
tion about their partners. On this page were listed 
a number of scores the fictitious partner had sup- 
posedly made on a number of personality tests. 
These scores were all near the 50th percentile except 
for the scores on the crucial tests. Scores for two 
tests, an Edwards Personal Preference Schedule
Self-Concept and Jourard Self-Disclosure, were 
within 4 percentile points of the previous extreme 
percentile scores on the Cutick Self-Evaluation and 
Manzolati Self-Disclosure tests. Furthermore, the 
subjects were instructed to pay most attention to 
those scores which “were widely different from the 
50th percentile, as these scores convey information 
about how your partner may be different from most 
people.” 

The ingratiation instruction that was given to half 
of the subjects was: “Try to make as good an im- 
pression as you can upon your partner; make him 
give you a good evaluation.” Controls were given 
no instructions about how to present themselves on 
the “communication” forms. 


Dependent Measures 


1. Positiveness forms: Each of the two positiveness 
forms consisted of 20 adjectival phrases, 10 of which 
were socially desirable (“creative thinker”) and 10
of which were socially undesirable (“hard to warm 
up to”). Subjects were required to rate how de- 
scriptive of them (5-point scale) each phrase was. A 
“quite descriptive” response to a desirable phrase and 
a “quite undescriptive” response to an undesirable 
phrase were scored as 5 (i.e., socially undesirable
phrases were reverse scored). The total of a subject’s
scores on each of the 20 items was his score on
that positiveness form; the higher the score, the 
more self-positive the subject had been. Since it was 
known from earlier work that these forms were well 
correlated and that mean responses were roughly 
equal (Schneider et al., 1971), they were not
presented in balanced order. Form “X” was al- 
ways given to the subjects at the beginning of the 
experiment, and Form “Y” was always presented 
later as the “communication.” 

2. Revealingness forms: Each of the two reveal- 
ingness forms contained 40 items taken from Taylor 
and Altman’s (1966) list of 671 intimacy scaled 
items. Their items were placed into five categories of 
intimacy value. For our scale, items were presented 
in groups of five, and the subjects were required to 
select from each group of five the two items they 
most wanted to discuss. Each group of 5 consisted 
of one very nonintimate item (“names of places I 
grew up”), scored as 1; one item of each of the 
three intermediate intimacy ranges, scored as 2, 3, 
and 4; and one very intimate item (“guilt feelings I 
may have about my sexual behavior”), scored as 5. 
There were eight such groups of 5 items, and a 
subject’s revealingness score on one of these two 
forms was the sum of the scores of the 16 items he 
selected as most willing to discuss; the higher the 
score, the more intimate the subject had indicated 
himself willing to be. These two forms were given 
to the subjects in balanced order; Form “R” ap- 
peared first for half of the subjects and Form “S” 
first for the other half. 

3. Other dependent measures: The form on which 
the subjects were asked to evaluate their partners 
(the “Evaluation Sheet”) contained a number of 
dependent measures that were included as manipu- 
lation checks and to further assess the subjects’ reac- 
tions to the targets. Subjects responded to questions 
asking: “How much did you like your partner?” 
“How much do you want to interact with this person 
further?” “How comfortable did you feel in com- 
pleting the forms for him?” “How good a rating do 
you think he gave you?” and “How accurate were 
you in describing yourself to him?” As a manipula- 
tion check of target positiveness and target reveal- 
ingness, the subjects were required to select from a 
list of 14 adjectives any that they felt described 
their partner. These adjectives were happy, aloof, 
open, threatening, secretive, confident, defensive, out- 
going, withdrawn, insecure, proud, shy, unappealing, 
sad. 
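
The scoring rules for the positiveness and revealingness forms can be sketched in code. This is an illustration under stated assumptions, not the authors' procedure: the function names are hypothetical, and the 5-point ratings are assumed to be coded 1 to 5 with 5 as "quite descriptive," so that reverse scoring an undesirable phrase is 6 minus the rating.

```python
def positiveness_score(ratings, undesirable_items):
    """Sum the item ratings, reverse scoring socially undesirable phrases.

    ratings: dict mapping item label -> rating on an assumed 1-5 scale.
    undesirable_items: set of labels for the socially undesirable phrases.
    """
    total = 0
    for item, rating in ratings.items():
        if not 1 <= rating <= 5:
            raise ValueError(f"rating for {item!r} must be between 1 and 5")
        # A low rating on an undesirable phrase counts as a high score.
        total += (6 - rating) if item in undesirable_items else rating
    return total


def revealingness_score(selections):
    """Sum the intimacy values of the two items chosen from each group.

    selections: one pair per group of five items, each pair holding the
    intimacy values (1-5) of the two items the subject chose to discuss.
    """
    total = 0
    for chosen in selections:
        if len(set(chosen)) != 2 or any(not 1 <= v <= 5 for v in chosen):
            raise ValueError("choose two different items valued 1-5 per group")
        total += sum(chosen)
    return total


# Example: a maximally self-positive subject on a 20-item form.
desirable = {f"d{i}": 5 for i in range(10)}
undesirable = {f"u{i}": 1 for i in range(10)}
print(positiveness_score({**desirable, **undesirable}, set(undesirable)))

# Example: the least and most intimate possible choices over eight groups.
print(revealingness_score([(1, 2)] * 8), revealingness_score([(4, 5)] * 8))
```

Under these assumptions a positiveness form ranges from 20 to 100, and a revealingness form of eight groups ranges from 24 to 72.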


RESULTS 
Manipulation Checks 


While, initially, the adjective checklist was 
designed simply to be a rough check on the 
perceptions of the manipulations, a more pre- 
cise technique was developed. First, the data 
for all of the subjects were combined, and 
conditional probabilities were calculated between
every pair of adjectives (e.g., given that
A was checked, the probability that B also 
would be checked). Traits were grouped into 
clusters such that each trait had an average 
conditional probability of implying and being 
implied by the other traits in the cluster of at 
least .50. Three clusters appeared: (a) with- 
drawn-shy-insecure, (b) defensive-secretive, 
(c) confident-outgoing-proud-open. The other 
traits, happy, sad, unappealing, and threaten- 
ing, had low probabilities of endorsement and 
were not further considered. Aloof was ana- 
lyzed separately. Each subject was then given 
a score for each cluster that was the number 
of adjectives checked for that cluster (e.g., a 
subject who checked withdrawn and shy 
would be given a score of 2 for the withdrawn- 
shy-insecure cluster). These scores were then 
submitted to analysis of variance. 
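
The clustering and scoring steps described above can be illustrated with a minimal Python sketch. The checklist data are invented for illustration and are not the study's data. The function names, and the reading of the .50 criterion as the mean of P(B | A) and P(A | B) over the other traits in a candidate cluster, are assumptions about a procedure that the text describes only briefly.

```python
from itertools import permutations


def conditional_probabilities(checklists, adjectives):
    """Return P(b checked | a checked) for every ordered pair (a, b)."""
    probs = {}
    for a, b in permutations(adjectives, 2):
        n_a = sum(1 for checked in checklists if a in checked)
        n_ab = sum(1 for checked in checklists if a in checked and b in checked)
        probs[(a, b)] = n_ab / n_a if n_a else 0.0
    return probs


def mean_mutual_implication(trait, cluster, probs):
    """Average of P(other | trait) and P(trait | other) over the rest of the cluster."""
    others = [t for t in cluster if t != trait]
    values = [probs[(trait, o)] for o in others] + [probs[(o, trait)] for o in others]
    return sum(values) / len(values)


def cluster_score(checked, cluster):
    """Number of adjectives in the cluster that one subject checked."""
    return len(set(checked) & set(cluster))


# Hypothetical checklists from four subjects (illustrative only).
checklists = [
    {"withdrawn", "shy"},
    {"withdrawn", "shy", "insecure"},
    {"shy", "insecure"},
    {"open", "confident"},
]
cluster = ("withdrawn", "shy", "insecure")
probs = conditional_probabilities(checklists, cluster)

# Under the assumed criterion, a trait belongs to the cluster if its
# average mutual implication with the other traits is at least .50.
keeps = {t: mean_mutual_implication(t, cluster, probs) >= 0.50 for t in cluster}

# The example from the text: checking withdrawn and shy gives a score of 2.
score = cluster_score({"withdrawn", "shy"}, cluster)
```

The per-subject cluster scores produced this way are the quantities that would then be entered into the analysis of variance.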

The results for the major clusters confirm 
our manipulations. Bragging others are seen 
as higher on the confident-outgoing-proud-
open cluster (F = 128.67, p < .001), as lower
on the withdrawn-shy-insecure cluster (F =
61.75, p < .001), and as moderately lower on
the defensive-secretive cluster (F = 2.84, p
< .10). High revealing subjects are seen
as more confident-outgoing-proud-open (F = 
67.47, p < .001), as less withdrawn-shy-inse- 
cure (F = 6.86, p < .05), and as less defen- 
sive and secretive (F = 28.71, p < .001). The 
only other effect was an interaction between 
the two manipulations for confident-outgoing-
proud-open (F = 4.93, p < .05), which was
produced by an unusually high number of 
adjectives checked in the high positive-high 
revealing cell. 

While only 18 subjects checked aloof as 
characteristic of their partner, 17 of these 
were in the high positive cells (χ² = 20.00, p
< .001). Eleven of 18 were in the low reveal-
ingness cells, and this split is not significant
(χ² = 1.42, ns).

For the scale items, there was a tendency 
for ingratiators to indicate that they had been 
less accurate in presenting themselves than
did controls (F = 3.00, df = 1/88, p < .10);
also, ingratiators felt that they had received
better evaluations from their targets than did
controls (F = 10.78, df = 1/88, p < .01).
Both of these findings suggest that ingrati-
ators were concentrating on making a good
impression to a greater extent than were con-
trols.


Self-Presentational Data 


An analysis of covariance was applied to 
the positiveness data, with the subjects’ per- 
formance on Form X used as the covariate 
and their performance on Form Y as the cri- 
terion. Table 1 presents the results of the 
three-way analysis of covariance, and Table 2 
contains the adjusted positiveness means for 
each cell. Separate analyses were also done 
for ingratiators and controls. 

Hypothesis I was that ingratiators would 
be more positive than controls, and this was 
strongly confirmed (F = 10.77, df = 1/87, p 
<.01). 

The patterns of self-presentation positive- 
ness by ingratiators and controls were dif- 
ferent as evidenced by the three-way inter- 
action (F = 5.85, df = 1/87, p< .05). As 
predicted by Hypothesis II, ingratiators were 
more positive to the positive other (F = 4.37, 
df = 1/43, p < .05). The effect was not sig- 
nificant for controls (F = 1.72, df = 1/43, 
ns), but the effect for ingratiators was not 
significantly more pronounced than for con-
trols (Ingratiation-Control × Positiveness in-
teraction F = 1.42, df = 1/87, ns). An unex-
pected effect was that both ingratiators (F =
4.52, df = 1/43, p < .05) and controls (F =
7.51, df = 1/43, p < .01) were more positive
to the high revealer (F = 9.45, df = 1/87, p
< .01).


TABLE 1 


ANALYSIS OF COVARIANCE FOR SELF-PRESENTATION 
POSITIVENESS AS A FUNCTION OF INGRATIATION- 
CONTROL, TARGET POSITIVENESS, AND 
TARGET REVEALINGNESS 


Source                      df      MS        F
Ingratiation-control (A)     1    402.16    10.77**
Target positiveness (B)      1    244.63     6.55*
Target revealingness (C)     1    352.88     9.45**
A × B                        1     53.18     1.42
A × C                        1      9.45      .25
B × C                        1     46.54     1.25
A × B × C                    1    218.35     5.85*
Error                       87     37.34

* p < .05.
** p < .01.

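
As a consistency check on Table 1, each F ratio is the mean square for the effect divided by the error mean square (37.34, df = 87). The short Python sketch below recomputes the ratios from the tabled mean squares; the variable names are illustrative only.

```python
MS_ERROR = 37.34  # error mean square from Table 1 (df = 87)

# Mean squares for each source in Table 1 (each effect has df = 1).
mean_squares = {
    "A": 402.16,      # ingratiation-control
    "B": 244.63,      # target positiveness
    "C": 352.88,      # target revealingness
    "AxB": 53.18,
    "AxC": 9.45,
    "BxC": 46.54,
    "AxBxC": 218.35,
}

# F = MS(effect) / MS(error), rounded to two decimals as in the table.
f_ratios = {source: round(ms / MS_ERROR, 2) for source, ms in mean_squares.items()}
```

The recomputed values agree with the tabled F ratios to two decimal places.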
TABLE 2


FINAL ADJUSTED POSITIVENESS SCORES
FOR ALL CONDITIONS


                    Ingratiators              Controls
Positiveness      High        Low         High        Low
                revealing   revealing   revealing   revealing
High              86.18       78.51       76.00       73.03
Low               78.69       77.03       74.50       71.06


An analysis of covariance was also applied 
to the revealingness data, with the subjects’ 
responses to the first of the two revealingness 
forms they completed as the covariate and 
their responses to the second form as the cri-
terion. Table 3 presents the adjusted reveal-
ingness means for all cells.

The only significant effect was that the 
subjects were more revealing to high revealing 
targets (F = 4.27, df = 1/87, p < .05). Sepa- 
rate two-way analyses of covariance for in- 
gratiators and for controls indicated this effect 
to exist for ingratiators (F = 5.35, df = 1/43,
p < .05) but not for controls (df =
1/43, ns). Again, however, the Ingratiation-Control ×
Revealingness interaction was not significant,
F < 1.

Hypothesis III, which suggested that ingra-
tiators would be less revealing than controls,
was not supported (F < 1). 


Other Dependent Measures


The data from the five dependent questions 
that asked the subjects for their perceptions 
of their targets and the situation were sub- 
jected to three-way analyses of variance. 
There were some appropriate trends for these 
measures, but no results reached significance
other than those reported under the heading
Manipulation Checks.


DISCUSSION


The hypotheses of the present study held 
up reasonably well. Ingratiators were more 
positive than controls, as predicted; how- 
ever, ingratiators were not less revealing as we 
had predicted. The general prediction that
ingratiators would match their targets both in 
terms of the positiveness and revealingness 
dimensions of self-presentation was confirmed, 
although they did not differ significantly in
this regard from the controls. The major un-
anticipated result was the fact that both in-
gratiators and controls were more positive to
a revealing than to a nonrevealing other.

That the data support our hypothesis that 
ingratiators would be more positive than con- 
trols (p < .01) merits little discussion here. 
This effect has been found now in many dif- 
ferent ingratiation situations. Far more in- 
triguing is our failure to obtain any gross dif- 
ferences between ingratiators and controls 
with regard to their overall level of revealing- 
ness. 

Our original expectation was that revealing- 
ness would be too dangerous for an ingratiator 
to use—he might be found out a braggart— 
and that he should, therefore, avoid any 
intimacy at all with his target. There may be 
one or two other factors operating here that 
can account for our failure to obtain the 
predicted effect. First, it has often been 
argued that by the act of giving others 
personal information about himself, an in- 
dividual gives away some degree of control 
over himself. Any personal disclosure is risky 
because it gives another a lever by which he 
can move the discloser. The additional moti-
vation to conceal introduced by the ingratia-
tion manipulation may simply have been
overwhelmed by the more powerful motives
in both conditions to avoid intimacy. This
position is somewhat supported by the fact
that the average intimacy value of the items
chosen on Forms R and S (by all subjects)
was just less than the median value of 3, a
relatively nonintimate level of responding.

A second possibility here is that despite 
the obvious danger to an intimate ingratiator,
the norm in our society that says that reveal-


TABLE 3 


FINAL ADJUSTED REVEALINGNESS 
FOR ALL CONDITIONS


Ingratiators Controls 
Positive- 
ness 
High Low High Low 
revealing revealing revealing revealing 
High 47.80 42.68 47.19 43.74 
Low 41.96 43.91 45.45 


| 


45.24 
ingness is a good thing might make an ingratiator
unwilling to be totally nonintimate, for
this could lead to his being judged unattrac- 
tive by his target. This is all to say that the 
opposing influences of (a) the norms that say 
revealingness is good and (b) the danger in 
which an intimate ingratiator might find 
himself might simply cancel each other out, 
with the result that ingratiators are generally 
no more and no less revealing than controls.

Our hypothesis that predicted that ingrati-
ators would “match” the behavior of their
targets (in an attempt to be perceived as 
similar to them) fared much better. Ingratia- 
tors were both more positive to high positive 
targets than they were to low positive targets 
(p < .05) and more revealing to high reveal- 
ing targets than they were to low revealing 
targets (p < .05). There is reasonable sup- 
port for the general hypothesis that making 
oneself out to be similar to one’s target is a 
common and a valuable ingratiation tactic. 
We think that this finding is particularly 
useful because it provides a more articulated 
explanation for several other findings in this 
area. Jones et al. (1965) found that self- 
presentations were used by ingratiators to 
match the values stressed by their targets. 
One plausible explanation for this finding is 
that their ingratiators were trying to make 
themselves out to be similar to their targets
(assuming, of course, that they felt that their 
targets would act in accordance with their 
own values). We also have here an explana- 
tion for why conformity has been shown to 
be so appropriate in ingratiation situations 
(Jones & Jones, conformity again 
is a tactic by which an ingratiator can send 
out information that he is similar to his target, 
and he may do so because he feels that this 
perception of similarity will lead to liking. 
We propose that any behavior that may lead 
to a perception of similarity by the target 
(similarity between the ingratiator and the 
target) may be valuable for the ingratiator. 
There are, of course, additional reasons why 
the matching behavior might occur. One pos- 
sibility is that the behavior of others in the 
situation constitutes a basis for appropriate- 
ness, and ingratiating people would be likely 
to be particularly motivated to behave in an 
appropriate manner. This differs from the 
similarity explanation in that in one case
subjects are trying to be similar to an- 
other and in the other they are trying to be 
normative. Certainly more research attention 
should be given to this and other possibilities 
underlying the matching process. 
In the present experiment, ingratiating sub- 
jects did not match the behavior of their 
targets to a greater extent than controls. 
While the control conditions were run as a
comparison for the ingratiation conditions, it
is also worth noting that the ingratiation
instructions probably only intensify and make 
more explicit motives operating in the control 
conditions. 
What seems especially interesting is the 
unpredicted result that the subjects were 
more positive to high revealing targets than 
to low revealing targets (p < .01), an effect 
found both for ingratiators (p < .05) and for
controls (p < .01). We see one possible expla- 
nation for this effect within the framework 
of the Gergen and Wishnov (1965) analysis. 
They discussed the character of “status 
threats” posed by highly self-positive others 
and argued that presenting oneself very posi-
tively to such a threatening other is an ef-
fective means of minimizing the threat he
poses. It may be the high revealer and not
the high positive other who poses the “status
threat.” This may be so because high self-
revealingness is an overture to friendship
(Taylor, 1968), an invitation to drop formali- 
ties, and, therefore, a threat to the indepen- 
dence and possibly to the reputational superi- 
ority of the target of the revealing disclosure. 
However, since only 1 of the 96 subjects 
checked the adjective “threatening” as de- 
scriptive of his target, there is little empirical 
evidence that a status threat existed for any 
subjects. 
Another possible explanation for this un- 
predicted result is that subjects perceived the 
high revealing targets to be more gullible, 
receptive, or in need of friendship than the 
low revealing targets, and they felt that they 
could more easily get away with saying very 
positive things about themselves to such tar- 
gets. This interpretation would also explain 
the triple interaction (p < .05) for the posi-
tiveness data that results primarily from in-
gratiators’ unusually high positiveness
to the high positive-high revealing other.
Again, we have no real evidence to support
this speculation.

Finally we turn to a brief consideration of 
the role of revealingness in self-presentations. 

We did obtain a significant effect for the
ingratiators’ use of the revealingness compo-
nent of their total self-presentation. (Ingrati-
ators were more revealing to high revealers
than to low revealers, p < .05.) Both in-
gratiators and controls were sensitive to the
revealingness of the target person in their
positiveness. These findings indicate the value 
of continuing to consider the revealingness 
component of self-presentations in face-to-face 
interactions, particularly where ingratiation 
motives may operate. 

Certainly there are a number of problems, 
both theoretical and operational, with the 
measure of revealingness that was developed. 
First, Forms R and S were directed at 
measuring “willingness to be intimate” and 
not at actual intimacy behaviors. We might 
well expect in this situation that the subjects 
were telling us what they would like to think 
their intimacy behavior will be rather than 
what it actually would be. On the operational 
level, Forms R and S were not very well cor-
related (r = .46). With more reliable mea-
sures of revealingness, we might well have
obtained stronger results. 
Subjects were either annoyed by a confederate or left alone. One-third of the
subjects then gave electric shocks to the confederate; another third watched the
experimenter give identical shocks; and for the rest of the subjects the con-
federate was not hurt by anyone. All subjects were then given an opportunity
to give electric shocks to the confederate. It was found that when subjects were
annoyed, having the confederate get hurt decreased the number of electric
shocks that he was subsequently given, whereas when the subject was not
annoyed, hurting the confederate increased the number of shocks he was sub-
sequently given. The results are discussed in terms of a very specific form of
the catharsis hypothesis as well as other alternative interpretations such as
retaliation.


The catharsis hypothesis of aggression usu-
ally refers to a decrease in aggression after
the expression of aggression. The assumption
as stated by Freud is that there is a certain
has to be expressed,
and that once this has happened, there is less
left to be expressed later on. In war, for ex-


Freud (1959) stated that “Killing an


275]? This instinct, he goes on, “is at work
in every living being and is striving to bring
it to ruin and to reduce life to its original con-
[p. 282].” Dollard,
Mowrer, and Sears (1939) pic-


stating that “the occurrence of any act of
aggression is assumed to reduce the instiga-
tion to aggression [p. 50].” On the other hand,
they admit the possibility of opposite effects
when they say that “the repetition of a mode
of release may produce learning of it [p. 50].”

Although these would seem to be quite
straightforward statements of the catharsis
hypothesis, social psychologists have not
treated the hypothesis that simply. In the first
place, the Aristotelian notion of catharsis is
definitely a vicarious one: that one’s emo-
tions are drained through the vicarious ex-
perience of that emotion. And this notion
when applied to the catharsis of aggression
suggests a very different kind of experiment
from that suggested by the Freudian or the
Dollard et al. statement of the hypothesis.
For reasons that are not clear, experimental
social psychologists seem to have taken the
position that either the catharsis hypothesis
is true (under specified conditions) or it is
not true at all. Disconfirmations of the hy-
pothesis under certain conditions have been
taken as nonreplications of previous experi-
ments, rather than as experiments done with
very different research paradigms testing very
different hypotheses.

As outlined previously (Doob, 1970), tests 
of the loosely defined catharsis hypothesis 
have involved measures of aggressive behavior 
(e.g., Doob, 1970; Hartmann, 1969; Mallick 
& McCandless, 1966), feelings of general hos-
tility or arousal (e.g., Berkowitz, Green, &
Macaulay, 1962; Feshbach, 1955; Worchel, 
1957), physiological arousal (e.g., Hokanson 
& Burgess, 1962; Hokanson & Shetler, 1961), 
or feelings of hostility toward an annoying 
person (e.g., Bramel, Taub, & Blum, 1968; 
Kahn, 1966). There is no good theoretical 
reason why all of these should be considered 
to be equivalent forms of “aggression,” unless 
one automatically assumes a perfect corres- 
pondence between cognitive, conative, and 
affective aspects of aggression. 
The problems involved in the “tests” of
the catharsis hypothesis would be fairly simple
if the only problem were that different mea- 
sures were used. Unfortunately, not only are 
different measures used in these tests, but very 
different things seem to happen to the sub- 
jects, and, once again, there is no good reason 
why all of these things should be considered 
to be cathartic. The catharsis hypothesis 
states that the “expression” of aggression is 
supposed to be cathartic. In tests of the
catharsis hypothesis, this expression has con- 
sisted of a variety of things such as watching 
a videotape of a person having an unpleasant 
drug experience (Bramel et al., 1968), watch- 
ing a movie of a fight at a basketball game 
(Hartmann, 1969), writing Thematic Apper- 
ception Test stories (Feshbach, 1955), saying 
nasty things about an obnoxious person 
(Kahn, 1966), writing notes to a person who 
was insulting (Thibaut & Coules, 1952), help- 
ing the experimenter take money away from 
another person (Doob, 1970), and a variety 
of other things. 

Moreover, there seems to be at least some 
difference in opinion as to whether the sub- 
ject must be annoyed in order to show a 
catharsis effect or whether the state of the 
subject makes no difference. Feshbach (1961), 
Buss (1961), and Bramel et al. (1968) all 
appear to believe that it is important that 
subjects be annoyed, whereas Hartmann 
(1969), Walters and Llewellyn Thomas 
(1963), and others do not appear to feel that 
this is an important variable. 

It is clear, then, that when these three 
sources of differences are combined, there are 
a large number of things that could qualify 
as “tests” of the catharsis hypothesis. It is no 
wonder, then, that there have been contradic- 
tory results from the tests. 

The experiments that tend to support the 
catharsis hypothesis can be divided into two 
categories. In the first place, there are those 
experiments in which aggressive feelings have 
been measured, rather than aggressive behav- 
ior. Thus, for example, Hokanson and Shetler 
(1961), using systolic blood pressure as a 
measure of physiological arousal, found that 
when subjects had been frustrated and were 
then given an opportunity to aggress against 
a relatively low-status annoyer, there was a
decrease in the monitored level of physio-
logical arousal. Similarly, Worchel (1957)
found that expressing hostility when annoyed 
reduced drive (as measured by performance 
on a digit-symbol task). Thus, it would seem 
that at least in some situations, giving an 
annoyed person a chance to aggress will reduce
drive levels.

A second set of situations which produces 
decreased levels of aggression are those in 
which a person sees the person who annoyed 
him getting hurt. Bramel et al. (1968) found 
that annoyed subjects who watched a video- 
tape of their annoyer suffering (even though 
they knew that this tape had been made 
some time in the past) subsequently rated 
him as more competent and courteous than 
did subjects who had watched either of two 
other tapes. The opposite effect was shown 
for subjects who had not been annoyed. Doob 
(1970) showed that subjects who had had a 
chance to hurt the person who annoyed them 
subsequently gave shorter shocks to the an-
noying person than did subjects who had not
had a chance to hurt the annoying person. 
Once again, there was no such effect for sub- 
jects who were not annoyed. 

It is clear then that there is no such thing
as a “test” of the catharsis hypothesis. It is 
perhaps better to put such tests into the 
framework of experiments designed to deline- 
ate the conditions in which aggression is fol- 
lowed by a reduction of hostility or by ag- 
gressive behavior. 

The present experiment is a replication and 
extension of the Doob (1970) experiment. It 
is clear from previous work that the state 
(annoyed or not) of the subject is a critical 
variable. Also, it appears that a decrease in 
aggressive behavior is more likely to be found 
when the annoying person is actually hurt in 
some way. However, it is unclear from previ- 
ous work whether it is necessary to have the 
subject himself involved in this hurting. This 
experiment attempts to compare the effect of 
having the annoying person get hurt by the 
subject or by another person. In addition, it 
compares these effects for subjects who have 
been annoyed and for those who have not been 
annoyed.


The experiment, then, is a 2 × 3 factorial
design: half of the subjects were annoyed by 
a confederate; half were not. One-third of the 
subjects in each of these groups were then 
given a chance to hurt the confederate; one- 
third watched the confederate being hurt by 
the experimenter; and for one-third of the 
subjects, the confederate did not get hurt. All 
subjects then had a chance to give electric 
shocks to the confederate. 

It was predicted that in the annoy conditions,
having the confederate get hurt (either by the 
subject or the experimenter) would decrease 
the number of shocks that the subject gave. 
In the no-annoy conditions, it was predicted 
that watching the experimenter hurt the con- 
federate would, if anything, increase the num- 
ber of shocks that the subject gives the con- 
federate. It was expected that having the 
subject hurt the confederate in the catharsis 
session when the subject was not annoyed 
would, if anything, increase the number of 
shocks that the confederate was given. 
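
The six cells of the design and the direction of the predictions stated above can be summarized in a small Python sketch. The condition labels and the function name are illustrative and do not come from the original procedure; the cells in which nobody hurts the confederate are treated here as the comparison baseline.

```python
from itertools import product

ANNOYANCE = ("annoy", "no-annoy")
HURT_BY = ("subject", "experimenter", "nobody")

# The 2 x 3 factorial design: every combination of the two factors.
cells = list(product(ANNOYANCE, HURT_BY))


def predicted_change(annoyance, hurt_by):
    """Predicted change in later shocks relative to the no-hurt baseline."""
    if hurt_by == "nobody":
        return "baseline"
    if annoyance == "annoy":
        return "decrease"
    return "increase, if anything"


predictions = {cell: predicted_change(*cell) for cell in cells}
```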


METHOD 


Subjects were run individually with a confederate. 
The subject was given a card-sorting task and left 
alone with the confederate. For the next 10 minutes, 
the confederate either annoyed the subject or sat 
quietly watching the subject. When the experimenter 
returned, the confederate was given the task of 
learning a list of paired associates. The confederate 
was then tested by the subject alone or by the 
experimenter in the presence of the subject. In two 
of the three conditions, the confederate was sup- 
posedly shocked for incorrect responses either by the 
subject or by the experimenter. In a third condition, 
no shock was given in this second stage. The con- 
federate was then given the task of giving creative 
associations in a free association task. Each subject
judged these responses and supposedly delivered a
shock if he evaluated the confederate’s responses as
“uncreative.” The number of shocks given during 
this last phase was used as the main dependent 
variable. 


Procedure 


The subjects were 99 male and female first-year 
university and high school students of a minimum 
age of 16 years. The university students participated 
to fulfill a requirement for their introductory psy- 
chology course, while the high school students were 
paid $1.50. 

The confederate, a female, and the subject arrived 
at approximately the same time and were shown to 
the experimental lab. This lab consisted of two 
small rooms separated by a door and a wall with a 
large glassless window. For the experiment, this
window was covered by a black curtain which pre- 
vented visual, but not auditory, contact between the 
two rooms. A large table was placed against this 
wall in the first room. A small metal box with a 
button was placed at the edge of the table. A wire 
ran from this box under the curtain to an event 
recorder in the second room. A detachable electric 
buzzer was connected to the recorder. When the
buzzer was connected to the recorder, and the button 
on the metal box was depressed, the buzzer sounded 
for approximately 2 seconds. 

After the subject and the confederate were seated 
at the table in the first room, the experimenter told 
them that they would be participating in a number 
of different learning experiments. He stated:


The usual learning experiment consists of one 
person coming in, the experimenter giving him 
some task to learn, and later testing him. Today, 
however, we will be doing something slightly 
different. For this first task, a concept-formation
task, one of you will be the teacher and the other
the learner. I guess for this task you [pointing 
to the subject] might as well be the learner and 
you [pointing to the confederate] the teacher. I 
have here a stack of white index cards, 125 of 
them in all. On each card, there is a different num- 
ber of colored figures. There are either 1, 2, 3, 4, 
or 5 figures per card, and these figures are either 
triangles, squares, rectangles, circles, or diamonds 
of five different colors. For this task, you [to the 
subject] are to sort these 125 cards into 25 sets of 
5 cards each. In each of these 5-card sets, you 
must have each of the five numbers, colors, and 
shapes represented. There can be no repetitions of
any of these three dimensions in a set. For example,
a set that is a valid set might be one red triangle, 
two green squares, three yellow circles, four blue 
rectangles, and five orange diamonds. You [to the 
subject] will have 10 minutes to make up as many 
sets as you can. Your job [to the confederate] 
will be to check that each set that he makes is a 
valid one, to write down on this sheet what each 
set consists of, that is, one red triangle, two green 
squares, etc., and also to time with this stopwatch 
how long it takes to make up each set. 
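
The validity rule for a set, as given in the instructions above, can be stated compactly in code. The following Python sketch builds the 125-card deck and checks a candidate set. The five color names are taken from the example in the instructions and are assumed to be the five colors actually used; the function name is illustrative.

```python
from itertools import product

NUMBERS = (1, 2, 3, 4, 5)
SHAPES = ("triangle", "square", "rectangle", "circle", "diamond")
# Assumption: the five colors named in the example are the five colors in the deck.
COLORS = ("red", "green", "yellow", "blue", "orange")

# One card for every number x color x shape combination: 5 * 5 * 5 = 125 cards.
DECK = list(product(NUMBERS, COLORS, SHAPES))


def is_valid_set(cards):
    """True if the 5 cards show no repeated number, color, or shape."""
    if len(cards) != 5:
        return False
    numbers, colors, shapes = zip(*cards)
    return len(set(numbers)) == len(set(colors)) == len(set(shapes)) == 5


# The valid set given as an example in the instructions.
example = [
    (1, "red", "triangle"),
    (2, "green", "square"),
    (3, "yellow", "circle"),
    (4, "blue", "rectangle"),
    (5, "orange", "diamond"),
]

# Replacing the last card with another red card repeats a color.
invalid = example[:4] + [(5, "red", "diamond")]
```

Because each of the three dimensions has exactly five values, a valid set uses every number, color, and shape once, which is why the full deck can in principle be sorted into 25 such sets.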


The experimenter then answered any questions the 
subject asked by repeating the appropriate section 
of the instructions, and then left the room. At this 
point the annoy manipulation was begun for subjects 
who had been randomly assigned to the annoy con- 
ditions. 


Annoy Manipulation 


Approximately 2 minutes after the experimenter 
had left the room, the confederate began sarcasti- 
cally to attack the subject’s “obvious lack of intelli- 
gence.” The confederate used a variety of deroga- 
tory remarks and reacted with disgust to any com- 
ments or questions the subject made. The confed- 
erate attacked any personal characteristic that the 
subject might be sensitive to, that is, dress, the sub-
ject’s major, etc.

Obviously, this procedure was not exactly the same
for every subject. In order to get all subjects an- 
noyed, the confederate used a number of different 
techniques which varied slightly from one subject to 
the next. However, at this point the experimenter did
not know what condition the subject was in, nor 
did the confederate know whether or not shock 
would be given on the second task. In the no-annoy 
conditions, the confederate acted like a normal sub- 
ject, and sat relatively quietly for this first task until 
the experimenter returned. 


Shock-No-Shock Manipulation 


When the experimenter returned after 10 minutes, 
he was careful not to comment on the subject’s per- 
formance, but indicated that it would be best to 
continue with the rest of the experiment. The instruc- 
tions for the next part went as follows: 


The next task is called a paired-associate learn- 
ing task. Let’s see, you [to the confederate] will 
be the learner for this task. You will have to 
learn the number that goes with each of these
words. On the testing, the word will be read, and
you will give the correct number. You'll have 4
minutes to study this list. 


At this point, the instructions varied slightly for 
each of the shock conditions.

In the no-shock conditions, the experimenter
stated:


During the testing, you [to the confederate] will 
be seated in the next room, and I will do the test- 
ing. I will read the word on the list, and you will 
give the number that goes with it. You [to the
subject] won’t have anything to do for the next 
few minutes. 


In the conditions where the subject shocked the 
confederate, the experimenter stated: 


Now, during the testing, you [to the confederate] 
will be seated in the next room. You [to the sub- 
ject] will read each word aloud, and you [to the 
confederate] will give the number that goes with 
it. If the number is correct, then nothing will 
happen on that trial. If the number is incorrect, 
you [to the subject] will give her a medium in-
tensity, fixed-duration electric shock. Medium in-
tensity means that it is an uncomfortable shock, but
it is not extremely painful or dangerous. 


Moving the metal box in front of the subject, the 
experimenter then stated: “When this button is 
pressed, a buzzer will sound, and the fixed-duration 
shock will be automatically delivered. Do either of 
you have any questions?” He then answered any 
questions by repeating the instructions. In the condi- 
tions where the experimenter shocked the confed- 
erate, the instructions were the same as when the 
subject gave the shocks except that the experimenter 
referred to himself as the tester.

After the 4-minute learning period, the confederate
was seated in a metal chair in front of the apparatus 
in the second room. For the conditions in which the 
subject gave shocks, the experimenter gave the sub- 
ject the list of paired associates and the instructions: 


If she gives the correct response, go on to read
the next word. If she gives an incorrect response, 
mark an X beside the word and press the button. 
A buzzer will sound when the button is depressed, 
and the fixed-length shock will be automatically 
delivered. 


In the conditions in which the experimenter delivered 
the shock, he said: “I'll read the list in order once. If 
you make a mistake, a buzzer will sound, and a 
shock will automatically be delivered.”

In the two shock conditions, the experimenter 
attached the electrodes to the confederate and turned 
on the recorder. He instructed the confederate not 
to make comments or noises if a shock was deliv- 
ered. He then closed the door to the second room 
and either left the first room or proceeded with the 
testing (depending on experimental condition). Dur- 
ing the testing, the confederate made eight intentional 
errors and ostensibly received eight shocks. In the 
no-shock conditions, the experimenter simply read 
the words to the confederate, and the confederate 
answered with the same responses as she did in the
two shock conditions. When the testing was complete, 
the experimenter turned off the recorder and dis- 
connected the buzzer. 


Dependent Measure 


The last part was the same for all subjects and 
went as follows: 


The last thing that we will be doing is what is 
known as a free association task. In this type of 
task what usually happens is that one person reads 
a word, and the other person gives the first word 
that comes to mind. Today, we will be doing some- 
thing slightly different. For this task, you [turning 
to the subject] will be reading a list of words, 
and you [to the confederate] will be giving a 
response to each word. However, instead of neces- 
sarily giving the first word that comes to mind, 
you should try to give a creative response, perhaps 
an unusual response, to the word that is read. 
Your job [to the subject] will be to write down 
the word that she gives and to judge whether or 
not you think her response is a creative one. If 
you think it is a creative response, then nothing 
will happen on that trial. If you think that it is
not a creative response, then you will be giving
her a medium intensity shock. The length of the
shock is to be determined by how bad a response
you think the one she gives is. If you think that
it is a very bad response, then you hold the button 
down a longer time. If you think it is only 
slightly uncreative, then hold the button down a 
shorter time. If you think her response is a good 
response, then there will be no shock at all. Now,
obviously, I cannot give you any strict set of
criteria about what is creative and what is not
creative. That’s for you to decide. Use your own
judgment.

In the two shock conditions, the experimenter ex-
plained that there would be no buzzer on this last
test. He then turned on the recorder, closed the
door to the second room (after attaching the elec-
trodes in the no-shock conditions), and instructed
the confederate not to comment if a shock was
delivered. He then left the room. In all of the condi-
tions, the confederate gave the same set of responses. 
Obviously, at no point in the experiment did the 
confederate actually receive any electric shocks. 
When the testing was completed, the experimenter 
returned, turned off the recorder, detached the elec- 
trodes, and asked the confederate to wait in another 
room. He then gave the subject a questionnaire de- 
signed to evaluate his opinions of the confederate. 
When the subject had finished the questionnaire, the 
experiment was completely explained to him. 


RESULTS 


Nine subjects’ data were not included in 
the analysis: 3 had heard about the experi- 
ment from their friends; 3, because of lan- 
guage difficulties, did not understand that 
they were supposed to be delivering shocks; 
2 terminated the experiment before the end, 
having read one of the popularized versions 
of Milgram (1963) and thinking that that 
was what the experiment was all about; 1 was 
color blind and could not perform the first 
part of the experiment. This left 90 subjects 
for analysis (15 per cell). 

From looking at the results of the ratings 
that the subjects made of the confederate, it 
is clear that the annoy manipulation was suc-
cessful. Annoyed subjects liked the confed-
erate significantly less than did the subjects
who had not been annoyed; they thought that 
it was less likely that they could ever be good 
friends; they rated her as more aggressive, 
colder, less likable, more emotional, less 
friendly, less interesting, more dishonest, and 
stronger. There were no differences in the rat-
ings attributable to the shock variable.

The major results of the experiment are
presented in Table 1. All analyses are reported
in terms of the number of shocks delivered.
The number and average length of the shocks
were slightly correlated (average r for the six
experimental conditions = .39). However, per-
haps because of some confusion between the
“fixed-length” shocks in the “catharsis” ses-
sion, and the high variance on the length of
shocks delivered (since the subjects had no
stated range given to them), the results are
clearest for the number of shocks delivered.
The interaction between the annoy variable
and the catharsis activity was significant (F
= 4.54, df = 2/84, p < .05), and the two
main effects were not (Fs < 1). The major
hypothesis is that in the annoy conditions
having a chance to hurt the annoying person
should be cathartic. Clearly, this is so: the
difference between the annoy-subject-shocks
cell and the annoy-no-shock cell is significant
(F = 6.03, p < .05). The difference between
the annoy-no shock and annoy-experimenter
shocks is in the same direction, but is not
significant (F = 3.79, p < .10). Subjects in
the no-shock conditions gave significantly
more shocks when they had been annoyed (F
= 6.69, p < .05).

TABLE 1

MEAN NUMBER OF SHOCKS DELIVERED ON THE
“CREATIVE ASSOCIATION” TASK BY
EXPERIMENTAL CONDITION

Condition     Subject shocks   Experimenter shocks   No shock

Annoy              6.80               7.60             10.67
No annoy           8.07               9.73              6.
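The pairwise comparisons reported above can be checked for internal consistency with a short calculation. The sketch below is not the authors' analysis. It assumes that each reported F is a one-degree-of-freedom comparison of two cell means against a single pooled error mean square, which the paper does not report; the helper names and the back-solved error term are hypothetical.

```python
def pairwise_f(mean_a, mean_b, n_per_cell, ms_error):
    """F (1 df in the numerator) for the difference between two cell means."""
    diff = mean_a - mean_b
    return diff ** 2 / (ms_error * 2.0 / n_per_cell)


def implied_ms_error(mean_a, mean_b, n_per_cell, f_reported):
    """Back-solve the error mean square implied by one reported F (assumption)."""
    diff = mean_a - mean_b
    return diff ** 2 * n_per_cell / (2.0 * f_reported)


N_PER_CELL = 15  # "15 per cell" in the text
# Annoy-row cell means taken from Table 1.
ANNOY = {"subject": 6.80, "experimenter": 7.60, "none": 10.67}

# Error term implied by the reported F = 6.03 (no shock vs. subject shocks).
ms_error = implied_ms_error(ANNOY["none"], ANNOY["subject"], N_PER_CELL, 6.03)

# Apply that error term to the no shock vs. experimenter shocks comparison.
f_experimenter = pairwise_f(
    ANNOY["none"], ANNOY["experimenter"], N_PER_CELL, ms_error
)

print(round(ms_error, 2), round(f_experimenter, 2))
```

Under these assumptions, the error term implied by the subject-shocks comparison yields an F for the experimenter-shocks comparison close to the reported 3.79, which is what one would expect if both comparisons were tested against the same error term.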

As indicated by the significant interaction, 
the catharsis activity had different effects for 
the subjects who were not annoyed as com- 
pared to the effects on the annoyed subjects. 
Having the confederate get hurt (either by 
the subject himself or by the experimenter) 
when the subject was annoyed reduced the 
number of shocks that the confederate subse- 
quently was given, whereas having the ex- 
perimenter hurt the confederate when the sub- 
ject was not annoyed at him tended to in- 
crease the amount of shock he was given (no 
annoy-no shock versus no annoy-experi-
menter shocks, F = 3.94, p < .07; no annoy-
no shock versus no annoy-subject shocks, 75).


DISCUSSION 


The Doob (1970) experiment showed that 
angered subjects who participated in the 
hurting of their annoyer subsequently hurt 
him less than did annoyed subjects who were
given no chance to hurt their annoyer. No
such decrease in aggressive behavior occurred 
for subjects who were not annoyed. The pres- 
ent experiment extends these results some- 
what by showing that this decrease will occur 
when either the subject or someone else hurts 
the annoying person. 

It is difficult to interpret these data in terms 
of the subjects’ feeling guilty in the shock 
conditions. Berkowitz (1962) suggested that 
this is a major problem with many experi-
ments in the area of catharsis of aggression.
In this experiment, if one is to postulate that 
the decrease in aggression after aggression in 
the annoy conditions is due to guilt, it would 
seem reasonable to postulate the same effect 
for the no-annoy conditions. Obviously, how- 
ever, the results do not support this notion, 
since subjects who were not annoyed tended 
to give more shocks after the confederate had 
been hurt than when he had not been hurt. 

These data are entirely consistent with 
those reported by Bramel et al. (1968) in 
which subjects who had just been annoyed by 
someone rated him less negatively if they had 
just finished watching a videotape of him 
suffering than if they had watched him having 
a neutral or happy experience. In that experi- 
ment, an opposite effect was found for people 
who were not annoyed. In many ways that 
experiment is analogous to the no-shock and 
experimenter-shocks conditions of this experi- 
ment, in that in neither case did the subject 
actually do the hurting. 

At first glance, it would seem that these 
results were exactly opposite from those of 
Kahn (1966), where annoyed subjects were 
either encouraged to express their hostile feel- 
ings toward an obnoxious experimenter or 
were given no such opportunity. Subjects who 
had been induced to say nasty things about 
the experimenter in the “catharsis” session 
once again rated the experimenter on the nega- 
tive ends of the rating scales. Subjects who 
had had no such initial opportunity did not 
rate the experimenter so negatively. Kahn 
(1966) explained these results in terms of 
dissonance theory: 

Once the subject has counteraggressed and gotten
[the obnoxious experimenter] into trouble, the cog-
nition of this action would seem dissonant with the
thought that the experimenter was not such a bad
guy and that the subject was not really angry with 
him [p. 285]. 


In other words, the subject in this experiment 
was simply acting in a consistent manner— 
after aggressing once, when given an addi- 
tional opportunity, he aggressed again. The 
experiment reported here minimizes the simi-
larities between the catharsis activity and the
dependent measure. Thus, the subject who 
aggresses in the catharsis session is not under 
the same sort of consistency pressure as is the 
subject in Kahn’s experiment. 

It might be tempting to describe the results
of this experiment simply in terms of a suc- 
cessful “test” of the catharsis hypothesis. 
However, as pointed out above, there are so 
many things that might be considered to be 
“tests” of the catharsis hypothesis that it is 
futile to consider any single experiment as a 
test. One alternative way of looking at these 
results is in terms of retaliation that occurs 
in the subject-shocks and experimenter-shocks 
conditions. As Berkowitz (1970) pointed out: 
“We evidently feel better when we see that 
the person who had angered us has been hurt. 
... We do not have to hurt the frustrator our- 
selves to experience this pleasure [p. 6].” 
However, this does not mean that some sort of 
diminution of aggressive impulses has neces- 
sarily occurred. All that it implies in the con- 
text of the present study is that the angered 
person who has hurt his annoyer (or seen 
his annoyer hurt) has “evened the score.” 
Thus, the annoyed person whose annoyer has
been hurt will feel less of a need to retaliate 
further than the annoyed person whose an- 
noyer has not been hurt in some way. 

Although these results are different from 
the experiments on observed film aggression 
(cf. Bandura, 1965; Berkowitz, 1965; Hart- 
mann, 1969; Walters & Llewellyn Thomas, 
1963), they do not in any way contradict 
those findings. As Berkowitz (1970) has 
pointed out, the last 10 years of research have 
demonstrated quite thoroughly that observing 
the aggressive behavior of others in situations 
such as movies tends to increase the level of 
aggression rather than decrease it. This tends 
to be the case when the aggressive response is 
the same as that which was observed (e.g.,
Bandura, Ross, & Ross, 1963) or when it is
different (e.g., Hartmann, 1969). However, in 
the experiments on the effects of film violence, 
the subject (either angry or not) is typically 
exposed to a film where the aggression is un- 
related to the setting in which the subject 
finds himself. In the present experiment, the 
person being hurt (in either the subject-shocks 
or experimenter-shocks conditions) is the 
person with whom the subject interacted in 
the first part of the experiment. It would seem 
that the paradigms are different enough so 
that there would be no good reason to expect 
the experience of aggression to have the same 
effect in these two very different situations.

The experiment was carried out to determine whether the reduced recall
usually observed in subjects after receiving ego-threatening information is due 
to repression or interference. Subjects learned a list of words, had their recall 
tested for the list, and then took a personality test in which the words were 
used as stimuli. Subjects were then given either ego-threatening, ego-enhancing, 
or neutral feedback. Following this, they were tested for recall again, de- 
briefed, and tested for recall once more. Subjects receiving threatening and en- 
hancing feedback performed alike and showed poorer recall on the postfeedback 
test than the subjects receiving neutral feedback. Since the subjects in the threat 
and enhancement conditions performed the same in terms of the number and 
nature of the words forgotten and since there was no reason to expect repres-
sion in the enhancement condition, it was concluded that postfeedback recall
decrements were due to interference.


The research strategy that has been most 
influential in offering support for the con- 
cept of repression was developed by Zeller 
(1950a). This approach has three major 
parts. First, experimental and control sub- 
jects learn and have their retention tested for 
a list of words. On this test, no differences 
are expected. Second, the experimental sub- 
jects are “ego threatened” (subjects are 
forced to fail on another task or are given neg- 
ative personality feedback), while control sub- 
jects are not threatened. Then both groups are 
given a second recall test for the words. Since 
it is assumed that the anxiety introduced by 
the threat procedure will generalize to the 
words because they were learned in the same 
situation, and since it is also assumed that 
this anxiety will result in repression, it is 
predicted that on the second recall test the 
performance of the experimental subjects will 
be below that of the control subjects. Third, 
in an attempt to “lift the repression,” the 
threat is eliminated (subjects are allowed to 
succeed or are debriefed), and the subjects
are then given a third recall test on which
it is predicted that there will be no differ- 
ences in recall between the groups. Using 
this approach, numerous investigators have 
found results consistent with the predictions, 
and these results have been cited as empirical 
evidence for the existence of repression 
(Aborn, 1953; Flavell, 1955; Merrill, 1954; 
Penn, 1964; Truax, 1957; Worchell, 1955; 
Zeller, 1950b, 1951). 

Recently, the interpretation of these find-
ings as evidence for repression has been
called into question by two investigations. 
D’Zurilla (1965) carried out postexperimental 
interviews with subjects who had participated 
in an experiment similar to the one discussed 
above and, contrary to what would be ex- 
pected on the basis of repression, found that 
62% of the experimental subjects thought 
about things related to the threatening task, 
while only 24% of the control subjects
thought about things related to their task; 
that is, the experimental subjects attended to 
rather than avoided the threat. D’Zurilla 
went on to point out that the additional 
thoughts of the experimental subjects were 
irrelevant to the task of recalling the words, 
and he, therefore, speculated that the “in- 
crease in amount of conflicting cognitive events 
could have reduced the efficiency of recall 
[p. 256]” in the experimental group. In other 
words, he suggested that the reduced recall
following ego threat might be due to response
competition rather than to repression. 

Holmes and Schallow (1969) attempted to 
experimentally test the question of whether 
reduced recall following ego threat was due
to repression or response competition. In 
addition to the groups that received ego- 
threatening (experimental) or no (control) 
feedback, these investigators employed a 
third group which was shown a neutral movie 
(interference). After the experimental ma- 
nipulation (negative feedback, no feedback, 
interfering movie), all of the subjects were 
tested for their retention of some previously 
learned words. They were then debriefed and
tested again for retention. Consistent with the 
predictions based on the interference theory, 
the investigators found that while there were 
no differences among the groups prior to the 
experimental manipulation, after the manipu- 
lation, the control group performed signifi- 
cantly better than either the ego-threat or 
interference groups and, of most importance, 
that there were no differences between the 
performance of the ego-threat or interference 
groups. After the debriefing, which redirected 
the subjects’ attention to the words, the recall 
test again indicated no differences between 
any of the groups. In reviewing their results, 
Holmes and Schallow (1969) pointed out 
that: 


If only the curves of the control and ego-threat 
groups are considered, the results of the present 
study can be viewed as a replication of the earlier 
work in which it was concluded that the ego threat 
had resulted in repression. However, the fact that 
in the present study the performance of the inter- 
ference group differed from the control group but 
did not differ from the ego-threat group suggests 
that previous interpretations of differences between 
ego-threat and control groups as being due to repres- 
sion could be called into serious question. . . . While 
it is impossible to prove the null hypothesis that 
the same process was operating in both the ego- 
threat and interference conditions, the inability to 
generate differential predictions which are supported 
by the data is certainly suggestive. It might, there- 
fore, be that response competition rather than repres- 
sion is responsible for the lowered recall performance 
found in the ego-threat condition of experiments of 
this type [pp. 150-151]. 


As the authors indicated, the fact that the 
interference and ego-threat groups performed 
in the same way does not necessarily prove
that the same process was operating in both
situations. This criticism is especially rele-
vant since in their experiment the two hy-
pothesized interferences were very different in
origin and nature; that is, in the interference
condition, the interference stemmed entirely 
from an extrinsic source (the movie) and 
was impersonal, while the hypothesized inter- 
ference in the ego-threat condition was in- 
trinsic to the individual (concerns about 
adjustment) and highly personal. These are 
important differences, and they increase the
conceptual distance between the conditions
that the authors wanted to suggest as equiva-
lent. If the interference that was hypothe-
sized to be present in the two conditions had 
been more similar in source and nature, the 
authors would have been in a better position 
to suggest that repression was actually due 
to interference and response competition.

The present experiment was carried out in 
an attempt to overcome the problems noted 
above. Similar to the earlier experiment, the 
subjects in the control condition were given 
neutral feedback, while the subjects in one 
experimental condition were given negative 
“ego-threatening” personality feedback. In 
contrast to the previous work, however, in 
the second experimental group, an attempt 
was made to introduce interference by 
giving the subjects positive “ego-enhancing” 
personality feedback. 

In this experiment the only difference be- 
tween the ego-threatening and interference 
conditions was the fact that in one case the 
feedback was ego threatening, while in the 
other it was ego enhancing. This difference is, 
of course, crucial for the different predictions 
stemming from the interference and the re- 
pression theories. Interference theory ignores 
the threat versus enhancing nature of the 
feedback and suggests that the same type of 
interference (thoughts about the feedback) 
would be elicited and, therefore, would affect 
recall in both experimental conditions. Conse-
quently, this position would not predict a
difference in recall performance between the 
two experimental groups. Furthermore, since 
it was expected that less interference would 
be elicited in the control condition than in 
either of the experimental conditions, inter-
ference theory would predict that after the
experimental manipulation, the recall per- 
formance of the control subjects would be 
superior to that of the experimental subjects. 
On the other hand, repression theory empha- 
sizes the influence of ego threat and predicts 
a reduced recall only for those subjects ex- 
posed to an ego threat. That is, the repression 
position would predict that the performances
of the subjects in the control and ego- 
enhancement conditions would be similar and 
that the performance of the subjects in these 
conditions would be superior to that of sub- 
jects in the ego-threat condition. These are 
the differential predictions that were under 
investigation in the present experiment. 

Interestingly enough, with regard to post- 
debriefing performance, both theories predict 
an improvement. In the case of interference 
theory, the improvement would stem from the 
fact that after being debriefed, the subjects 
would no longer be distracted by the thoughts 
and concerns elicited by the feedbacks, while 
according to repression theory the improve- 
ment would stem from a return of the re- 
pressed made possible by the elimination of 
the anxiety. 


METHOD 
Subjects 


Subjects in this experiment were 36 women from 
introductory psychology classes at the University of 
Texas at Austin. Subjects were run in groups of 2, 
and each subject was assigned to one of three 
conditions. During the experiment, each subject was 
seated in a chair facing a projection screen and was 
separated by a partition from the other subject. 
Attached to the arm of the subject’s chair was a
“response panel” which contained one green signal 
light and four switches labeled A, B, C, and D. The 
experimenter sat behind the subjects at a screened-
off control center where he controlled the signal
light and recorded which switches were thrown by
the subjects.


Procedure 


When the subjects arrived at the laboratory, they 
were assigned a seat and then told that the experi-
ment would be run completely by a computer located
in an adjoining room. The experimenter then threw 
a switch and took his place at the control center. 
Throwing the switch activated a concealed tape 
recorder which was supposedly running in conjunc- 
tion with the computer and which was used to 
present all of the instructions. The subjects were
first presented with a list of 40 words that was
read twice. It was pointed out that these words
would be used in a later part of the experiment 
and that at this point the subjects should “just 
listen to the words and become familiar with them.” 
The words that were presented were the same ones 
used earlier by Holmes and Schallow (1969, p. 147, 
Table 1). After the words had been read, the sub- 
jects were asked to write down as many of the 
words as they could remember on a word familiarity 
check form. Supposedly, this was done as a check
on their familiarity with the words, but actually it 
provided the premanipulation measure of recall for 
the words. After 2 minutes, the forms were col- 
lected, and the tape-recorded instructions went on 
to explain that the experiment in which the subjects 
were participating was a test “of a very sensitive 
and accurate test of personality . . . which can be 
used to predict future adjustment and performance 
in college women” and which was being adapted 
for use at the university. It was explained to the 
subjects that “the famous Rorschach Psychodiag- 
nostic Inkblots” would be projected on the screen 
one by one, and that while each blot was on the 
screen, four nouns would be read aloud to them. The 
task of each subject was to decide which of the four 
nouns best described the blot she was looking at and 
to indicate her choice by throwing the appropriate 
switch on her response panel. They were led to 
believe that this would register their responses in 
the computer, which would then score the responses 
and give each subject feedback concerning each of
her responses. Feedback was provided via the light 
on the response panel which flashed on and off one, 
two, or three times after the subject made each 
response. Each subject was told to count the flashes
and record the number on the score sheet she had 
been given. The score sheets supposedly were going 
to be collected at the end of the session so that 
the members of the Psychology Department could 
evaluate the subjects’ performance and determine 
whether the subjects “should receive future attention 
of any kind.” Between recording their scores and 
being presented with the next inkblot, the subjects 
were told to use the number of flashes they had 
received as feedback after their preceding response 
to look up the meaning of that response in the 
computer code books that they had been given. 
Each computer code book contained three response 
interpretations for each blot: one ego enhancing,
one ego threatening, and one neutral. Each interpre- 
tation was printed on a separate page, and the pages 
were official-looking computer printouts. Each 
interpretation page had a tab on it identifying the 
blot and the number of flashes with which that 
interpretation was associated. Three different books
were constructed so that the same number of flashes 
would result in three different feedbacks, depending 
on which book was being used. Consequently, in 
any one experimental session, the subjects in dif- 
ferent conditions could be run despite the fact that 
the apparatus gave all subjects the same number of 
flashes for feedback. Subjects in the threat and 
enhancement conditions received negative or positive
feedback after responding to Blots II, III, V, VI,
VIII, and X. For example, after Blot VI, in the 
enhancement group, the subjects were informed that 


This response is indicative of outstanding leader- 
ship abilities. The leadership ability should enable 
this woman to assume positions of responsibility 
in large groups as well as in interpersonal rela- 
tionships, and she should meet with considerable 
success. 


After Blot X, they were informed that 


This response indicates that the individual is ex- 
ceptionally well adjusted. This level of adjustment 
will enable her to function at a personal and aca- 
demic level well above that of the average college 
student.


On the other hand, after Blot VI, the subjects in 
the threat group were informed that 


This response indicates the possibility that the 
individual has some underlying personality prob- 
lems. Either at the present time or in the near 
future this underlying maladjustment could cause 
serious difficulties for this person in her inter- 
personal relations with friends and superiors. 


After Blot X, they were informed that “This re- 
sponse is suggestive of a high degree of pathology, 
and corrective steps must be considered.” After 
responding to these blots, the subjects in the neutral 
condition received feedback indicating that their 
response had no particular significance. Feedback to 
all subjects after responding to Blots IV, VII, and IX 
indicated that “This response is consistent with pre- 
vious responses thus confirming previous feedback. 
This response adds no new information.” Subjects 
were not given feedback after responding to the first 
blot since the computer supposedly did not yet have 
enough information. 

It is important to note that the words offered as 
response possibilities for the Rorschach inkblots were 
the same as those presented on the list at the begin- 
ning of the experiment and for which recall was 
tested. In this way, threat was directly associated 
with the material that was to be recalled. 

After the subjects had looked up the interpretation 
of their last response, the score sheets and the
computer code books were collected, and then the 
subjects were given the second 2-minute recall test 
using a word familiarity check form. This provided 
the measure of the subjects’ postmanipulation recall 
for the words. After this recall test, the tape- 
recorded voice of the experimenter completely de- 
briefed the subjects concerning the experimental ma- 
nipulations. The subjects were then given a final 
2-minute recall test using a word familiarity check 
form, and this provided the postdebriefing recall 
measure. 

In summary, the procedure was as follows: learn- 
ing task for 40 words; pretreatment recall test for 
words (Test 1); Rorschach administration (previ-
ously learned 40 words as response alternatives)
with subjects getting either ego-threatening, ego-
enhancing, or neutral feedback; posttreatment recall
test for words (Test 2); debriefing; postdebriefing 
recall test for words (Test 3). It should be noted 
that the procedure was almost completely automated 
and that the subjects were simultaneously run in 
different conditions, therefore reducing or eliminating 
any subtle experimenter effects. The high degree of 
automation in addition to the large amount of bogus
technical information that the subjects were given 
concerning the test, its scoring, and the built-in 
safeguards to assure the reliability and validity of 
the feedback served to make the deceptions very 
believable.


RESULTS


The manipulations used in this experiment 
were evidently quite credible since only one
subject (enhancement condition) indicated 
that she did not believe the instructions. The 
data from this subject and those of one other 
subject in the enhancement condition who did 
not follow the instructions were not used in 
the analyses. The recall data are presented 
graphically in Figure 1.

FIG. 1. Mean changes in words recalled by the
subjects in the ego-threatening, ego-enhancing, and 
neutral feedback conditions. (Since change scores 
were plotted in this figure, all scores at Point 1 are 
actually 0 and were included only as a reference 
point.)

Effect of Differential Feedback on Recall


A between-within analysis of variance com- 
paring the recall performance of the three 
groups (neutral, enhancing, threatening) over 
the three trials (Recall Tests 1, 2, and 3) was 
carried out on the data. This analysis indi-
cated that the groups effect was not signifi-
cant (F = .97, df = 2/31), that the trials
effect was significant (F = 37.94, df = 2/62,
p < .001), and that the Groups × Trials in-
teraction approached significance (F = 2.20,
df = 4/62, p = .079). The important question
to be answered, however, was which of the two 
rival hypotheses (i.e., repression or interfer- 
ence) was best able to account for the pattern- 
ing of results. The repression hypothesis pre- 
dicted that on the second trial the neutral and 
the enhancement groups would perform alike 
and better than the threat group. On the other 
hand, the interference hypothesis predicted 
that on the second trial the neutral group 
would perform better than the threat and 
enhancement groups and that these latter two 
groups would perform alike. Neither the re-
pression nor interference hypotheses predicted
differences between any of the groups on 
Trials 1 (before manipulation) or 3 (after 
debriefing). In other words, the hypotheses 
differed concerning whether on Trial 2 the 
enhancement group would perform like the 
neutral group (repression hypothesis) or 
whether it would perform like the threat 
group (interference hypothesis). The follow-
ing procedure was used to statistically evalu-
ate the rival hypotheses. The sum of squares
for the Groups × Trials interaction (with 4
degrees of freedom) was partitioned into two 
components. One component (with 1 degree 
of freedom) reflected the extent to which the 
obtained means were congruent with what 
was expected from the particular hypothesis 
in question, namely, interference or repres-
sion. The other component (with 3 degrees of
freedom) was the remainder sum of squares 
for the interaction not accounted for by the 
predicted relationships (after Winer, 1962, p. 
65). The results indicated that the component 
of the Groups × Trials interaction predicted
by the interference hypothesis was significant
at the .01 level of confidence (F = 7.27, df
= 1/62), while the F for the remainder was
less than one and not significant (F = .51, df
= 3/62). On the other hand, the component
of the Groups × Trials interaction predicted
by the repression hypothesis was significant 
at only the .05 level of confidence (F = 4.03, 
df = 1/62), while the F for the remainder 
was greater than one (F = 1.59, df = 3/62, p 
< .25). To decide how well data can be ex- 
plained by a particular hypothesis, one should 
know how much of the data can be accounted 
for (the predicted component) as well as how 
much of the data cannot be accounted for 
(the remainder component). On both counts, 
the interference hypothesis was superior.3
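
The partitioning described above can be illustrated with a minimal sketch. It assumes equal numbers of subjects per cell, which may not hold for the present groups, and it uses invented cell means rather than the data of this experiment. The function names, the 0/1 coding of the two predicted patterns, and the value of 12 subjects per cell are all assumptions made for illustration. The interaction has 4 degrees of freedom, the predicted component 1, and the remainder 3; an F ratio for either component would further require the error mean square, which is not computed here.

```python
def double_center(table):
    """Remove row, column, and grand-mean effects, leaving interaction residuals."""
    n_rows, n_cols = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (n_rows * n_cols)
    row_means = [sum(row) / n_cols for row in table]
    col_means = [sum(table[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    return [[table[i][j] - row_means[i] - col_means[j] + grand
             for j in range(n_cols)] for i in range(n_rows)]


def partition_interaction(cell_means, n_per_cell, predicted_pattern):
    """Split the interaction SS into a 1-df predicted component and a remainder."""
    residuals = double_center(cell_means)
    weights = double_center(predicted_pattern)  # interaction contrast weights
    ss_interaction = n_per_cell * sum(r * r for row in residuals for r in row)
    psi = sum(w * m
              for w_row, m_row in zip(weights, cell_means)
              for w, m in zip(w_row, m_row))
    ss_predicted = n_per_cell * psi ** 2 / sum(w * w for row in weights for w in row)
    return ss_predicted, ss_interaction - ss_predicted


# Rows: neutral, enhancement, threat; columns: Trials 1, 2, 3.
INTERFERENCE = [[0, 1, 0], [0, 0, 0], [0, 0, 0]]  # only the neutral group is high on Trial 2
REPRESSION = [[0, 1, 0], [0, 1, 0], [0, 0, 0]]    # neutral and enhancement are high on Trial 2

# Hypothetical cell means (NOT the data of this study); 12 subjects per cell assumed.
hypothetical_means = [[10.0, 14.0, 15.0],
                      [10.0, 11.0, 15.0],
                      [10.0, 10.5, 15.0]]

for name, pattern in (("interference", INTERFERENCE), ("repression", REPRESSION)):
    predicted, remainder = partition_interaction(hypothetical_means, 12, pattern)
    print(f"{name}: predicted SS = {predicted:.2f}, remainder SS = {remainder:.2f}")
```

A hypothesis fits well when its predicted component is large and its remainder is small, which is the two-part criterion applied in the text.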

That the performance of the groups con- 
formed to what was expected by the interfer- 
ence hypothesis is further corroborated by the 
following subanalyses of the data. Between
Recall Trials 1 and 2, the neutral group
showed greater improvement in recall than
either the enhancement or threat groups (F
= 2.79, df = 1/20, p = .107; F = 18.84, df
= 1/22, p < .001). Looked at in another
way, the subjects in the neutral group im-
proved in performance (F = 66.00, df = 1/11,
p < .001), while those in the enhancement
group remained the same, and those in the
threat group deteriorated somewhat (F =
1.73, df = 1/11, p = .213). The fact that the
subjects in the enhancement and threat groups 
performed in essentially the same manner and 
differed from the subjects in the neutral group 
could lead to the conclusion that feedback, 
irrespective of whether it was very positive or 
very negative, had the effect of interfering 
with recall performance, and that it was this 
interference rather than repression that re- 
sulted in the reduced level of recall. 

One alternative hypothesis must be exam- 
ined, however. It may be that the reduced 
recall on Trial 2 evidenced by the subjects in 
the threat group was due to repression, while 
the reduced recall on Trial 2 evidenced by 
the subjects in the enhancement group was 

3 It should be noted that these two sets of comparisons
are not orthogonal and that the only difference
between the two components revolves
around one mean. Therefore, in view of that and
the fact that the component of the interaction predicted
by interference was significant at the .01 level,
it is not surprising that the component of the interaction
predicted by repression was significant at the
.05 level.

due to interference. That is, while the result-
ing overall performances were the same, the 
underlying processes may have been different. 
If repression was the underlying process, it 
would be expected that the reduced recall 
in the threat group resulted from a large 
decrement in recall of response words used by 
the subjects (i.e., those that the subjects se- 
lected as responses) because these were the 
words directly associated with threatening 
feedback. It would also be expected that 
there would be only a minimal decrement in 
the recall of the words that were not used as 
responses (i.e., those that were not selected) 
because these were not associated with threat- 
ening feedback. On the other hand, if inter- 
ference were the underlying process, it would 
be expected that the reduced recall in the 
threat group resulted from a recall decrement 
distributed across all of the words (i.e., those 
that were and were not selected as responses 
and, consequently, were and were not associ- 
ated with threatening feedback) because in- 
terference would influence all words equally. 
To test these alternative predictions, the pro- 
portions of response words recalled were com- 
pared across the three trials for the three 
groups. (Each subject’s score was the number 
of her response words recalled on a trial di- 
vided by the total number of words that she 
recalled on that trial.) Data from the neutral 
and enhancement groups were used for com- 
parison since it is clear that repression was 
not occurring in these groups. The mean pro- 
portions are presented in Table 1. Inspection 
and analysis of these data did not suggest that 
on Test 2 the subjects in the threat group 
recalled proportionally fewer response words 
than did the subjects in the other groups (F 
= .22, df = 2/31). Furthermore, a between- 
within analysis of variance comparing the 
three groups across the three tests did not 
reveal the interaction predicted on the basis 
of repression (F = .25, df = 4/62); that is, 
the subjects in the threat group did not evi- 
dence a relative reduction in their recall of 
response words on Trial 2. With regard to 
this latter analysis, it might be noted that 
there was not a significant groups effect (F =
.92, df = 2/31), but there was a significant
trials effect (F = 16.53, df = 2/62, p < .001)
which was due to the fact that in each case
after words were used as responses (i.e., be-
tween Trials 1 and 2), the degree to which
they were recalled improved relative to non-
response words, a finding that would be ex-
pected on the basis of the increased attention
that they had received as a function of being
used as responses. In summary of these analy-
ses, it is clear that no evidence was found for
the predictions based on the theory of repres-
sion, and it seems reasonable to conclude that
the reduced recall in both the enhancement
and threat groups was due to interference
caused by thoughts elicited by the feedback.

TABLE 1
MEAN PROPORTIONS OF RESPONSE WORDS RECALLED

Group          Test 1   Test 2   Test 3
Neutral          .24      .32      .34
Enhancement      .25      .35      .33
Threat           .23      .35      .34
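
The proportion score used in these analyses (response words recalled divided by all words recalled on that trial) can be written out as a brief sketch. The function names are hypothetical, and the handling of a subject who recalled no words at all on a trial (the score is left undefined and dropped from the group mean) is an assumption; the text does not say how such a case was treated, or whether one arose.

```python
def response_word_proportion(response_words_recalled, total_words_recalled):
    """Proportion of a subject's recalled words on one trial that were response words."""
    if total_words_recalled == 0:
        return None  # assumption: the score is undefined when nothing was recalled
    if not 0 <= response_words_recalled <= total_words_recalled:
        raise ValueError("response words recalled must lie between 0 and the total recalled")
    return response_words_recalled / total_words_recalled


def mean_proportion(scores):
    """Group mean of the defined scores (assumption: undefined scores are dropped)."""
    defined = [s for s in scores if s is not None]
    return sum(defined) / len(defined)


# Hypothetical counts for three subjects on one trial (not the study's data).
scores = [response_word_proportion(3, 12),
          response_word_proportion(4, 10),
          response_word_proportion(2, 8)]
print(round(mean_proportion(scores), 2))
```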


Effect of Debriefing on Recall 


A between-within analysis of variance com-
paring the performance of the three groups
over the second and third recall trials revealed
a strong trend in the direction of the pre-
dicted interaction (F = 2.21, df = 2/31, p =
.125). Nonorthogonal analyses that were car-
ried out to describe this interaction indicated
that the subjects in the enhancement and
threat groups evidenced about the same
amount of improvement between the second
and third trials (F = .12, df = 1/20, p =
.725), while both of these groups tended to
show greater improvements in performance
than did the neutral feedback group (F =
2.39, df = 1/20, p = .135; F = 3.58, df =
1/22, p = .069). Since the enhancement group
evidenced the same amount of improvement 
as the threat group, and since there was no 
evidence that repression was occurring in the 
threat group on Trial 2, it does not seem justi- 
fied to conclude that this improvement in 
performance was due to a “return of the re- 
pressed,” as it is usually interpreted. Rather 
it seems that the general improvement in each
of the three groups between Trials 2 and 3
was due to greater attention to the recall task
after the debriefing, which eliminated
the thoughts elicited by the previous manipula-
tion and made the task more salient. The
greater improvement in the threat and en- 
hancement groups as compared to the neutral 
group occurred because the subjects in these 
groups were initially (Trial 2) more distracted 
and had lower levels of performance, and, 
therefore, the redirection of their attention 
had more impact both on their attention and 
on their subsequent performance. These find- 
ings are consistent with the interference ex-
planation and inconsistent with the return-of-
the-repressed explanation.


Discussion 


From the preceding results it seems clear
that the presentation of salient feedback con-
cerning the subject’s personality interfered
with recall performance on what seemed to
the subject to be an unrelated task. Of most
importance, however, is the fact that the effect
was found regardless of whether the feedback
was ego enhancing or ego threatening. The
fact that ego-enhancing feedback had essen-
tially the same effect as ego-threatening feed-
back raises serious questions concerning the
interpretation used by many authors that the
reduced recall after threatening feedback is
due to repression. Similarly, it seems doubt-
ful that the postdebriefing improvement in re-
call performance that has been noted by many
investigators can be attributed to a return of
the repressed. The conclusion that postdebrief-
ing improvement cannot be attributed to a
return of the repressed stems from the indica-
tion of the present experiment that repression
was not occurring in the first place and from
the finding that the same effect was noted in
the ego-enhancing group in which there was
no reason to expect repression, much less a
return of the repressed.

The present results suggest that the recall
decrements found in this experiment and in
others of this type were due to problems with
retrieval from long-term memory and that
these problems were caused by attentional
factors. More specifically, because the sub-
jects were able to overcome the earlier recall
deficit when their attention was focused on
the critical material (i.e., after the debrief-
ing), it must be concluded that the earlier
recall deficit was due to poor retrieval of
learned material stored in long-term memory. 
The poor retrieval apparently resulted from 
the subjects’ attention to other (interfering) 
material. Interestingly enough, the attention- 
interference explanation is consistent with the 
results of Aborn (1953), who reported that 
when subjects were given a set to learn rather 
than incidental learning instructions, the re- 
call decrement usually attributed to repression 
did not materialize. That is, when the sub- 
jects’ set focused their attention on the criti- 
cal material, there was no decrease in the 
recall, despite the presence of threat. 

It should be noted that the present research 
is not the first instance in which the interfer- 
ence hypothesis has provided the most viable 
explanation for a performance deficit associ- 
ated with personality factors. In their exten- 
sive reviews of the research on the perform- 
ance deficit usually associated with schizo- 
phrenia, Buss and Lang (1965) and Lang and 
Buss (1965) concluded that interference was 
probably responsible for this deficit. It should 
be noted that these authors also pointed out 
that when the schizophrenics’ attention and 
sets were properly focused and maintained, 
the deficit was eliminated. This effect is not 
dissimilar from the effect found in the present 
experiment when the subjects were debriefed 
and, as a result, had their attentions redi- 
rected. What is important here is not that 
interference impairs performance, because 
that effect has been known for a long time; 
what is important is the very strong possibil- 
ity that the recall deficit that was previously 
thought to stem from repression is, in fact, a 
function of interference. This interpretation 
of the effects may find applicability beyond 
the research strategy considered in this paper. 

It might be suggested that polarizing the 
question as “repression or interference” is in- 
appropriate and that it would be more con- 
structive to conceive of interference as the
process by which repression occurs. With
regard to this, however, it should be pointed
out that the concept of repression already im-
plies a process, and that process is very dif-
ferent from the interference process. Most
important in this regard, however, is the fact
that the unconscious plays a crucial role in the
process traditionally associated with the proc- 
ess of repression, while it plays no role what- 
soever in the interference process discussed 
here. This difference would result in widely 
divergent prescriptions for dealing with ma- 
terial that has been “repressed.” In view of 
the theoretical confusions and conflicting im- 
plications that would result from the synthesis 
of these concepts, it seems desirable to main- 
tain a clear distinction between them. 

In reviewing the threat and enhancing 
feedback conditions employed in this experi- 
ment, it is important to note that in both cases 
the feedbacks were highly personal and that 
in both cases the interference that was hy- 
pothesized to result from the feedbacks 
(thoughts about the feedbacks) was produced 
by the subjects themselves. By using condi- 
tions that were so similar, the conceptual dis- 
tance between the conditions was reduced 
over what it was in the previous work 
(Holmes & Schallow, 1969), thus making it 
easier to suggest that the same process was 
operating in both conditions when the groups 
in these conditions evidenced similar per- 
formances. Despite this, however, it still must 
be pointed out that it is impossible to prove 
the null hypothesis, and, therefore, the fact 
that the threat and enhancement groups per- 
formed the same does not necessarily prove 
that repression as it is traditionally conceived 
of was not occurring in the threat group. On 
the other hand, of course, neither can it be 
proved that interference was not taking place 
in the threat group. In considering the alter- 
natives, it seems to the present author that 
parsimony as well as the weight of experi- 
mental evidence is in favor of the interference 
explanation. Independent of what actually 
was or was not occurring, and apart from 
theoretical speculations generated by this re- 
search, it must definitely be concluded that 
because of the alternative explanations for 
the data, Zeller’s (1950a) paradigm cannot 
be used to provide evidence for the concept
of repression. This leaves the concept of re-
pression in a precarious position with respect
to the necessary experimental verification.


THE HOW AND WHAT OF WHY:
SOME DETERMINANTS AND CONSEQUENCES OF
CAUSAL ATTRIBUTION 1


Brandeis University


Kelley’s attribution theory is investigated. Subjects filled out a questionnaire 
that reported 16 different responses ostensibly made by other people. These 
responses represented four verb categories—emotions, accomplishments, opinions, 
and actions—and, for experimental subjects, each was accompanied by high or 
low consensus information, high or low distinctiveness information, and high 
or low consistency information. Control subjects were not given any informa- 
tion regarding the response. All subjects were asked to attribute each response
to characteristics of the person (i.e., the actor), the stimulus, the circum-
stances, or to some combination of these three factors. In addition, the sub-
jects’ expectancies for future response and stimulus generalization on the part
of the actor were measured. The three information variables and verb category
each had a significant effect on causal attribution and on expectancy for
behavioral generalization.


Identifying the cause of an event gives it 
“meaning,” since causal knowledge carries 
with it a wide scope of connotations regarding 
an event and makes possible a more or less 
stable, predictable, and controllable world. 
What an individual expects from his environ- 
ment and what he does to influence it will 
undoubtedly depend on his beliefs about 
causality. Hence, the psychologist who 
searches for the causes of man’s behavior 
may do well to consider man’s belief about 
causality. This is the focus of attribution 
theory, which represents an attempt to de- 
termine the antecedents and consequences of 
some of man’s causal beliefs. More specifically, 
it concerns the processes through which an 
individual assigns causes to various responses 
he makes or observes and the consequences of 
his resulting beliefs about causality. While 


1 This article is based on a dissertation submitted
committee, and Wendell Garner and David Hamil-
McArthur, Department of Psychology, Brandeis Uni-

there is no monolithic theory of attribution,
several of the current conceptualizations have 
their roots in perceptual phenomena. 
Brunswik (1952) conceived of the percep- 
tual process as an arc encompassing two end 
points—the entity represented or the distal 
stimulus and the local representation or the 
proximal stimulus. While proximal stimuli 
serve as cues to the underlying distal stimuli, 
Brunswik noted that there is no one-to-one 
connection between them, and he proposed 
that the organism must venture hypotheses as 
to what type of entity has most probably 
caused the given cues. Heider’s (1958) at- 
tribution theory derives directly from Bruns- 
wik’s treatment of perception. Just as the 
perceiving organism must integrate the highly 
variable cues given in proximal stimulation in 
order to “infer” the relatively unchanging 
object that gave rise to them, the attributing 
organism, according to Heider, must integrate 
the cues given in responses in order to infer 
the more stable factors that gave rise to them. 
Kelley’s (1967) conceptualization, which is 
derived directly from Heider, details the cog- 
nitive processes that might be engaged in by 
the individual during this “inference.” He 
proposed that the individual interprets a 
given response in the context of the informa- 
tion gleaned from experimentlike variations of 
conditions. The conditions varied are (a)
entities from which distinctiveness informa-
tion is obtained (i.e., whether or not the re- 
sponse occurs when other entities are pres- 
ent); (b) persons from whom consensus in- 
formation is obtained (i.e., whether or not the 
same response is produced by other people in 
the presence of the entity); and (c) time/ 
modalities from which consistency informa- 
tion is obtained (i.e., whether or not the re- 
sponse occurs whenever the entity is presented 
and in whatever way it is presented). 

Kelley presented these three sources of in- 
formation in an analysis of variance model 
for which entities, persons, and time/modali- 
ties are orthogonal dimensions. For the simple 
case in which information can take on one of 
two values (i.e., high or low distinctiveness, 
high or low consensus, high or low consis- 
tency), eight information configurations can 
be generated from this model. Only a few of 
these are discussed by Kelley (1967), whose 
major prediction is that “external attribution 
is made when evidence exists as to the (high) 
distinctiveness, consistency, and consensus of 
the appropriate effects [p. 196].” 
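
The eight configurations that follow from crossing three two-valued information variables can be enumerated in a short sketch. The variable and level names are taken from the text; the data structure and the function name are assumptions made for illustration only.

```python
from itertools import product

VARIABLES = ("consensus", "distinctiveness", "consistency")
LEVELS = ("high", "low")


def information_configurations():
    """Return every high/low combination of the three information variables."""
    return [dict(zip(VARIABLES, combo))
            for combo in product(LEVELS, repeat=len(VARIABLES))]


configurations = information_configurations()

# The configuration addressed by Kelley's major prediction of external attribution:
# high distinctiveness, high consistency, and high consensus.
external = [c for c in configurations if all(v == "high" for v in c.values())]

for config in configurations:
    print(config)
```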

The primary goal of the present study was 
to investigate the following aspects of Kelley’s 
model: (a) What causal attributions are fa- 
cilitated by various combinations of consensus, 
distinctiveness, and consistency information? 
(b) Do each of these sources of information 
independently affect causal attributions? (c) 
How do these sources of information interact 
with one another? (d) What is the relative 
importance of these variables? 

In an attempt to answer these questions, 
consensus, distinctiveness, and consistency in- 
formation were manipulated orthogonally: the 
subjects were told of the occurrence of a re- 
sponse by another person; they were given 
one of the eight possible combinations of con- 
sensus, distinctiveness, and consistency in- 
formation regarding that response; and they 
were asked to indicate whether they thought 
something about the person, something about 
the stimulus entity, something about the cir- 
cumstances, or some combination of these 
three factors probably caused the response to 
occur. To provide a satisfactory means of 
answering Question a above, a group of con-
trol subjects was given no information about
the responses to be attributed. Comparisons
between the pattern of attributions to various
causes obtained for these no-information con- 
trols and the patterns obtained for experi- 
mental subjects under each of the eight infor- 
mation combinations would reveal what ef- 
fect, if any, each configuration was having on 
causal attributions. Although it was expected 
that the information variables would have 
many effects on attribution, only the follow- 
ing predictions could be explicitly derived 
from Kelley’s theory: 


1. Stimulus attribution will be more fre- 
quent when a response is characterized 
by high consensus, high distinctiveness, 
and high consistency than when no in- 
formation is given regarding that re- 
sponse. 

2. Person attribution will be more frequent 
when a response is characterized by low 
consensus, low distinctiveness, and high 
consistency than when no information is 
given regarding that response.


From these predictions regarding the com-
bined effects of consensus, distinctiveness, and
consistency information, the following predic- 
tions regarding their independent effects were 
derived: 


3. Stimulus attribution will be more fre-
quent when there is high consensus, high
distinctiveness, or high consistency than
when there is low consensus, low dis-
tinctiveness, or low consistency.

4. Person attribution will be more fre-
quent when there is low consensus, low
distinctiveness, or high consistency than
when there is high consensus, high dis-
tinctiveness, or low consistency.

5. Circumstance attribution will be more
frequent when there is low consistency
than when there is high consistency.


Differences in causal attribution as a func- 
tion of the nature of the response were also 
investigated in this study. Each subject made 
causal attributions for 16 different responses 
representing four verb categories—emotions, 
accomplishments, opinions, and actions. Al- 
though no clear predictions can be derived 
from existing theory, differences in attribu-
tion as a function of the nature of the re-
sponse would certainly not be surprising. For 
example, Heider (1958) suggested that the 
attribution of pleasure tends to be leveled at 
the environment (p. 156); and Jones, Rock, 
Shaver, Goethals, and Ward (1968) found 
that performance tended to be attributed to 
the person’s ability rather than to the diffi- 
culty of the task. Hence, one might expect 
stimulus attribution to predominate for items 
reporting a person’s pleasurable emotions (per- 
haps for unpleasurable emotions as well), 
while person attribution might be predomi- 
nant for items reporting a person’s accomp- 
lishments. 

In addition to investigating the antecedents 
of causal attributions, an attempt was made 
in the present study to determine some of the 
consequences (or concomitants) of different 
attributions. Heider (1958) suggested that an 
attribution may affect an individual’s expecta- 
tions of events given certain conditions, and 
this is borne out in several empirical studies 
(James & Rotter, 1958; Phares, 1957; Rot- 
ter, Liverant, & Crowne, 1961). While these 
studies concerned expectancies for future rein- 
forcement as a function of attributing prior 
reinforcement to one or another cause, it is 
certainly reasonable to assume that expectan- 
cies for future behavior would similarly vary 
with the cause to which prior behavior is at- 
tributed. Taking three helpings of dessert, if
attributed to a person’s “gourmandism,”
would certainly produce an expectancy re-
garding that person’s consumption of other 
foods—at least to a greater extent than would 
attribution of the same behavior to the suc- 
culence of the dessert. This latter attribution 
might, on the other hand, produce an expec- 
tancy for the person to manifest other behav- 
iors toward that particular dessert—such as 
asking for the recipe. Let us call an expec- 
tancy regarding the person’s consumption of 
other foods an expectancy for stimulus gen-
eralization inasmuch as it represents an expec-
tancy that the person will make the same re- 
sponse (consumption) to other stimuli (other 
foods). Similarly, let us call an expectancy 
regarding the person’s other behaviors toward 
that particular dessert an expectancy for re- 
sponse generalization inasmuch as it repre- 
sents an expectancy that the person will make
other responses to the same stimulus. Bearing
these definitions in mind, the following two 
predictions seem quite reasonable: 


1. Those variables that produce stimulus 
attribution will also produce a greater 
expectancy for the actor to show re- 
sponse generalization than for him to 
show stimulus generalization. 

2. Those variables that produce person 
attribution will also produce a greater 
expectancy for the actor to show stimu- 
lus generalization than for him to show 
response generalization. 


Since there are many reasonable methods 
for investigating the various aspects of causal 
attribution that have been detailed above, a 
word about the questionnaire methodology 
that was chosen is in order. This method in- 
volved presenting the subjects with a written 
description of a number of responses ostensi- 
bly made by other people. For control sub- 
jects, no additional information was given; for 
experimental subjects some combination of 
consensus, distinctiveness, and consistency in- 
formation regarding each response was pre- 
sented. All subjects were asked to indicate 
what they thought probably caused the re- 
sponse to occur—something about the person, 
the stimulus, the circumstances, or any com- 
bination thereof.

This method is limited in at least two not- 
able respects: (a) It will not reveal whether 
individuals who are attempting to find an 
appropriate attribution will seek out or utilize 
consensus, distinctiveness, or consistency in- 
formation of their own accord; and (b) the 
results may be generalized only to causal 
attributions made for another person’s behav-
ior; they may or may not apply to causal
attributions for one’s own behavior. With re- 
spect to this second limitation, it should be 
noted that Kelley indicated that his model 
should apply to attributions made for another 
person’s behavior as well as for one’s own— 
and the former is certainly worthy of investi- 
gation in its own right. With respect to the 
first limitation, I can only say that once it has 
been demonstrated that individuals can use 
these information variables systematically, it 
will certainly be important to investigate
whether or when they will spontaneously use
the information.

The questionnaire methodology does have 
certain advantages as well. A much broader 
scope of investigation is possible, since each 
subject can be asked to make attributions for 
a number of behaviors. The presentation of 
information is also facilitated by the question- 
naire format: the time span necessary to “act 
out” consensus, distinctiveness, and consis- 
tency information would preclude investiga- 
tion of all possible combinations of these 
variables in a single study. It should be 
pointed out that the questionnaire method 
does not completely sacrifice realism to 
breadth of inquiry. Although both the event 
description and the information presentation 
are symbolic and brief rather than live and 
lengthy, this state of affairs is not uncommon 
in everyday life. Someone will often tell you 
what so and so did, and he will feel quite 
justified in asking you why you think he did 
it. And, you would probably be quite willing 
to give your “causal opinion” with even the 
scantiest information. It seems then that the 
methodology necessary to study causal attri- 
butions need not create a particular psycho- 
logical state in the subject. It need only tap 
into a readily available psychological process, 
and the questionnaire method seems adequate 
for this purpose. 


METHOD

Subjects


Ninety-five Yale male undergraduates volunteered
to participate in the present study as partial fulfill-
ment of an introductory psychology course require- 
ment. Sixty-four subjects completed one of eight 
experimental questionnaires, and 23 subjects com- 
pleted the control questionnaire. Eight additional 
subjects (5 experimental and 3 control) were not 
included in the statistical analyses due to incomplete 
data from them. 


Questionnaires 


All subjects completed a 16-item questionnaire. 
Each item reported the occurrence of some response 
by another person (e.g., John laughs at the co-
median). In addition to this, the 64 experimental 
subjects were presented with three statements repre- 
senting one of eight possible combinations of con- 
sensus, distinctiveness, and consistency information
regarding that response. Consensus information took
the form:
   a. Almost everyone who (hears the comedian
      laughs at him): high.
or b. Hardly anyone who (hears the comedian
      laughs at him): low.
Distinctiveness information took the form:
   a. (John) does not (laugh at) almost any other
      (comedian): high.
or b. (John) also (laughs at) almost every other
      (comedian): low.
Consistency information took the form:
   a. In the past (John) has almost always
      (laughed at the same comedian): high.
or b. In the past (John) has almost never
      (laughed at the same comedian): low.
Four verb categories (emotions, accomplishments,
opinions, and actions) were represented by four
items apiece. Some typical items were: “Sue is afraid
of the dog” (emotion); “George translates the sen-
tence incorrectly” (accomplishment); “Bill thinks his
teacher is unfair” (opinion); and “Jack contributes
a large sum of money to an automobile-safety fund”
(action).4 Each of the 16 items was paired with each
of the eight possible information combinations in the
following manner: Items 1-8 were paired with each
of the information combinations to form an 8 X 8
Latin square design; this design was duplicated by
yoking Items 9-16 to Items 1-8. Hence, within any
given questionnaire (i.e., within any column of the
duplicated 8 X 8 Latin square), Item 9 was paired
with the same information combination as Item 1,
Item 10 with the same information as Item 2, and
so on. For each of the eight resultant questionnaire
forms, five random orders of items were constructed.
Each form was completed by eight subjects.
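
The pairing of items with information combinations can be sketched as follows. The text does not say which 8 X 8 Latin square was used, so the cyclic construction below is an assumption, as are the function name and the numbering of the combinations from 0 to 7. The sketch reproduces only the structural properties stated above: each of Items 1-8 meets every combination once across the eight forms, and Items 9-16 are yoked to Items 1-8.

```python
from collections import Counter

N_COMBINATIONS = 8
N_ITEMS = 16


def build_forms():
    """Return eight questionnaire forms as {item number: combination index} mappings."""
    forms = []
    for form in range(N_COMBINATIONS):
        assignment = {}
        for item in range(1, N_ITEMS + 1):
            base_item = (item - 1) % N_COMBINATIONS  # Item 9 is yoked to Item 1, and so on
            # Cyclic Latin square (an assumed construction, not necessarily the one used)
            assignment[item] = (base_item + form) % N_COMBINATIONS
        forms.append(assignment)
    return forms


forms = build_forms()

# Within any one form, each information combination is used for exactly two items.
for assignment in forms:
    assert set(Counter(assignment.values()).values()) == {2}

print(forms[0])
```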
Instructions to Subjects


The following written instructions for completing 
the questionnaire were presented to experimental sub- 
jects along with a sample item: 


This questionnaire contains a number of state- 
ments which report the occurrence of some event. 
Following each statement you will find three items 
of information, all of which apply to the event 
reported. Your task is to decide, on the basis of
the information given, what probably caused the
event to occur. You will be asked to choose among
four alternative causes and to indicate your choice 
by circling the letter next to the cause which you 
think is most probable. 

Having clarified what you feel to be the cause 
of some event, you will be asked to estimate the 
probability of occurrence of related events and to
indicate your estimate by placing an “X” between
the vertical lines on the scale provided.


4 A complete presentation of the questionnaire
items as well as the response and stimulus generali-
zation questions, analyses of variance, and means
and proportions are tabled in Appendices A, B, and
C (pp. 104-128) of the PhD dissertation on which
this article is based (McArthur, 1970).


Instructions to control subjects omitted any reference 
to “information,” but were otherwise identical to 
the above instructions for experimental subjects. 


Dependent Measures 


The dependent measures consisted of the cause 
that the subjects assigned to each response event and 
the probability that they assigned to the occurrence 
of related events. The four alternative causes that 
the subjects were asked to choose among took the 
following form:

a. Something about the person (e.g., John) prob-
ably caused him to make Response X (e.g.,
laugh) to Stimulus X (e.g., the comedian).

b. Something about Stimulus X probably caused 
the person to make Response X to it. 

c. Something about the particular circumstances 
probably caused the person to make Response X 
to Stimulus X. 

d. Some combination of a, b, and c above probably 
caused the person to make Response X to 
Stimulus X. 

Subjects who chose “d” were asked to
specify the particular combination of factors that
they thought caused the event (i.e., a and b, a and c,
b and c, or a, b, and c). The frequency of attribu-
tion to the three latter combinations was very low.
Hence, for statistical analyses, they were summed
and called “other” attributions. Attribution to one
joint cause—the person and the stimulus—was quite 
frequent. It was, therefore, treated as a separate 
locus of causality in the statistical analyses. 

The events whose likelihood the subjects were 
asked to estimate consisted of one instance of stimu- 
lus generalization and one instance of response gen- 
eralization for each item. The general format was 
as follows: Response generalization—How likely do 
you think the person (e.g., John) would be to make 
Response Y (e.g., tell a friend to see) to Stimulus X 
(e.g., the comedian)? Stimulus generalization—How 
likely do you think the person would be to make 
Response X (e.g., to laugh) to Stimulus Y (e.g., at a
friend’s joke)? 

Some typical generalization questions were: “How 
likely do you think Sue would be to warn a mailman 
about that dog?” (response generalization for emo- 
tion item reported above); “How likely do you 
think Sue would be to be afraid of a cat?” (stimulus 
generalization); “How likely do you think Jack 
would be to volunteer to spend five hours a week 
working on a committee to promote automotive 
safety?” (response generalization for action item 
reported above); “How likely do you think Jack 
would be to contribute a large sum of money to a 
community public library?” (stimulus generaliza- 
tion). Subjects indicated their estimates on a 9-point 
Likert-type scale with end points labeled “very 
unlikely” and “very likely.” 


RESULTS AND DISCUSSION


Since neither a 5 X 8 (Locus of Attribution
X Questionnaire Form) nor a 5 X 5 (Locus
of Attribution X Random Order of Items)
chi-square analysis performed on the 1,024
causal attributions proved significant (both
ps > .25), analyses of variance were per-
formed on these data without regard to the
questionnaire from which they were obtained. 
Analyses of variance rather than chi-square 
analyses were used on these frequency data 
both because they are less cumbersome to per- 
form and also because they permit better de- 
scription of the data. Because the items that 
were sampled contributed much more variance 
than did the subjects sampled, the subjects 
were not included as a way of the design in 
these analyses, and item-within-verb-category 
effects were employed as error terms.⁵ (Al-
though items contributed more variance than
subjects, it should be noted that there was
both remarkable consistency of the informa-
tion effects across items and considerable
similarity of attributions and expectancies for 
generalization among items within a given 
verb category.) 

An overall analysis of variance performed 
on the experimental subjects’ attributions to 
all five causes revealed that each of the inde- 
pendent variables had marked effects on 
causal attribution. Separate 4(4) X 2 X 2 X 2
(Item within Verb Category X Consensus X 
Distinctiveness X Consistency) analyses of 
variance were, therefore, performed on the fol- 
lowing dependent variables: person attribu- 
tion, stimulus attribution, circumstance at- 
tribution, person-stimulus attribution, other 
attribution, expectancy for response gener-
alization, and expectancy for stimulus gen-
eralization. Both information effects and verb
category effects are reported for each of
these analyses. Similar analyses were per-
formed on the data from control subjects. An
overall analysis of variance on the control
subjects’ attributions to all five causes re-
vealed that the one independent variable—
verb category—had marked effects on causal
attribution, and 4(4) (item-within-verb-cate-
gory) analyses were therefore performed on
each of the seven dependent variables. Only
verb category effects are reported for these
no-information control subjects.

5 The particular error terms used in these analyses
were the appropriate higher order interactions. Al-
though the frequencies being analyzed were corre-
lated, these error terms are acceptable for the follow-
ing reason: the appropriate contrasts revealed that
the causal attributions obtained for items completed
by the same subjects were not significantly more
similar than those obtained for items completed by
different subjects; that is, the item made much more
difference than who was responding to it. This was
also true for the data on expectancies for generaliza-
tion. Hence, the correlation between items introduced
as a function of their being completed by the same
subject was negligible.

To facilitate readability, higher order inter- 
actions that are weak or inexplicable and ex-
act F and p values are generally omitted from
the text. All effects that are reported are 
significant at the .05 level or better (two- 
tailed). 

In addition to the analyses of variance, 
chi-square analyses were performed contrast- 
ing the proportion of person, stimulus, circum- 
stance, person-stimulus, and other attribu- 
tions given no information with that given 
each of the eight information combinations. 
Because these proportions are correlated (each 
subject made two attributions within a given 
information combination), the chi-square 
analyses are not strictly appropriate, and 
magnitude of the effects may be somewhat 
inflated. To compensate for this, a more 
stringent criterion for rejecting the null 
hypothesis was adopted.⁷


CAUSAL ATTRIBUTIONS 
Overall Effects: Results and Discussion 


The analysis of variance performed on the 
experimental subjects’ attributions to all five 
causes yielded an attribution main effect 
which reflected the following descending order 
of attribution frequency: person > circum- 
stance > other > stimulus > person-stimulus 
(see Table 1). Comparisons between the mean 
frequencies, using the studentized range sta-
tistic, showed that, with one exception, the
frequency of attribution to each cause dif-
fered significantly from the frequency of at-
tribution to all others. (The frequency of
stimulus attribution did not differ significantly
from that of person-stimulus attribution.)


TABLE 1

PROPORTION OF ATTRIBUTIONS MADE TO EACH CAUSAL
LOCUS AMONG EXPERIMENTAL AND NO
INFORMATION CONTROL SUBJECTS

                             Treatment
Attribution locus     Experimental   Control

Person                    .29          .24
Stimulus                  .14          .18
Circumstance              .24          .10
Person-stimulus           .14          .23
Other                     .19          .25


6 The effects of these independent variables on
“other” attribution are not reported. Inasmuch as
this is a residual category, comprised of three dif-
ferent causal attributions, it is difficult to interpret
these effects. However, the interested reader will find
the data reported in McArthur (1970) (see Footnote
4).

7 Seven of the 11 effects reported are significant
at the .001 level, 2 are significant at the .01 level,
and 2 at the .02 level.

The analysis of variance performed on con- 
trol subjects’ attributions to all five causes 
also yielded an attribution main effect. The 
following descending order of attribution
frequency was obtained: other > person > 
person-stimulus > stimulus > circumstance 
(see Table 1). Comparisons between the mean 
frequencies, using the studentized range sta- 
tistic, revealed that the frequency of circum- 
stance attribution was significantly smaller 
than that of person-stimulus, person, or other 
attribution. 

Although differences in the overall fre- 
quency of attribution to the five possible 
causes obtained in the present study are un- 
related to Kelley’s model, they are neverthe- 
less quite interesting. For both experimental 
and control subjects the frequency of person 
attribution exceeds that of stimulus attribu- 
tion. This relationship obtains for 16/16 items 
among experimental subjects and for 10/16 
items among control subjects. Although the 
magnitude of this effect is small, its relative
consistency makes it noteworthy; in all other
respects the pattern of causal attributions
among experimental subjects differs quite
markedly from that of the control subjects.
These differences may be roughly summarized
as a tendency for subjects to make more
complex—that is, qualified—attributions when
they are given no information about the re-
sponse. For example, simple circumstance at-
tribution is low among control subjects, while
more complicated circumstantial attribution,
that is, “other” attribution—circumstances
together with characteristics of the person or
characteristics of the stimulus, or both—is
high. The reverse holds true for experimental
subjects. Also, person-stimulus attribution is
considerably higher among control subjects,
reflecting their apparent hesitancy to make 
an unqualified attribution. But, despite this 
preference for the complex and qualified 
cause, when control subjects do make a uni- 
lateral attribution, it is more often to the 
person than to the stimulus. This fact really 
becomes intriguing only on closer considera- 
tion of the behaviors that the subjects were 
asked to attribute. 

One is not particularly surprised to find 
that events such as “Linda receives three 
invitations to the church picnic,” “Henry gets 
a birdie on the fifth hole,” “Jack contributes 
a large sum of money to an auto-safety fund,” 
and “Diane pledges to participate in a week- 
long hunger strike protesting inadequate auto-
safety features” were overwhelmingly attrib-
uted to characteristics of Linda, Henry, Jack, 
and Diane (a total of 37 person attributions) 
rather than to the picnic, the fifth hole, or 
inadequate auto-safety features (a total of 
one stimulus attribution). These events are 
clearly rather unusual, and one assumes with- 
out being told that few people receive three 
invitations to the picnic, get a birdie on the 
fifth hole, contribute large sums of money to 
an auto-safety fund, or pledge to participate 
in a week-long hunger strike. It seems appar- 
ent that characteristics of the people involved 
are responsible for these events, unless in- 
formation to the contrary is provided. But, 
why are mundane events such as “Sue is 
afraid of the dog,” “George translates the
sentence incorrectly,” “Ralph trips over 
Joan’s feet while dancing,” and “Steve puts 
a bumper sticker advocating improved auto 
safety on his car” overwhelmingly attributed 
to characteristics of Sue, George, Ralph, and 
Steve (a total of 35 person attributions)
rather than to the ferocity of the dog, the
difficulty of the sentence, the clumsiness of
Joan, or the attractiveness of the bumper
sticker (a total of three stimulus attribu-
tions)? One is hard pressed to come up with
any logical explanation of this proclivity for
person attribution. Certainly the real-world
incidence of fearful people, dumb people,
clumsy Ralphs, and bumper sticker buffs does
not exceed the incidence of ferocious dogs,
difficult sentences, clumsy Joans, and beauti-
ful bumper stickers! One can only conclude
that there exists a bias in favor of attributing
behavior to characteristics of the person
rather than to the stimulus properties of his
environment. In other words, people are
not naive “stimulus-response theorists,” but,
rather, they are naive “black-box theorists,”
looking inside the organism for the causes of
behavior rather than to external stimuli. Per-
haps I am too, for I have just attributed a
high incidence of person attribution to the
characteristics of my subjects rather than to
characteristics of my items. However, there
is some further data which bolsters this
“black-box” conclusion.

8 The frequencies of person and stimulus attribu-
tion reported here are taken from the control data,
which are a purer measure of the relative preference
for person attribution. The experimental data are
in the same direction, but they are weaker since each
item is paired with each information combination,
which, of course, has an equalizing effect on attribu-
tions.

Cohen (1969) conducted a questionnaire 
study in which subjects were presented with 
64 items representing the occurrence of re- 
sponses made by other persons. The subjects 
were asked to divide up 10 “causal points” 
between three possible loci: the person, the 
stimulus, and the circumstances. Across this 
relatively wide sample of items, the mean
allocation of causality to the person (5.34)
was significantly greater than the mean alloca-
tion to the stimulus (3.70). Using another
sample of 64 items, Paquette⁹ similarly
found that a greater proportion of the total 
attributions were made to the person than to 
the stimulus (.67 versus .21). 


Information Effects: Results 


Person attribution. The analysis of variance 
performed on the frequency of person attribu- 
tion revealed significant main effects for all 
three information variables. As predicted, the 
frequency of person attribution was greater 
with low than with high consensus informa-
tion, with low than with high distinctiveness
information, and with high than with low con-
sistency information (see Figure 1). Distinc-
tiveness information had the greatest effect
on person attribution, accounting for 21.72%
of the total variance.¹⁰ Second was con-
sistency information, which accounted for
15.76% of the variance. Consensus informa-
tion had the smallest effect, accounting for
only 6.25% of the total variance in person
attribution.

9 P. Paquette. Unpublished research, Yale Univer-
sity, 1970.

FIG. 1. Proportion of total attributions made to
the person as a function of no information and
consensus, distinctiveness, and consistency informa-
tion.

Significant second-order interactions be- 
tween the three sources of information some- 
what qualify the above-mentioned main ef- 
fects. The consensus and distinctiveness 
effects were markedly diminished under low 
consistency. Furthermore, a significant Con- 
sensus X Distinctiveness interaction revealed 
that low consensus information significantly 
increased the probability of a person attribu- 
tion only when it was paired with low dis- 
tinctiveness information (see Figure 1). A 
closer look at the interactions between the 
three sources of information in person attribu- 
tion suggests a multiplicative relationship: 
each variable acts to amplify the effects of 
the other two. 

Two information combinations yielded a 
significantly higher proportion of person at- 
tribution than did no information. As pre- 
dicted, there was a significantly higher pro- 
portion of person attribution obtained with 
low consensus, low distinctiveness, and high
consistency (.86) than with no information
(.24, p < .001). In addition, there was a
higher proportion of person attribution ob-
tained with high consensus, low distinctive-
ness, and high consistency information (.45)
than with no information (.24, p < .001; see
Figure 1).

10 Expected values of the mean square for each
independent variable were computed from the ob-
tained mean squares; each of these estimated variance
components was divided by the total variance to
obtain the percentage of variance accounted for by
a particular variable. The same procedure was used
for each dependent variable. The percentage of vari-
ance in causal attributions accounted for by each of
the independent variables is summarized in Table 2.

Stimulus attribution. The analysis of vari-
ance performed on the frequency of stimulus
attribution revealed significant main effects 
for all three information variables. As pre- 
dicted, the frequency of stimulus attribution 
was greater with high than with low con- 
sensus information, with high than with low 
distinctiveness information, and with high 
than with low consistency information (see 
Figure 2). As in the case of person attribu- 
tion, distinctiveness information accounted for 
the most variance (12.12%), followed by 
consistency (5.88%) and consensus (5.17%)
information.

Significant second-order interactions be-
tween the three sources of information re-
vealed that, just as for person attribution, the
consensus and distinctiveness effects were 
greater under high than under low consist- 
ency information (see Figure 2). Moreover, 
comparisons between the means revealed that 
the distinctiveness effect was not significant 
under low consistency (p > .30). Also similar 
to person attribution was the significant Con- 
sensus X Distinctiveness interaction, which 
revealed that high consensus information sig- 
nificantly increased the probability of a stimu- 
lus attribution only when it was paired with 
high distinctiveness information. 

Only one information combination—high 
consensus, high distinctiveness, and high con-
sistency—yielded a significantly higher pro-
portion of stimulus attribution than did no
information (P = .61 versus .18, p < .001;
see Figure 2).

FIG. 2. Proportion of total attributions made to
the stimulus as a function of no information and
consensus, distinctiveness, and consistency informa-
tion.

Circumstance attribution. The analysis of 
variance on the frequency of circumstance at- 
tribution revealed a significant main effect 
for two information variables. The frequency
of circumstance attribution was greater with 
high than with low distinctiveness informa- 
tion and, as predicted, with low than with 
high consistency information. Consensus in- 
formation did not have a significant effect on 
the frequency of circumstance attribution 
(see Figure 3). By far the most variance in 
circumstance attribution was accounted for 
by consistency information (41.36%), while 
distinctiveness and consensus information 
accounted for 7.58% and .30%, respectively.

Significant second-order interactions be- 
tween the three sources of information re- 
vealed that the distinctiveness effect was sig- 
nificant under low consistency (p < .002), 
but not under high consistency (p > .20). 

All four information combinations that con- 
tained low consistency information yielded a 
significantly higher proportion of circum- 
stance attribution than no information did 
(all ps < .001; see Figure 3). 

Person-stimulus attribution. The analysis of 
variance performed on the frequency of 
person-stimulus attribution revealed signifi- 
cant main effects for two information vari- 
ables. The frequency of attribution to 
person-stimulus was greater with high than 
with low consensus information and with high 
than with low consistency information. Dis- 
tinctiveness information did not have a sig- 
nificant effect on the frequency of person- 
stimulus attribution (see Figure 4). As in the
case of circumstance attribution, consistency
information accounted for the most variance
in person-stimulus attribution (16.04%), fol-
lowed by consensus and distinctiveness infor-
mation, which accounted for .64% and 0%,
respectively.

FIG. 4. Proportion of total attributions made to
the person and the stimulus as a function of no
information and consensus, distinctiveness, and con-
sistency information.

A significant Consensus X Distinctiveness
second-order interaction markedly qualifies 
the above-mentioned consensus effect. Person- 
stimulus attribution was more frequent under 
low consensus-high distinctiveness than under 
low consensus—low distinctiveness (p < .01) 
or high consensus-high distinctiveness (p < 
.10). Similarly, person-stimulus attribution 
was more frequent under high consensus—low 
distinctiveness than high consensus-high dis- 
tinctiveness (p < .01) or low consensus—low 
distinctiveness (p < .002). A significant Con-
sensus X Distinctiveness X Consistency triple-
order interaction for person-stimulus attribu- 
tion revealed that the above Consensus X 
Distinctiveness interaction occurred only 
under high consistency (see Figure 4). 

Two information combinations—high con- 
sensus, low distinctiveness, and high consist- 
ency and low consensus, high distinctiveness, 
and high consistency—yielded a significantly 
higher proportion of person-stimulus attribu- 
tion than no information did (P = .35 versus
.23).


Information Effects: Discussion 


The results of this study both support and 
go beyond the predictions made regarding the 
effects of consensus, distinctiveness, and con- 
sistency information on causal attributions. 
As predicted, high consensus, high distinctive-
ness, and high consistency information in
combination produced significantly more stim-
ulus attribution than did no information. Also
as predicted, low consensus, low distinctive- 
ness, and high consistency information in 
combination produced significantly more per- 
son attribution than did no information. The 
derivative predictions concerning the inde- 
pendent effects of the three information varia- 
bles were also confirmed. Low consistency 
produced more frequent circumstance attribu- 
tion than high consistency; high consensus, 
high distinctiveness, and high consistency each 
produced more frequent stimulus attribution 
than low consensus, low distinctiveness, and 
low consistency, respectively; and low con- 
sensus, low distinctiveness, and high consis-
tency each produced more frequent person
attribution than high consensus, high distinc- 
tiveness, and low consistency, respectively. 

The above consensus and distinctiveness ef- 
fects were somewhat qualified by interactions 
with consistency information. The tendency 
for high consensus and high distinctiveness in- 
formation to increase the frequency of stimu- 
lus attribution and the complementary ten- 
dency for low consensus and low distinctive- 
ness information to increase the frequency of 
person attribution were substantially dimin-
ished in the presence of low consistency infor-
mation (see Figures 1 and 2). A parallel ef- 
fect was obtained for attributions to the com- 
bined cause of the person and the stimulus. 
The tendency for “mixed” combinations of 
consensus and distinctiveness information 
(i.e., one “high” and the other “low”) to
increase the frequency of person-stimulus at- 
tribution held true only when these informa- 
tion combinations were coupled with high 
consistency (see Figure 4). 

Although these interactions were not ex- 
plicitly predicted, they are anticipated by 
Kelley (1967), who stated that “The attribu- 
tion he makes on any given occasion depends 
on some sampling of the information avail- 
able to him . . . the more consistent this in- 
formation is, the more stable will his attribu- 
tion be [p. 198].” Inasmuch as causal attribu- 
tion to characteristics of the person, the 
stimulus, or both are inherently stable attribu- 
tions, one would expect low consistency in- 
formation to preclude such attribution. For 
example, while the information that almost 
everyone else made the same response to this 
stimulus (high consensus) or that this person
does not respond in the same fashion to most 
other stimuli (high distinctiveness) both 
strongly suggest that this particular stimulus 
is the cause of the event, the additional in- 
formation that “in the past this person al- 
most never responded in the same fashion to 
this stimulus” (low consistency) seems to de- 
mand attribution of the event to particular 
circumstances, and to preclude causal attribu- 
tion to stable properties of the stimulus. Simi- 
larly, the information that hardly anyone else 
made the same response to this stimulus (low 
consensus) or that this person also responds in 
the same fashion to most other stimuli (low 
distinctiveness) both strongly suggest that this 
person’s disposition is the cause of the event. 
But the additional information that “in the 
past this person almost never responded in the 
same fashion to this stimulus” (low consis- 
tency) challenges that conclusion and calls
for attribution to a transient circumstantial
cause rather than to stable properties of the
person. Indeed, it is surprising that the effects
of consensus and distinctiveness information
on person and stimulus attribution are often
merely diminished in the presence of low con-
sistency information rather than obliterated.
It was found that a simple multiplicative 
model fits the person attribution data quite 
well. That is to say, the separate effects 
cumulate such that the frequency of person 
attribution for any given information combi- 
nation can be derived from multiplication of 
the marginal effects. It appears then that low 
consistency information does not deter sub- 
jects from making attributions to stable prop- 
erties of the person or the stimulus as much 
as one might “logically” expect. 
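
One way to read the multiplicative model is sketched below in Python. This is an assumed formalization, not the article's own computation: each predicted cell proportion is taken to be the grand mean multiplied by the ratio of each marginal mean to the grand mean. The marginal values are hypothetical numbers chosen only for illustration; they are not data from the study, and the names `multiplicative_prediction`, `GRAND`, `MARGINALS`, and `predict` are invented for this sketch.

```python
def multiplicative_prediction(grand_mean, marginal_means):
    """Predict a cell proportion as the grand mean times each marginal ratio."""
    predicted = grand_mean
    for marginal in marginal_means:
        predicted *= marginal / grand_mean
    return predicted


# Hypothetical marginal proportions of person attribution (NOT the article's data).
GRAND = 0.30
MARGINALS = {
    "consensus": {"low": 0.40, "high": 0.20},
    "distinctiveness": {"low": 0.45, "high": 0.15},
    "consistency": {"high": 0.42, "low": 0.18},
}


def predict(consensus, distinctiveness, consistency):
    """Predicted person-attribution proportion for one information combination."""
    return multiplicative_prediction(
        GRAND,
        [
            MARGINALS["consensus"][consensus],
            MARGINALS["distinctiveness"][distinctiveness],
            MARGINALS["consistency"][consistency],
        ],
    )


# Size of the consensus effect (low minus high) under each consistency level,
# holding distinctiveness at "low".
effect_high_consistency = predict("low", "low", "high") - predict("high", "low", "high")
effect_low_consistency = predict("low", "low", "low") - predict("high", "low", "low")
print(round(effect_high_consistency, 2), round(effect_low_consistency, 2))
```

Under this reading, the consensus effect shrinks under low consistency but does not vanish, which is the qualitative pattern the text describes as "merely diminished" rather than obliterated.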

In addition to their interaction with con- 
sistency information, consensus and distinc- 
tiveness information interacted with one an- 
other. The tendency for low consensus to 
produce a greater frequency of person attribu- 
tion than high consensus was diminished in 
the presence of high distinctiveness informa-
tion (see Figure 1). Similarly, the tendency
for high consensus information to produce a 
greater frequency of stimulus attribution than 
low consensus was diminished in the presence 
of low distinctiveness information (see Figure
2). In short, consensus and distinctiveness
information had a substantial effect on person
attribution and stimulus attribution only when 
these information variables were working in 
the same direction (i.e., when they were both
“high” or both “low”). When they were
working at “cross-purposes” (i.e., one “high”
and the other “low”), person-stimulus attri-
bution was more frequent than either person
attribution or stimulus attribution (see Figure 
4). 

A consideration of the “meaning” of these 
information combinations suggests why this 
occurred. As discussed above, low consensus 
and low distinctiveness information seem to 
imply that a dispositional property of the
person caused the event to occur. One might 
expect that substituting high consensus for 
low consensus information or high distinctive- 
ness for low distinctiveness information 
would simply weaken this implication. But a 
closer look at these “mixed” information 
combinations suggests that they are qualita- 
tively, rather than quantitatively, different 
from the low-consensus—low-distinctiveness 
combination. 

The information that hardly anyone else 
makes the same response to this stimulus (low 
consensus) together with the information that 
this person does not make the same response 
to other stimuli (high distinctiveness) sug- 
gests attribution to a cause qualitatively dif- 
ferent from a dispositional property of the 
person. This information suggests that the 
cause of the event is not a generalized re- 
sponse tendency but rather a more specific 
habit or attitude characterizing the interac- 
tion of the person with this particular stimu- 
lus. In line with this reasoning is the finding 
that the information combination of low con- 
sensus, high distinctiveness (and high con- 
sistency) produced significantly more person- 
stimulus attribution than did no information, 
while it did not produce significantly more 
person attribution. 

Just as low-consensus—high-distinctiveness 
information seems to differ qualitatively from 
low consensus-low distinctiveness, so does 
high consensus—low distinctiveness. The in- 
formation that almost everyone makes the 
same response to this stimulus (high con- 
sensus) together with the information that 
this person makes the same response to many 
other stimuli (low distinctiveness) suggests
two equally plausible causes for the event. 
From the high consensus information one may 
infer that the stimulus has particular charac- 
teristics that caused the event to occur; from 
the low distinctiveness information, one may 
infer that the person has particular character- 
istics that caused the event to occur. So, for 
this information combination, a person-stimu- 
lus attribution seems most appropriate. And,
in fact, the information combination of high
consensus, low distinctiveness (and high con-
sistency) produced significantly more person-
stimulus attribution than did no information.
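
The qualitative pattern discussed in this section can be summarized as a small lookup rule. The Python sketch below is a deliberate simplification: it returns only the single locus that the discussion singles out for each information combination, ignores the graded frequencies actually observed (for example, high consensus, low distinctiveness, and high consistency also raised person attribution), and uses a hypothetical function name, `predicted_locus`.

```python
LEVELS = ("high", "low")


def predicted_locus(consensus, distinctiveness, consistency):
    """Return the attribution locus favored by one information combination.

    Simplified reading of the discussion:
      - low consistency points to the particular circumstances;
      - high consensus + high distinctiveness (high consistency) -> stimulus;
      - low consensus + low distinctiveness (high consistency) -> person;
      - "mixed" consensus/distinctiveness (high consistency) -> person-stimulus.
    """
    for level in (consensus, distinctiveness, consistency):
        if level not in LEVELS:
            raise ValueError(f"level must be 'high' or 'low', got {level!r}")
    if consistency == "low":
        return "circumstance"
    if consensus == "high" and distinctiveness == "high":
        return "stimulus"
    if consensus == "low" and distinctiveness == "low":
        return "person"
    return "person-stimulus"


print(predicted_locus("low", "high", "high"))
```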

It should be pointed out that the “mean- 
ing” of a person-stimulus attribution given 
low-consensus-high-distinctiveness informa- 
tion differs from that given high-consensus— 
low-distinctiveness information. As mentioned 
above, the former suggests that a true inter- 
action between the person and the stimulus is 
the most probable cause of the event. The 
latter, on the other hand, suggests that the 
person and the stimulus are each sufficient 
causes for the event. An example might help 
to clarify this distinction. If Ralph trips over
Joan’s feet while dancing and


a. Hardly anyone else who dances with Joan trips
over her feet (low consensus)

b. Ralph does not trip over almost any other part-
ner’s feet (high distinctiveness)

c. In the past Ralph has almost always tripped over
Joan’s feet (high consistency),


then what seems to have caused the event to 
occur is that Ralph and Joan do not groove 
together very well on the dance floor. The 
cause is an interaction between the person and 
the stimulus. On the other hand, if 


a. Almost everyone else who dances with Joan trips 
over her feet (high consensus) 

b. Ralph also trips over almost every other partner’s
feet (low distinctiveness)

c. In the past Ralph has almost always tripped over
Joan’s feet (high consistency), 


then what seems to have caused the event to 
occur is that (a) Ralph is a clod, and (b) 
Joan is a clod. The cause is both the person 
and the stimulus. The dual—as opposed to 
shared—causality that this latter information 
combination implies may be responsible for 
the finding that high consensus, low distinc-
tiveness, and high consistency information not
only increased the frequency of person-stimu-
lus attribution over that produced given no
information, but it also increased the fre-
quency of person attribution.


TABLE 2

PERCENTAGE OF THE TOTAL VARIANCE IN CAUSAL ATTRIBUTIONS ACCOUNTED FOR
BY EACH OF THE INDEPENDENT VARIABLES

                                                       Causal attribution
Variable                                 Overall   Person   Stimulus   Circumstance   Person-stimulus

Consensus information                      2.90%    6.25%     5.17%        .30%             .64%
Distinctiveness information               10.16%   21.72%    12.12%       7.58%             .00%
Consistency information                   20.03%   15.76%     5.88%      41.36%           16.04%
Verb category (experimental subjects)      1.02%     .97%      ...         ...              .24%
Verb category (control subjects)          45.03%   57.46%      ...       68.68%           48.31%


The relative importance of the three infor- 
mation variables varied somewhat among at- 
tribution loci. The percentage of total vari- 
ance in the various causal attributions that is 
accounted for by each of the independent 
variables is reported in Table 2. It should be 
noted that the data in this table can be quite 
useful if one wishes to know what kind of in- 
formation is likely to have the greatest im- 
pact on a given causal belief. For example, 
the finding that consistency information ac- 
counts for most of the variance in attribution 
to circumstance suggests that if one wishes to 
produce a circumstantial attribution, the most 
efficient way to do so is to alter the consis- 
tency information input—that is, to provide 
low consistency information. Similarly, the 
provision of high consistency information is 
the best means of eliminating a circumstance 
attribution. What it will be replaced with de- 
pends primarily on distinctiveness information 
—the single informational input that is most 
likely to influence a person or stimulus attri- 
bution. An input of low distinctiveness infor- 
mation will facilitate the former, while an 
input of high distinctiveness information will 
facilitate the latter. 

The finding that consistency information 
accounted for more of the total variance than 
consensus information gives some support to 
a suggestion offered by Kelley (1967): 


It has been postulated . . . that physical reality 
takes precedence over social reality information 
. . . The implication is that the consistency cri- 
teria may be more important to the individual 
than the consensus criterion [p. 207]. 


Kelley further suggests that a partial explana- 
tion for predominance of the consistency cri- 
teria—if such predominance were found—may 
be that consensus information involves further 
attributional tests and is highly dependent on 
attributions regarding the source of this infor- 
mation. 

The finding that distinctiveness information 
accounted for more of the total variance than 
did consensus information is not so easily explained. 
While consistency information may 
be taken at face value, distinctiveness information, 
like consensus, involves further attributional 
tests. Just as the “meaning” of the 
information that many persons make the same 
response to a given stimulus depends on who 
those persons are, so does the “meaning” of 
the information that a given person makes 
the same response to many stimuli depend on 
what those stimuli are. However, the sub- 
jects’ more liberal use of the distinctiveness 
information suggests that they were not as 
concerned about the nature of the stimuli to 
which it pertained as they were about the na- 
ture of the persons to which the consensus in- 
formation pertained. This is consistent with 
Gilson and Abelson’s (1965) finding that 
individuals are more prone to generalize from 
evidence specific to a particular object to 
other objects than they are to generalize from 
evidence specific to a particular subject to 
other subjects.¹¹ Since consensus information 
and subject-specific evidence both pertain to 
persons who are actors, while distinctiveness 
information and object-specific evidence both 


11 “Subject” refers to the subject of the verb, 
that is, the person who produces the response. “Object” 
refers to the object of the verb, that is, the 
stimulus entity toward which the response is 
directed. 

pertain to nonpersons that are recipients of 
action, Gilson and Abelson’s finding and the 
present results may be due either to a differ- 
ence in willingness to generalize across actors 
versus recipients of action or to a difference in 
willingness to generalize across persons versus 
nonpersons. Gilson and Abelson (1965) sug- 
gested that the latter difference may be the 
crucial one: “objects may be seen as much 
more readily interchangeable (than persons) 
[p. 308].” This line of reasoning implies that 
distinctiveness information should be more 
similar in its effectiveness to consensus infor- 
mation if the actor and the recipient of the ac- 
tion (i.e., the “stimulus”) are either both per- 
sons or both nonpersons. (Similarly, subject- 
specific evidence should produce as much 
generalization as object-specific evidence if 
the subject and the object of the verb are both 
persons or both nonpersons.) 

The experimental data do not permit a good 
test of the above prediction since no item 
employed a nonperson as the actor, and only 
three of the items used persons as “stimuli.” 
However, an examination of these is strongly 
suggestive. For the items in which the stimu- 
lus was a person, the effects of distinctiveness 
and consensus information on stimulus attri- 
bution were virtually identical. For both high 
distinctiveness and high consensus, the pro- 
portion of stimulus attribution was .26, while 
for low consensus and low distinctiveness, it 
was .08. For those items in which the stimu- 
lus was a nonperson, the relative effectiveness 
of distinctiveness and consensus information 
was quite different. The proportion of stimulus 
attribution given high distinctiveness was .35, 
while that given low distinctiveness was .03. 
In contrast to this difference of .32, the difference 
between high and low consensus information 
in the proportion of stimulus attribution 
was only .12; the proportion given high con- 
sensus was .19, while that given low consensus 
was .07 (see Table 3). 

TABLE 3 

PROPORTION OF STIMULUS ATTRIBUTION AS A FUNCTION 
OF CONSENSUS AND DISTINCTIVENESS INFORMATION 
AND THE NATURE OF THE STIMULUS OBJECT 

                              Information 
                     Distinctiveness       Consensus 
Stimulus object       High      Low       High      Low 
Person                .26       .08       .26       .08 
Nonperson             .35       .03       .19       .07 


These data not only suggest that consensus 
and distinctiveness information are equally 
powerful when the stimulus is a person, but 
they also suggest that distinctiveness information 
per se is more powerful when the stimulus 
is a nonperson: the difference between 
high and low distinctiveness in the proportion 
of stimulus attribution was .32 when the 
stimulus was a nonperson and only .18 when 
the stimulus was a person. Hence, it seems 
that the differential effectiveness of consensus 
and distinctiveness information that were ob- 
served in the present study—and the related 
difference in generalization across subjects and 
objects observed by Gilson and Abelson 
(1965)—lies not in a reference to actors ver- 
sus recipients of action, but rather in a ref- 
erence to persons versus nonpersons. For some 
reason that is intuitively, but not logically, 
compelling, people are more willing to regard 
a stimulus or group of stimuli as representa- 
tive of an entire class than to so regard a 
person or group of persons. Their relative 
reluctance to generalize across persons is es- 
pecially intriguing in view of the fact that, in 
the present study, consensus information was 
presented in about the strongest possible form. 
“Almost everyone” constituted high consensus 
information, and “hardly anyone else” consti- 
tuted low consensus information. 
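
The high-minus-low differences cited in this comparison (.32 and .12 for nonperson stimuli, .18 for person stimuli) can be recomputed from the Table 3 proportions. The Python sketch below is only an illustration: the dictionary layout and the function name `high_low_difference` are assumptions introduced here and are not part of the original analysis.

```python
# Proportions of stimulus attribution as reported in Table 3.
# The nested-dictionary layout is an illustrative assumption.
TABLE_3 = {
    "person": {
        "distinctiveness": {"high": 0.26, "low": 0.08},
        "consensus": {"high": 0.26, "low": 0.08},
    },
    "nonperson": {
        "distinctiveness": {"high": 0.35, "low": 0.03},
        "consensus": {"high": 0.19, "low": 0.07},
    },
}


def high_low_difference(stimulus, information):
    """Return the high-minus-low difference in stimulus attribution."""
    cell = TABLE_3[stimulus][information]
    # Round to two places, the precision of the reported proportions.
    return round(cell["high"] - cell["low"], 2)


if __name__ == "__main__":
    for stimulus in TABLE_3:
        for information in TABLE_3[stimulus]:
            diff = high_low_difference(stimulus, information)
            print(f"{stimulus:10s} {information:16s} {diff:.2f}")
```

Under these assumptions, the two kinds of information show the same difference for person stimuli, while for nonperson stimuli the distinctiveness difference is the larger of the two, which is the pattern the text describes.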

Although it may be limited to situations in 
which the stimulus is a nonperson, the finding 
that distinctiveness information was a more 
potent determinant of person and stimulus 
attributions than consensus information has 
many interesting implications. For example, 
let us assume that you, a journal editor, have 
rejected a manuscript, and the author has 
concluded that you are a scoundrel (person 
attribution). If you want him to believe in- 
stead that you rejected the manuscript be- 
cause it was of poor quality (stimulus attri- 
bution), the results of this study suggest you 
would do better to point out that you do not 
reject many other manuscripts (high distinc- 
tiveness) than to point out that many other 
editors have rejected this manuscript (high 
consensus). Similarly, let us assume that you 
have accepted a manuscript, and the author 
has concluded that it is of exceptional quality 
(stimulus attribution). If you want him to 
believe instead that you accepted it because 
you are a good samaritan (person attribu- 
tion), the results of this study suggest you 
would do better to point out that you accept 
many other manuscripts (low distinctiveness) 
than to point out that others have not ac- 
cepted this manuscript (low consensus). 

The relative weakness of consensus infor- 
mation in the present investigation raises at 
least two interesting research questions: (a) 
Is the weakness of consensus information a 
product of learning, an effect that would show 
developmental changes? (b) Would consensus 
information be as insignificant a factor in the 
causal attributions made by people for their 
own behavior? 

Intuitively, one might suspect that the an- 
swer to the second question would be “no.” 
An individual who is attempting to determine 
the cause of another person’s behavior already 
has one bit of consensus information—what 
(he thinks) his own behavior would be in that 
situation. Hence, telling him how people other 
than the actor in question responded does not 
add quite as much information as does telling 
him how the actor responds toward other 
entities. In short, distinctiveness information 
is more informative than consensus informa- 
tion. On the other hand, an individual who is 
attempting to determine the cause of his own 
behavior probably has several bits of distinc- 
tiveness information—he knows how he re- 
sponds toward some other entities. Hence, pro- 
viding additional distinctiveness information 
may not have as great an impact as providing 
consensus information about which he may 
have no knowledge at all. In addition to these 
“information-value” differences, there are 
“motivational” factors that might make the 
impact of consensus information on someone 
who is searching for an explanation of his own 
behavior quite different from its impact on 
someone who is attempting to explain another 
person’s behavior. Indeed, inasmuch as all the 
conformity literature has demonstrated that 
consensus information often has a very sub- 
stantial effect on people’s behavior, it would 
be ironic if it did not also have a noteworthy 
effect on people’s explanations for their behavior. 
(See Jones & Nisbett, 1971, for a 
more extensive discussion of differences in 
self- versus other-observation.) 

Although the present study clearly demon- 
strates that individuals can use consensus, 
distinctiveness, and consistency information 
systematically, the methodology that was em- 
ployed does not reveal whether individuals 
who are attempting to find an appropriate 
attribution will use or seek out this informa- 
tion of their own accord. There is, however, 
some evidence that indicates that they will. In 
two studies by McArthur (1969, 1970, Ap- 
pendix D, pp. 130-144), the subjects were not 
explicitly told to utilize the consensus and 
distinctiveness information when they were 
asked for a causal attribution. Rather, this 
information was embedded in a fairly lengthy 
written—or spoken—paragraph. Moreover, 
the distinctiveness information was substan- 
tially weaker than in the present question- 
naire study, and there was no consistency in- 
formation provided. Despite this manner of 
presentation, the results revealed that the in- 
formation was utilized systematically. 


Verb Category Effects: Results 


Person attribution. The analysis of variance 
performed on the frequency of person attribu- 
tions among experimental subjects did not 
yield a significant verb category main effect. 
However, a one-degree-of-freedom test con- 
trasting emotions and opinions with accomp- 
lishments and actions was significant and 
reflected a greater frequency of person attri- 
bution for accomplishments and actions than 
for emotions and opinions (see Figure 5). 
Only .97% of the variance in person attribu- 
tion was accounted for by verb category. 
The analysis of variance performed on the 
frequency of person attributions among no- 
information control subjects did yield a significant 
verb category main effect that reflected 
the following descending order of person 
attribution frequency: accomplishments > 
actions > opinions > emotions. Just as for the 
experimental subjects, a one-degree-of-free- 
dom test contrasting emotions and opinions 
with accomplishments and actions was highly 
significant, while the residual effect was not 
(see Figure 5). Verb category accounted for 
57.46% of the variance in person attribution 
among these no-information controls. 
Stimulus attribution. The analysis of vari- 
ance performed on the frequency of stimulus 
attributions among experimental subjects 
yielded a significant verb category main effect 
that reflected the following descending order 
of stimulus attribution frequency: emotions 
> opinions > accomplishments > actions. Just 
as in the case of person attribution, a one- 
degree-of-freedom test contrasting emotions 
and opinions with accomplishments and ac- 
tions proved highly significant. However, un- 
like person attribution, stimulus attribution 
was more frequent for emotions and opinions 
than for accomplishments and actions (see 
Figure 6). Also, the residual effect was sig- 
nificant. Verb category accounted for 2.85% 
of the total variance in stimulus attribution. 
The analysis of variance performed on the 
frequency of stimulus attribution among no- 
information control subjects also yielded a 
significant verb category main effect that reflected 
the following descending order of stimulus 
attribution frequency: opinions > emotions 
> accomplishments > actions. Just as 
for the experimental subjects, a one-degree-of- 
freedom test contrasting emotions and opinions 
with accomplishments and actions was 
highly significant (see Figure 6). The residual 
effect was not significant. Verb category accounted 
for 51.02% of the total variance in 
stimulus attribution among the controls. 


FIG. 6. Proportion of total attributions made to 
the stimulus among experimental and control subjects 
as a function of verb category: emotions and 
opinions versus accomplishments and actions. 

Circumstance and person-stimulus attribu- 
tions. The analyses of variance performed on 
the frequency of circumstance and person- 
stimulus attributions among experimental sub- 
jects yielded no significant verb category main 
effects or second-order interactions. 

The analysis of variance performed on the 
frequency of circumstance attribution among 
no-information control subjects did yield a 
significant verb category main effect that re- 
flected the following descending order of cir- 
cumstance attribution frequency: actions > 
accomplishments > opinions > emotions. A 
one-degree-of-freedom test contrasting emo- 
tions and opinions with accomplishments and 
actions revealed that the latter yielded signifi- 
cantly more circumstance attribution than the 
former. Verb category accounted for 68.68% 
of the variance in circumstance attribution 
among control subjects. 

Although the analysis of variance performed 
on the frequency of person-stimulus attribu- 
tion among no-information control subjects 
also yielded a significant verb category main 
effect, the data were not grouped into accomplishments 
and actions versus emotions 
and opinions as they were for the other attri- 
bution loci. Rather, the following descending 
order of person-stimulus attribution frequency 
was obtained: emotions > accomplishments > 
actions > opinions. Verb category accounted 
for 48.31% of the variance in person-stimu- 
lus attribution among control subjects. 


Verb Category Effects: Discussion 


The present investigation clearly demon- 
strated that the nature of the response affects 
its causal attribution. Moreover, the verb 
category effects that were obtained fall into a 
remarkably consistent pattern. Accomplish- 
ments and actions repeatedly produced similar 
effects, and these effects differed from those produced 
by emotions and opinions; for both experimental 
and control subjects, accomplishments 
and actions yielded significantly more 
person attributions and significantly fewer stimulus 
attributions than did emotions and opinions. 
This empirical division of verb types 
falls neatly into a categorization of verbs first 
proposed by Gilson and Abelson (1965). The 
accomplishments and actions generally rep- 
resent acts performed by the subject that are 
“directly observable and relatively delimited 
in time,” and may therefore be classified as 
“manifest” verbs. The emotions and opinions, 
on the other hand, generally represent mental 
states of the subjects that are “relatively en- 
during and not directly observable,” and may 
therefore be classified as “subjective” verbs 
(Kanouse & Abelson, 1967, p. 159). 

In addition to their isomorphism with a 
more or less well-established manifest-sub- 
jective verb distinction, the verb category ef- 
fects obtained in the present study do make 
intuitive sense. As mentioned earlier, Heider 
(1958) suggested that there is a tendency to 
attribute enjoyment to the object rather than 
to the person. This suggestion is confirmed— 
at least relative to other types of responses— 
and may be generalized to other emotions as 
well. It also appears to apply to opinions. 
Emotions and opinions are commonly re- 
garded as being elicited by stimuli, as opposed 
to being emitted by persons. An opinion is 
necessarily of something, and, by this token 
alone, it is caused at least in part by the 
properties of that stimulus. Similarly, with 
the exception perhaps of psychotic reactions, 
it is difficult to conceive of any emotion that 
originates within the person and is unprovoked 
by an outside force. One is afraid of, pleased 
with, angered by, sad over some thing. The 
emotion necessitates the inclusion of an ob- 
ject—that is, an external cause—to be mean- 
ingful. This argument, and the data, tie in 
nicely with Schachter’s (1964) theory of emo- 
tions. His assertion that internal physiological 
arousal is not sufficient to induce emotions; 
that, in order to experience a particular emotion, 
an individual must “label” his arousal in 
terms of cognitions available to him from ex- 
ternal stimuli, is quite consistent with the 
present study’s finding that people believe 
emotions to be caused by external stimuli. 

Unlike emotions and opinions, accomplishments 
and actions are at least as easily conceived 
of as being emitted by persons as being 
elicited by stimuli. As Heider (1958) 
pointed out, the dispositional property “can” 
or “able” is a joint function of the power 
or ability of the person and the difficulty of 
environmental factors. He further noted that 
an effect (i.e., an accomplishment) may be 
attributed to the ability of the person, the 
difficulty of environmental factors, or both. 
Jones et al. (1968) demonstrated that, in 
point of fact, another person’s accomplishment 
tends to be attributed to his ability 
(the person) rather than to the task difficulty 
(the stimulus). This is consistent with the results 
of the present study in which observers 
tended to attribute accomplishments to the 
person more than to the stimulus. Like ac- 
complishments, actions can also be readily 
attributed to the person. Indeed, for some 
actions, stimulus attribution is almost incon- 
ceivable. Unfortunately, this may have been 
more true of the action items employed in this 
study than is typical since they were all 
instances of “activism” (doing something 
about auto safety), and two were quite extreme. 
But, even when one considers very 
mundane acts such as washing dishes or 
crossing the street, it becomes apparent that 
these manifest behaviors are most easily con- 
ceived of as being “emitted” by the person; 
intuitively, they do not seem to be “elicited” 
by a stimulus as the corresponding opinions 
“The dishes are dirty” and “The street is 
safe for crossing” appear to be. Cohen (1969) 
provides more substantial evidence that the 
observed tendency for actions to be attributed 
to the person rather than to the stimulus is 
not limited to the biased sample of actions 
used in the present study. Using a sample of 
64 verbs—32 manifest and 32 subjective—he 
found the mean person attribution to be sig- 
nificantly greater for manifest than subjective 
verbs, while the mean stimulus attribution was 
significantly greater for subjective than for 
manifest verbs. Similarly, Paquette (see Foot- 
note 9), using still another sample of 64 verbs, 
found that the proportion of total attributions 
made to the person was greater for manifest 
than subjective verbs (.72 versus .62), while 
the proportion of total attributions made to 
the stimulus was greater for subjective than 
for manifest verbs (.25 versus .17). 

To say that a subjective behavior is readily 
perceived as being elicited by some stimulus 
entity is to say that that stimulus is perceived 
to be a sufficient condition for the behavior. 
This suggests that one would expect all people 
to respond similarly to the stimulus; and, in- 
deed, this seems to be the case. Despite the 
adage “everyone is entitled to his own opin- 
ion,” people tend to regard opinions as ap- 
propriate or inappropriate, correct or incor- 
rect. When someone holds an opinion different 
from our own, we try to convince him that he 
is mistaken, that the stimulus calls for our 
belief and not his. Similarly, when someone 
experiences an emotion different from our 
own, we regard it as inappropriate and may 
try to “talk him out of it” by pointing out 
that the stimulus situation does not call for 
such an emotion. Such behavior suggests that 
our need for a stable definition of reality is 
threatened when other people’s emotions or 
opinions differ from our own (see Festinger, 
1954). We firmly believe that consensual 
validation of our subjective responses should 
be the rule because we believe these responses 
to be caused by external stimuli. 

Accomplishments and actions, that is, mani- 
fest responses, are regarded quite differently. 
To say that a manifest behavior is not readily 
perceived as being elicited by some stimulus 
entity is to say that the stimulus is not per- 
ceived to be a sufficient condition for the behavior. 
This suggests that one would not expect 
all people to respond similarly to the 
stimulus. This appears to be the case. We have 
no quarrel with an individual whose perform- 
ance is different from our own: there is a 
question of better or worse performance, but 
rarely of correct or incorrect performance. 
Similarly, with the exception of very deviant 
or counternormative behavior, one’s need for 
a stable definition of reality is generally not 
threatened when other people’s actions differ 
from our own. We do not expect consensual 
validation of our manifest responses because 
we do not believe these responses to be caused 
by external stimuli. “Do your own thing” is, 
by this token, a more honest overture than 
“You’re entitled to your own opinion.” 




Naturally, research is needed to confirm the 
argument that more consensual validation is 
assumed for subjective than for manifest responses. 
But, given that one finds evidence for 
this assertion, the question may be raised as 
to why this occurs. Some interesting data 
pertinent to this question were obtained in a 
ministudy by McArthur.¹² A random sample 
of 20 pages drawn from Roget’s International 
Thesaurus contained significantly more manifest 
than subjective verbs (238 versus 75; χ² 
= 84.89, p < .01).¹³ Hence, if there do exist 
real differences in expectancy for consensual 
validation of manifest and subjective re- 
sponses, they may well be due to the fact that 
there are significantly more possible manifest 
than subjective responses: holding all else 
constant, it logically follows from this simple 
difference in numbers that any given subjec- 
tive response has a higher probability of con- 
sensual validation than does any given mani- 
fest response. The following postulate is sug- 
gested: All other things being equal, when the 
set of alternative responses is small, then the 
response that is made has high probability; 
when the set of alternative responses is large, 
then the response that is made has low prob- 
ability. A response characterized by high prob- 
ability will tend to be attributed to the stimu- 
lus, while a response characterized by low 
probability will tend to be attributed to the 
person who makes it or to particular circumstances.¹⁴ 
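
The thesaurus count reported above (238 manifest versus 75 subjective verbs) can be checked with a one-degree-of-freedom chi-square test. The Python sketch below assumes the comparison was made against an equal split between the two verb types; the source does not state the null hypothesis, so this is an assumption, and the function names are introduced here for illustration. The second function illustrates the postulate under the further simplifying assumption that all alternative responses are equally likely.

```python
def chi_square_equal_split(counts):
    """Goodness-of-fit chi-square against equal expected frequencies."""
    total = sum(counts)
    expected = total / len(counts)  # assumed null: equal split
    return sum((observed - expected) ** 2 / expected for observed in counts)


def uniform_response_probability(n_alternatives):
    """Probability of any one response if all alternatives are equally likely."""
    return 1.0 / n_alternatives


MANIFEST_VERBS = 238    # count reported in the text
SUBJECTIVE_VERBS = 75   # count reported in the text
CRITICAL_01_DF1 = 6.635  # chi-square critical value, df = 1, alpha = .01

if __name__ == "__main__":
    chi2 = chi_square_equal_split([MANIFEST_VERBS, SUBJECTIVE_VERBS])
    print(f"chi-square = {chi2:.3f}, exceeds critical value: {chi2 > CRITICAL_01_DF1}")
    print(f"P(given subjective response) = {uniform_response_probability(SUBJECTIVE_VERBS):.4f}")
    print(f"P(given manifest response)   = {uniform_response_probability(MANIFEST_VERBS):.4f}")
```

Under the equal-split assumption the statistic comes out within rounding of the reported value of 84.89, and the smaller set of subjective verbs gives each subjective response the higher probability, as the postulate requires.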

Although these predictions seem intuitively 
compelling, it should be noted that they are 


12 L. A. McArthur, Unpublished research, 1970. 

13 This particular comparison may inflate the dif- 
ference in the frequency of manifest versus subjective 
responses, since predicate adjectives, which often 
communicate subjective responses (e.g., He is upset 
about the event), are not included. 

14 The suggestion that the probability of a re- 
sponse is a determinant of its causal attribution was 
first made by Jones and Davis (1965). These au- 
thors proposed that a behavior that has low proba- 
bility (either because the actor has high choice, 
there are many alternative responses, or because the 
act has low social desirability) will be attributed to 
the person, while a behavior that has 
high probability will not. The present postulate is 
virtually identical, although, unlike the Jones and 
Davis prediction, it makes an assertion about stimulus 
attribution: if a response has relatively high prob- 
ability—because there are relatively few alternative 
responses—it will be attributed to characteristics of 
the stimulus. 

not logical. A high probability response is not 
only very likely to be produced by many 
people (high consensus), but it is also very 
likely to occur in the presence of many stimuli 
(low distinctiveness). Hence, high probability 
“logically” implies both high consensus and 
low distinctiveness. Although the former tends 
to produce stimulus attribution, the latter 
tends to produce person attribution, and, for 
this reason, the prediction that a highly prob- 
able response will tend to be attributed to the 
stimulus is illogical. Similar reasoning demon- 
strates that the prediction that a low proba- 
bility response will tend to be attributed to 
the person is also illogical. However, the argu- 
ment can be made that response probability 
psychologically implies consensus information 
more strongly than it does distinctiveness information, 
in which case the predictions would 
follow.¹⁵ 


EXPECTANCIES FOR BEHAVIORAL 
GENERALIZATION 


Information Effects: Results 


The analysis of variance performed on the 
subjects’ expectancies for response and stimu- 
lus generalization indicated that more overall 
generalization was expected under low than 
high distinctiveness and under high than low 
consistency. Consensus information did not 
have a significant effect on expectancy for 
overall generalization. 


15 When one contemplates the likelihood that some 
response will occur, one tends to think of the prob- 
ability of any given person producing that response; 
one does not generally consider the probability of 
any given stimulus receiving that response. For ex- 
ample, in order to arrive at a subjective estimate of 
the probability of the event, “children eat sweets,” 
one tends to think of instances of children rather 
than instances of sweets. Similarly, if one is told 
that the event “children eat sweets” is very probable, 
one tends to infer that many children eat sweets 
rather than inferring that children eat many sweets. 
If this phenomenon is general, then the probability 
of a given response “psycho-logically” implies con- 
sensus information more strongly than it does dis- 
tinctiveness information—at least when the former 
pertains to persons and the latter pertains to non- 
persons. This is consistent with the finding that 
individuals tend to generalize more across nonperson 
stimuli than across persons. “A sweet is a sweet, but 
a child by any other name might not eat.” Probabil- 
ity information is therefore assumed to refer to chil- 
dren rather than to sweets. 

A significant Consensus × Generalization 
interaction reflected a greater expectancy for 
the actor to make other responses to the same 
stimulus (response generalization) under high 
consensus than for him to make the same 
response to other stimuli (stimulus generali- 
zation), while the reverse occurred under low 
consensus. Although comparisons between the 
means revealed that the difference in expec- 
tancy for response and stimulus generalization 
was not significant under low consensus (p > 
.10), the obtained interaction does support 
the prediction that those independent varia- 
bles that tend to produce stimulus attribution 
will also produce a greater expectancy for 
response than stimulus generalization, while 
those which tend to produce person attribu- 
tion will produce a greater expectancy for 
stimulus than response generalization. 

Also supporting this prediction was a significant 
Distinctiveness × Generalization interaction 
that reflected a greater expectancy 
for the actor to make other responses to the 
same stimulus (response generalization) under 
high distinctiveness than for him to make 
the same response to other stimuli (stimulus 
generalization), while the reverse occurred 
under low distinctiveness. The difference in 
expectancy for response and stimulus generali- 
zation was significant under both high and low 
distinctiveness information (p < .002 in both 
cases). However, it should be noted that the 
effect of distinctiveness information on ex- 
pectancy for stimulus generalization is rela- 
tively trivial inasmuch as this information was 
essentially a statement about the actor’s be- 
havior vis-a-vis certain other stimuli. 

Analyses within generalization categories 
revealed that low consensus, low distinctive- 
ness, and high consistency each produced a 
significantly greater expectancy for the actor 
to make the same response to other stimuli 
(stimulus generalization) than did high consensus, 
high distinctiveness, and low consistency, 
respectively. On the other hand, high 
distinctiveness and high consistency each 
produced a significantly greater expectancy 
for the actor to make other responses to the 
same stimulus (response generalization) than 
did low distinctiveness and low consistency, 
respectively. Consensus information did not 
have a significant effect on expectancy for 
response generalization. (The mean expectancy 
for response and stimulus generalization 
obtained for each of the information configurations 
is presented in Table 4.) 


TABLE 4 

MEAN EXPECTANCY FOR RESPONSE AND STIMULUS GENERALIZATION AS A FUNCTION OF 
CONSENSUS, DISTINCTIVENESS, AND CONSISTENCY INFORMATION 

Consensus:          High    High    High    High    Low     Low     Low           Low 
Distinctiveness:    High    High    Low     Low     High    High    Low           Low 
Consistency:        High    Low     High    Low     High    Low     High          Low 

Behavioral generalization 
Response            6.44    4.93    5.52    4.47    6.20    4.43    5.23          4.30 
Stimulus            3.56    3.09    6.65    6.21    3.84    3.61    [illegible]   6.44 

The percentage of total variance in expect- 
ancy for behavioral generalization that is 
accounted for by each of the independent 
variables is presented in Table 5. Distinctiveness
information accounted for 63.40% of
the variance in expectancy for the actor to
make the same response to other stimuli
(stimulus generalization), while consistency
and consensus information accounted for only
1.31% and .93%, respectively. In contrast
to this, distinctiveness information accounted
for only 2.63% of the variance in expectancy
for the actor to make other responses to
the same stimulus (response generalization),
while consistency information accounted for
13.62%. Consensus information again accounted
for only a small part of the variance
(.36%).


Information Effects: Discussion 


The predictions made regarding expectancy 
for stimulus and response generalization are 
clearly supported by the results of this study. 
Low consensus and low distinctiveness information
each produced a greater expectancy
for the person to make the same response to
other stimuli than for him to make other
responses to the same stimulus (i.e., the
expectancy for stimulus generalization was 
greater than the expectancy for response gen- 
eralization). Moreover, the expectancy for 
stimulus generalization per se was greater 
given low consensus or low distinctiveness 
than given high consensus or high distinctiveness.
As mentioned above, it is quite logical
that distinctiveness information would have
this effect. But the tendency for low consensus
to produce a greater expectancy for
stimulus generalization than high consensus 
cannot be so simply explained. Whether or 
not other people exhibit a response has no 
logical implications for whether or not the 
actor will exhibit the same response to other 
stimuli. The fact that no one but me likes 
Martians does not, in and of itself, make me 
any more likely to like Venutians than I 
would be if everyone liked Martians. A two- 
step process seems necessary to explain the 
effect of consensus information on expectancy 
for stimulus generalization: (a) Low consen- 
sus suggests that the cause of the response 
is located in the actor. (b) The cause being
located in the actor means that he is “an 
emittor of that response,” which suggests that 
he will emit it in the presence of other 
stimuli as well. 

TABLE 5

PERCENTAGE OF THE TOTAL VARIANCE IN EXPECTANCY
FOR BEHAVIORAL GENERALIZATION ACCOUNTED FOR
BY EACH OF THE INDEPENDENT VARIABLES

                                             Behavioral generalization
Variable                                     Response      Stimulus

Consensus information                          .36%          .93%
Distinctiveness information                   2.63%        63.40%
Consistency information                      13.62%         1.31%
Verb category (experimental subjects)        17.43%           0%
Verb category (control subjects)             54.06%           0%

High consensus and high distinctiveness
information each produced a significantly
greater expectancy for the person to make
other responses to the same stimulus than for
him to make the same response to other stimuli
(i.e., the expectancy for response generalization
was greater than the expectancy for
stimulus generalization). In addition, the ex- 
pectancy for response generalization per se 
was greater with high than with low distinc- 
tiveness information. Like the effect of con- 
sensus information on expectancy for stimu- 
lus generalization, this effect is not strictly 
logical. Whether or not the actor also makes 
the same response to other stimuli (distinc- 
tiveness information) has no direct implica- 
tions for whether or not he will make other 
responses to a particular stimulus. The fact 
that I like Martians, but do not like almost 
any other outerspacemen, does not, in and of 
itself, make me any more likely to kiss 
Martians than I would be if I also liked most 
other outerspacemen. Again, a two-step process
seems necessary to explain the results:
(a) High distinctiveness suggests that the 
cause of the response is in the stimulus. (b) 
The cause of the response being located in 
the stimulus means that it is “an elicitor of 
the response,” which suggests that it may 
elicit other related responses as well.

Although it was not explicitly predicted,
the finding that high as compared with low 
consistency information yielded a greater 
expectancy for both stimulus and response 
generalization is certainly reasonable. Overall 
expectancy for generalization should be lower 
when a response is not reliable—low consist- 
ency—since such behavior cannot logically be 
used as a valid predictor of other responses. 


Verb Category Effects: Results and 
Discussion 


The analysis of variance performed on the 
experimental subjects’ expectancies for re- 
sponse and stimulus generalization did not 
yield a significant verb category main effect. 
Neither did the analysis of variance per- 
formed on expectancies for response and 
stimulus generalization among no-information 
controls.

While there was not significantly more 
overall generalization expected for one verb 
category than another, there was a significant 
Verb Category X Generalization interaction 
for both experimental and control subjects. 
These interactions revealed that for emotions 
and opinions there was a greater expectancy 
for the actor to make other responses to the
same stimulus (response generalization) than 
for him to make the same response to other 
stimuli (stimulus generalization), while the 
reverse was true for accomplishments and 
actions. Like the Information X Generaliza- 
tion interactions, this interaction supports the 
prediction that those independent variables 
that tend to produce stimulus attribution will 
also produce a greater expectancy for response 
than for stimulus generalization, while the 
reverse will occur for those variables that 
tend to produce person attribution. 

Analyses within generalization categories 
revealed a significant verb-category main ef- 
fect on expectancy for response generalization 
among both experimental and no-information 
control subjects. These simple effects reflected 
the following descending order of expectancies 
for the actor to make other responses to 
the same stimulus: emotions > opinions > ac- 
tions > accomplishments. One-degree-of-free- 
dom comparisons contrasting emotions and 
opinions with accomplishments and actions
revealed that the former produced a significantly
greater expectancy for response generalization
than the latter among both experimental
and control subjects. The residual
effect was not significant in either case. Expectancy
for stimulus generalization was not
significantly affected by verb category among 
experimental or control subjects. 

Expectancies for behavioral generalization 
were clearly affected by verb category as well 
as by the information variables. Emotions and
opinions produced a greater expectancy for
the person to make other responses to the same
stimulus than for him to make the same response
to other stimuli. Like some of the information
effects, these verb category effects
may be most easily explained in terms of a
two-step process: (a) emotions and opinions
produce attribution to the stimulus which, in
turn, yields (b) a greater expectancy for response
than stimulus generalization; and (a)
accomplishments and actions produce attribution
to the person which, in turn, yields
(b) a greater expectancy for stimulus than
response generalization.

Whatever the explanation for these verb 
category effects, they are quite provocative.
To give a very speculative example, they
suggest that if you give someone—for ex- 
ample, a child—many nice things (manifest 
response), his expectancy for you to also give 
nice things to other people (stimulus general- 
ization) will be greater than his expectancy 
for you to praise him or to protect him 
(response generalization). But, if you love 
someone (subjective response), his expect- 
ancy for you to praise him or to protect him 
(response generalization) will be greater than 
his expectancy for you to also love other 
people (stimulus generalization). If this ap- 
plication were accurate, then it might not be 
surprising that the “poor little rich kid” feels 
deprived. Not only is he given no love, but, 
moreover, the material goods that he is given 


have little meaning: he expects that his 


parents would give these things to many other 
people—or at least that they would be more 
likely to do that than to make other positive 
responses to him. The “rich little poor kid,” 
on the other hand, not only has love, but, 
moreover, he feels that he is special in this 
regard: he expects that his parents would not 
give their love to many others—or at least 
that they would be less likely to do that than 
to make other positive responses to him.

Although the finding that distinctiveness
information accounted for most of the vari- 
ance in expectancy for stimulus generaliza- 
tion is not particularly exciting, the finding 
that verb category accounted for most of the 
variance in expectancy for response general- 
ization is quite fascinating (see Table 5). 
Having observed someone’s behavior, one’s 
expectancy for him to make other responses 
to the same stimulus depends primarily on 
the nature of the initial behavior; this expect- 
ancy is significantly greater when the initial 
response is subjective (for example, an emotion
or an opinion) than when it is manifest.¹⁶
To take the example cited above, this sug- 
gests that a child’s expectation that he will 
receive protection or praise will be greater 


16 It should be noted that all of the response gen-
eralization items involved manifest responses. Hence, 
a more conservative statement of the finding would 
be that one’s expectancy for other manifest responses 
to be directed to the same stimulus is greater when 
the initial response is subjective than when it is mani- 
fest. 
if you love him (subjective response) than
if you give him many nice things (manifest 
response). Or, to give another example, if you 
pity someone who is in trouble (subjective 
response), his expectation that you will also 
protect and support him (response general- 
ization) will be greater than it would be if 
you helped him (manifest response). The 
moral of this story for the bystander who is 
wary of commitment to future benevolence
is: Don’t just pity someone. Help him! 

It should be noted that the specific manifest 
and subjective verbs used in the above illustra- 
tions were not contained in the experimental 
questionnaire. However, since Cohen (1969) 
and Paquette (see Footnote 9) both demon- 
strated that the manifest versus subjective 
verb attribution effects obtained in this study 
generalize to other manifest and subjective 
verbs, it seems warranted to generalize the ob- 
tained expectancy for generalization effects to 
other manifest and subjective verbs. Never- 
theless, there is a need to explicitly test the 
predictions in these examples by asking the 
target person of a response to give his expect- 
ancy for the actor to show response and stim- 
ulus generalization, since the possibility exists 
that target-person observers and “any old” 
observers do not respond in a like fashion. 


RELATIONSHIP BETWEEN ATTRIBUTIONS AND 
EXPECTANCIES FOR BEHAVIORAL 
GENERALIZATION 


It was found that, as predicted, person 
attribution and expectancy for stimulus gen- 
eralization were similarly affected by the in- 
formation variables, as were stimulus attribu- 
tion and expectancy for response generaliza- 
tion (see Figure 7). High consensus and high 
distinctiveness information each produced 
more stimulus attribution and a greater ex- 
pectancy for response generalization than did 
low consensus and low distinctiveness infor- 
mation, respectively. Conversely, low con- 
sensus and low distinctiveness each produced 
more person attribution and a greater expect- 
ancy for stimulus generalization than did high 
consensus and high distinctiveness information. 

Not only did the information variables have 
these predicted parallel effects on attributions 
and expectancies for generalization, but there 
also were striking similarities in the effects of
verb category on attributions and expected 
generalization. Manifest verbs—accomplish- 
ments and actions—produced both more per- 
son attribution and a greater expectancy for 
stimulus generalization than did subjective 
verbs. Conversely, subjective verbs—emo- 
tions and opinions—produced both more 
stimulus attribution and a greater expectancy 
for response generalization than did manifest 
verbs (see Figure 7). 

These parallel findings suggest that causal 
attribution serves as an intervening variable 
that mediates expectancies for behavioral gen- 
eralization. However, the overall correlation 
between stimulus attribution and expectancy 
for response generalization was only .20 for 
experimental subjects and .10 for no-information
controls, and the correlation between
person attribution and expectancy for stimulus
generalization was only .28 for experimental
subjects and .25 for controls. As would
be expected, negative correlations obtained
between person attribution and expectancy for
response generalization (r = -.05 for experimental
subjects, and r = -.16 for controls)
and between stimulus attribution and expectancy
for stimulus generalization (r = -.20
for experimental subjects, and r = -.12 for
controls). Although almost all of these cor- 
relations were significant at the .05 level or 
better, this was largely due to the many de- 
grees of freedom, and the size of the correla- 
tions is far from impressive. 

In view of the relatively low correlations 
between intervening attributions and expectancies
for behavioral generalization, the relationship
between the independent variables
and the experimental subjects’ expectancies
for generalization was investigated. For computational
convenience, only one attributional
index (person attribution minus stimulus at- 
tribution) and one generalization index (ex- 
pectancy for stimulus generalization minus 
expectancy for response generalization) were 
utilized. A covariance analysis performed on 
these two difference scores revealed a very 
high correlation between them as a function 
of information (r = .87, df = 6, p < .01) and 
a correlation of only .16 (df = 44, p < .10) 
when both independent variables (information 
and verb category) were partialed out. 

The finding that there was a stronger cor- 
relation between the independent variables 
and expectancies for generalization than between
the intervening attribution variables
and these dependent measures may indicate
that attributions do not mediate generalization
expectancies; that information and
verb category each affect both the causal attribution
and the expectancy for behavioral
generalization. On the other hand, these differences
in the magnitude of the correlation
may simply reflect the extremely small be- 
tween-subject variance within conditions that 
imposes limitations on the correlation as well 
as the loss of precision when one correlates 
relatively fuzzy intervening variables with a 
dependent measure. This likelihood, together 
with the fact that a two-step process seems 
logically necessary to explain some of the in- 
formation effects on expected generalization 
and all of the verb category effects, certainly 
leaves open the possibility that causal attribu- 
tions do indeed mediate expectancies for 
behavioral generalization. Although further 
research is needed to specifically test the 
mediation hypothesis, the present study has 
clearly contributed to our understanding of 
people’s beliefs about the causes of another
person’s behavior: how they decide why
someone behaved as he did and what expectations
for his future behavior accompany these
causal beliefs.


BEHAVIOR AS A FUNCTION OF ATTITUDE-TOWARD-OBJECT
AND ATTITUDE-TOWARD-SITUATION ¹


MILTON ROKEACH ² AND PETER KLIEJUNAS

The relationship between attitudes and behavior was examined in the context
of Rokeach’s two-attitude theory of behavior. Data from 81 subjects provided
correlations between behavior and three attitudinal measures: (a) Ao, attitude
toward object, (b) As, attitude toward the situation in which the object is
encountered, and (c) AoAs, a weighted average of Ao and As that was derived
from an adaptation of Rokeach and Rothman’s belief congruence model of
cognitive interaction. Supporting all hypotheses, the results showed AoAs to be
a consistently better predictor of behavior than either Ao or As and as good a
predictor, or better, than the multiple correlation. The results illustrate the
importance of attitude-toward-situation as well as attitude-toward-object as
determinants of behavior and suggest that verbally expressed attitudes can
predict overt behavior to a considerably better extent than is generally believed.


The concept of attitude is central to social 
psychology and to personality theory because
it is generally regarded to be an important
determinant and predictor of behavior. However,
those studies that have systematically
examined the relationship between attitudes 
and behavior have generally found a lack of 
correspondence or, at best, a low correspon- 
dence between verbally expressed attitudes 
and overt behavior. 

The purpose of this study is to examine the 
relationship between attitudes and behavior in 
the context of Rokeach’s (1968a, 1968b) 
conception of attitude, in which the distinc- 
tion between attitude-toward-object and atti- 
tude-toward-situation is a crucial one. When 
these two kinds of attitudes are taken into 
account, an individual’s behavior should be 
predicted to an extent that is considerably 
better than is generally believed to be the case 
by social psychologists. 

LaPiere’s (1934) classic study is perhaps 
the one most often cited as a striking example 
of a discrepancy between expressed attitudes 
and actual behavior. Traveling extensively 
throughout the United States with a young 
Chinese couple, LaPiere had occasion to stop 
at many motels, hotels, and restaurants. They 
had been refused service only once in 250 instances,
stretching across 10,000 miles of motor
travel. In a follow-up study, LaPiere
mailed questionnaires to the proprietors of
establishments previously visited. In response
to the question “Will you accept members of
the Chinese race as guests in your establishment?”
approximately 92% answered “No.”
They obtained almost identical results from a
control group of proprietors of comparable
establishments that had not been visited.
Similar negative findings often cited are those
of DeFleur and Westie (1958) and of Kutner,
Wilkens, and Yarrow (1952). More recent
reports of discrepancies between verbally expressed
attitudes and behavior include those
of Fendrich (1967), Fishman (1961), and
Linn (1965). For a recent review of the
literature, see Wicker (1969).

1 This research was supported by a grant from the
National Science Foundation to the senior author.

2 Requests for reprints should be sent to Milton
Rokeach, Department of Psychology, University of
Western Ontario, London 72, Ontario, Canada.

When discrepancies are found between 
verbally expressed attitudes and overt be- 
havior, variables other than attitudes are in- 
voked to explain them. These variables al- 
most always turn out to be some situational 
condition such as “situational thresholds”
(Campbell, 1963), “social involvement” (DeFleur
& Westie, 1958), “commitment”
(Fendrich, 1967), or “the nature of the environment”
(Weissberg, 1965).
Lohman and Reitzes (1954), for example,
have reported differences in discrimination as
a function of situational differences. White
residents in an urban environment all belonged
to a property owners’ association that
was actively resisting Negro penetration.
These same subjects, however, also belonged
to a labor union that had a well-implemented
policy of granting Negroes job equality. Similarly,
Minard (1952) has observed below-the-ground
integration and above-the-ground segregation
between white and black coal miners
in the Pocahontas coal field.


Many other writers have emphasized the
importance of situational variables in influencing
behavior (e.g., Gordon, 1952; Lewin,
1951; Orne, 1962; Pettigrew, 1961; Rotter,
1955). Some have concluded that characteristics
of the situation are so overwhelming as
determinants of behavior toward an attitude
object that little or no direct relationship between
verbally expressed attitudes and overt
behavior should be expected. Some have even
gone so far as to question the usefulness of
the attitude concept. Doob (1947) views it
as having no systematic status as a scientific
construct; Blumer (1955) sees it as useless
in analyzing social action because it is too
vague and because it lacks an empirical
reference.


Rather than abandoning the attitude concept
or the assumption that attitudes determine
behavior, Rokeach (1968a, 1968b) has
argued that the disappointing empirical findings
merely suggest a need for a clearer conceptualization
of the attitude concept and of
the relationship between attitudes and behavior.
More specifically, he has proposed that
behavior-with-respect-to-an-object-within-a-situation
(Bos) is always a function (f) of
at least two interacting attitudes: attitude-toward-object
(Ao) and attitude-toward-situation
(As):

$$B_{os} = f(A_o A_s) \tag{1}$$

Most investigators who have been concerned
with the functional relation between attitudes
and behavior have typically focused their attention
and measurement procedures on Ao,
across situations, and have typically neglected
As, across objects, expecting that Bos = f(Ao).
This neglect represents a failure to appreciate
that an individual always encounters an attitude
object in the context of some situation
(event, activity, or happening), about which
he also has an attitude. Whenever an attitude
toward a particular object is activated,
this Ao will not necessarily be manifested or
expressed in behavior in the same way. Its
manifestation will vary as the activated As
will vary. If one focuses only on Ao as a predictor
of behavior while ignoring the equally
relevant As, as has been the case in countless
studies since LaPiere (1934), one is almost
guaranteed to observe inconsistencies between
an individual’s expressed attitude and his
behavior.

The attitude object is the figure, and the 
situation within which the attitude object is 
encountered is the ground. The relationship
between the two—object and situation—is an 
interacting relationship between a particular 
figure and a particular ground, and it is this 
interaction that will determine the behavior- 
with-respect-to-a-particular-object-within-a- 
particular-situation. Because of this interact- 
ing figure-ground relationship, it is not suf- 
ficient to assert that behavior is merely a 
function of two attitudes, Ao and As. It is also
necessary to recognize that the psychological
representations of the object and the situation,
Ao and As, will cognitively interact with one
another and may exert differential influence in
determining Bos. In other words, Bos may be
differentially influenced by the two attitudes,
Ao and As, depending on their relative importance
with respect to one another. In order to
maximize prediction of behavior, therefore, we 
must not only take into account the two types 
of attitudes but also the nature of the cogni- 
tive interaction between them. This requires a 
theory or model of cognitive interaction. 

The belief congruence model of cognitive 
interaction (Rokeach & Rothman, 1965) can 
readily be adapted to describe how the two 
attitudes, Ao and As, will interact with one
another to determine behavior. Whenever a
person encounters an object within a situation,
two attitudes, Ao and As, are activated, and the
person will compare the two attitudes for their
relative importance with respect to one another.
The evaluation of the total figure-ground
configuration, AoAs, is postulated to
be an average of the evaluations of Ao and As
considered separately, weighted by the perceived
importance of Ao and As relative to
one another within the context AoAs. Thus,

$$A_o A_s = (w)A_o + (1 - w)A_s \tag{2}$$

where AoAs refers to attitude-toward-object
interacting with attitude-toward-situation, and
(w) and (1 - w) refer to the perceived importance
of Ao and As with respect to one another
in a specified figure-ground context, that is, a
specified object in a specified situation. Since
Bos is postulated to be a function of AoAs
(Formula 1), Formula 2 may be rewritten as

$$B_{os} = f(A_o A_s) = (w)A_o + (1 - w)A_s \tag{3}$$


The following hypotheses seem to follow 
from the preceding: 

1. The two attitudes, Ao and As, are independent
of one another.

2. Each of the two attitudes, Ao or As, will
reliably predict behavior to at least some
extent.

3. Behavior will be more accurately predicted
if both attitudes, Ao and As, are taken
into consideration.

4. Behavior will be even more accurately
predicted if the two attitudes, Ao and As, are
weighted for their perceived importance with
respect to one another.
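
As an illustration only, the four hypotheses can be restated as five statistics computed on the same set of subjects: r(Ao, As), r(Bos, Ao), r(Bos, As), the multiple correlation of Bos on Ao and As, and r(Bos, AoAs). The Python sketch below shows one way these might be computed. The ratings are hypothetical values invented for the example, not data from the study, and the function names `pearson_r` and `multiple_r` are assumptions of this sketch. The multiple correlation uses the standard two-predictor formula based on the three pairwise correlations.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def multiple_r(r_y1, r_y2, r_12):
    """Multiple correlation of y with two predictors, from pairwise r values."""
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return sqrt(r_squared)


# Hypothetical ratings for six students (illustrative only, not study data).
a_o = [3, 7, 5, 8, 2, 6]                # attitude toward professor, 1-9 scale
a_s = [6, 4, 8, 7, 3, 5]                # attitude toward attending class, 1-9 scale
w = [0.2, 0.5, 0.3, 0.6, 0.4, 0.5]      # perceived importance of Ao (As gets 1 - w)
cuts = [2.0, 1.5, 0.5, 0.3, 4.0, 1.8]   # behavior measure (Bos)

# Formula 2 applied subject by subject.
a_oa_s = [wi * o + (1 - wi) * s for wi, o, s in zip(w, a_o, a_s)]

r_os = pearson_r(a_o, a_s)          # Hypothesis 1: expected to be near zero
r_bo = pearson_r(cuts, a_o)         # Hypothesis 2: Ao alone
r_bs = pearson_r(cuts, a_s)         # Hypothesis 2: As alone
big_r = multiple_r(r_bo, r_bs, r_os)  # Hypothesis 3: both attitudes together
r_b_oas = pearson_r(cuts, a_oa_s)   # Hypothesis 4: importance-weighted composite

for label, value in [("r(Ao,As)", r_os), ("r(Bos,Ao)", r_bo), ("r(Bos,As)", r_bs),
                     ("R(Bos.Ao,As)", big_r), ("r(Bos,AoAs)", r_b_oas)]:
    print(f"{label:14s} {value: .2f}")
```

Because more favorable attitudes should go with fewer cuts, the simple correlations with Bos are expected to be negative, whereas the multiple correlation is non-negative by construction; the comparison of interest is therefore between magnitudes.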


METHOD 


To test the four hypotheses, we selected the fre- 
quency of cutting class in specified psychology 
courses as the dependent variable. While there are, 
no doubt, many motives and attitudes that might 
determine whether or not a college student will cut 
a particular class, it is reasonable to assume that 
the behavior, cutting class, will be mainly deter- 
mined by the attitude a student holds toward the 
particular professor who is teaching the class (Ao)
and by the attitude he holds toward the general 
activity of going to class (As). How often a par- 
ticular student cuts a particular class should be a 
function of these two attitudes and of their inter- 
action with one another, more specifically, of their 
perceived importance with respect to one another. 


Subjects and Procedure 


One hundred and eight students enrolled in two 
intermediate psychology courses at Michigan State 
University served as subjects. Questionnaires were 
administered under group conditions designed to 
obtain information on the frequency of class cuts 
in several previously taken courses, attitudes toward 
the professors teaching these courses (Ao), attitudes 
toward the general activity of attending class (As), 
and the perceived importance of a particular profes- 
sor and attending class in relation to one another. 
Most subjects completed the questionnaire in about 
15 minutes. 

“This research,” the subjects were told, “is designed 
to determine some of the factors which may affect 
the frequency of attending class in psychology 
courses.” The subjects were then instructed to list 
the psychology courses they had previously taken 
and completed—up to a total of five courses. The 
subjects were asked not to list the courses they were 
presently taking or to list courses in which there was
more than one instructor.

The subjects were also asked to list the number
of class hours per week for each course and their
estimate, to the best of their recollection, of the
number of times they had cut class. “We are not
interested in class cuts caused by illness, accident,
unusual weather conditions, etc.”

It should be noted that the behavioral measure 
was obtained by self-reports. There is no reason, 
however, to believe that the subjects, who remained 
anonymous, were not able to give reasonably reliable
estimates. 

Next, we obtained a measure of As: “We would
next like to ascertain your general attitude toward
attending class. We are not interested in your attitude
toward a particular instructor or toward a
particular course, but only in how you feel about
going to class in general.”

The subjects then indicated on a 9-point rating scale
how they felt in general “about the importance or
unimportance of attending class,” where 1 represented
“somewhat important to me,” 5 represented “important
to me,” and 9 represented “very important
to me.”

Measures were then obtained of their attitudes 
toward the several professors (Ao). They first identified
the professor in each course simply as “Professor 1,”
“Professor 2,” etc., thus preserving the professor’s
anonymity. They then rated each professor
on a 9-point rating scale, where 1 represented
“I disliked the instructor very much,” 5 represented 
“neither liked nor disliked,” and 9 represented “I 
liked the instructor very much.” 

The final section of the questionnaire was designed 
to obtain a measure of perceived importance of the 
two attitudes and was similar in format to that pre- 
viously employed by Rokeach and Rothman (1965). 


Finally, we would like to know to what extent 
your going or not going to class was jointly 
determined by your attitude toward each of your 
instructors and your general attitude toward attending
class. For each course you listed on page 1,
you are asked to fill in a percentage following 
each of the two simple scales. In each case, the 
two percentages should add up to 100%. 


1. You had ___ cuts (fill in from page 1) in Professor
1’s course. To what extent was the frequency
of attendance or the frequency of cuts in this course
determined by

                                                  Write
                                                  Percentage
                                                  Here
(a) your feelings about Professor 1?              ___%
    0%          50%          100%

(b) your feelings about the importance or
    unimportance of attending class?              ___%
    0%          50%          100%

(c) Total percentage (a + b)                      100%

TABLE 1

INTERCORRELATIONS AND MULTIPLE CORRELATIONS BETWEEN ATTITUDES AND BEHAVIOR

                                                       1            2            3              4                5
Data analysis                                   n      rAo,As       rBos,Ao      rBos,As        RBos·Ao,As       rBos,AoAs

All subjects, all courses                       81     .07          -.20         [illegible]    .49[illegible]   [illegible]
First 3-hour-per-week course listed by subject;
  hours constant, course varied                 71     .00          -.38**       -.35**         [illegible]      [illegible]
Course and hours constant
  1. Psychology 100                             35     -.07         -.31         -.34*          .48*             [illegible]
  2. Psychology 200                             30     .04          -.16         -.29           .33              -.39*
  3. Psychology 300                             29     -.01         -.24         -.45*          [illegible]      -.57*
  4. Psychology 400                             25     .20          -.25         -.27           .33              -.38
  5. Psychology 500                             20     -.26         -.68**       [illegible]    .68**            [illegible]

* p < .05.
** p < .01.
*** p < .001.

Of the 108 questionnaires that were collected, 19 
were excluded because the subject had listed less 
than three psychology courses, 5 were excluded because
the percentages in the last section did not add
up to 100%, and 3 were excluded either because the
subject had circled more than one number on a
rating scale or had omitted the rating. Analysis of
data was carried out for the remaining 81 subjects
who had completed all the information and who had
taken 3-5 psychology courses. For each of these
subjects, we obtained the following measures: (a) 3-5
measures of Bos, estimated number of class cuts;
(b) As, attitude toward attending class in general;
(c) 3-5 measures of Ao, attitudes toward particular
instructors; and (d) 3-5 measures of AoAs, weighted
for perceived importance. For example, if a subject
had rated As as 6, and Professor 1 in Psychology
400 as 3, and indicated that As was weighted 80%
and attitude toward Professor 1 (Ao), 20%, the
weighted value of AoAs was 5.4 ([.80]6 + [.20]3).
These scores can range from 1 to 9, the higher scores 
representing more favorable feelings toward object 
and situation. It is these scores that should turn out 
to be our best predictors of behavior. 
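
The weighting in the example above can be written out as a short calculation. The following Python sketch is an illustration only and is not part of the original study. The function name `weighted_attitude` and its argument names are hypothetical, and the sketch assumes that the two importance percentages sum to 100%, as the questionnaire required.

```python
def weighted_attitude(a_o, a_s, w_o):
    """Importance-weighted combination of two attitude ratings (hypothetical helper).

    a_o: rating of the instructor (Ao), on the 1-9 scale
    a_s: rating of attending class in general (As), on the 1-9 scale
    w_o: proportion of importance assigned to Ao (0-1); As receives 1 - w_o
    """
    if not 0.0 <= w_o <= 1.0:
        raise ValueError("w_o must lie between 0 and 1")
    return w_o * a_o + (1.0 - w_o) * a_s


# Worked example from the text: As = 6 weighted 80%, Ao = 3 weighted 20%.
score = weighted_attitude(a_o=3, a_s=6, w_o=0.20)
print(round(score, 2))  # 5.4
```

Because the weights sum to 1, the weighted score always stays within the 1 to 9 range of the two ratings.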


RESULTS 


Table 1 shows the results obtained from 
three kinds of analyses: 

Analysis I shows the results for all 81 sub- 
jects for all reported psychology courses. The 
data were averaged across 3-5 psychol- 
ogy courses to provide a single index for 
each subject for Ao, As, AoAs, and Bos. Bos, 
the class cut index, the dependent measure of 
behavior, is the total number of class cuts 
divided by the total number of class hours 
per week for all psychology courses the sub- 
ject had reported taking. Analysis I provides 
us with the most reliable test of the two- 
attitude theory previously described. 
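
The class cut index described for Analysis I can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the function name `class_cut_index` and the sample figures are hypothetical, and each course is assumed to be recorded as a pair of reported cuts and class hours per week.

```python
def class_cut_index(courses):
    """Total class cuts divided by total class hours per week (Bos).

    courses: list of (cuts, hours_per_week) pairs, one per psychology
    course the subject reported. The data layout is an assumption.
    """
    total_cuts = sum(cuts for cuts, _ in courses)
    total_hours = sum(hours for _, hours in courses)
    if total_hours <= 0:
        raise ValueError("total class hours per week must be positive")
    return total_cuts / total_hours


# Hypothetical subject with three courses: 12 cuts over 10 weekly hours.
example = [(4, 3), (2, 3), (6, 4)]
print(class_cut_index(example))  # 1.2
```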


Analysis II shows the same results obtained 
for 71 subjects who reported they had taken 
a 3-hour psychology course. Class hours per 
week is constant, while course content may 
vary. Ten of the 81 subjects did not report a 
3-hour-per-week course. If a subject listed 
several 3-hour courses, we analyzed only the 
first one he had listed. 

Analysis III shows the results obtained for
five specific psychology courses, identified here 
as Psychology 100, 200, etc. The course and, 
therefore, hours per week are constant, but 
different instructors may have taught the 
same course. 

Hypothesis 1: Ao and As are independent.
Column 1 shows that the correlations between
Ao and As are generally close to zero and are
in no case statistically significant. The two
attitudes, Ao and As, are clearly independent
of one another.

Hypothesis 2: Ao and As will each predict
behavior to at least some extent. Column 2
shows the correlations obtained between
frequency of class cuts (Bos) and Ao. It will be
noted that in all instances the correlations
are negative: the more positive the attitude
toward the instructor, the fewer the class cuts.
The correlation obtained in Analysis I for
all 81 cases is −.20, which is not statistically
significant. The size of this correlation is in
line with the findings generally reported in
the literature (Wicker, 1969) concerning the
relationship between attitudes and behavior.
Analysis II yielded a correlation of −.38
between behavior and Ao, which is significant
beyond the .01 level; Analysis III yielded a
highly significant correlation of −.68 between
class cuts and Ao in Psychology 500.

Column 3 shows the comparable correla-
tions obtained between As and behavior. These
correlations are also for the most part nega-
tive, suggesting that the more favorable the
attitude toward the activity of attending class,
the fewer the class cuts. Our best estimate of
this correlation (Analysis I) is −.46, which
is significant beyond the .001 level. Analyses
II and III show additional statistically
significant correlations.

It is evident that As, which is typically
neglected in attitude research, is a somewhat
better predictor of behavior than is Ao. In
Analysis I, Ao correlates −.20 with class cuts,
while As correlates −.46; the difference
between these two correlations is significant
beyond the .05 level.

Hypothesis 3: Behavior will be more accu-
rately predicted if both attitudes, Ao and As,
are taken into consideration. Column 4 shows
the multiple correlations between the two atti-
tudes on the one hand and class cuts on the
other. These multiple correlations are typi-
cally higher than those shown in columns 2
and 3 and range from .33 to .68. Five of the
seven multiple correlations are statistically
significant, and these correlations between
attitudes and behavior are more respectable
than those generally found in the literature.
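
With only two predictors, a multiple correlation can be recovered from the three pairwise correlations. The sketch below uses the standard two-predictor formula and the Analysis I values given in the text (−.20 and −.46 with class cuts, .07 between the two attitudes). The paper does not say how its multiple correlations were computed, so this is an assumption offered as a consistency check, and the function name `multiple_r` is hypothetical.

```python
import math


def multiple_r(r_y1, r_y2, r_12):
    """Multiple correlation of y with two predictors from pairwise r values.

    r_y1, r_y2: correlations of the criterion with each predictor
    r_12: correlation between the two predictors
    """
    if abs(r_12) >= 1.0:
        raise ValueError("predictors must not be perfectly correlated")
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return math.sqrt(r_squared)


# Analysis I: r(Bos,Ao) = -.20, r(Bos,As) = -.46, r(Ao,As) = .07
print(round(multiple_r(-0.20, -0.46, 0.07), 2))  # 0.49
```

Because Ao and As are nearly uncorrelated, the squared multiple correlation is close to the sum of the two squared simple correlations.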

Hypothesis 4: Behavior will be most accu-
rately predicted if the two attitudes, Ao and
As, are weighted for perceived importance.
The most impressive correlations obtained in
this study are those shown in column 5,
the correlations between class cuts and the
weighted AoAs measure. They range from
−.38 to −.78, and six of the seven correla-
tions are statistically significant, four of them
beyond the .001 level. These correlations are,
without exception, higher than the correlations
obtained between behavior and Ao or between
behavior and As, and they are, without ex-
ception, higher than the multiple correlations.
For all 81 cases across all psychology courses,
the correlation between the two weighted
attitudes and behavior is −.61, which is sig-
nificant beyond the .001 level. This correla-
tion is, furthermore, significantly higher than
the correlation between class cuts and Ao
(p < .001) or between class cuts and As
(p < .01). In Analysis II, the correlation
between the two weighted attitudes and
behavior (−.56) is also higher than either
the correlation between class cuts and Ao
(p < .10) or between class cuts and As
(p < .02).


The object versus the situation as a deter-
minant of behavior. In general, As was
consistently and significantly more favorable in
all comparisons than Ao. Furthermore, Analy-
sis I reveals that Ao and As were, on the
average, perceived to be important with re-
spect to one another in the ratio of 37:63.
In other words, the subjects estimated that
their attitude toward the activity of going to
class was about twice as important as their
attitude toward the instructor in determining
frequency of class cuts. This difference
between perceived importance of Ao and As is
significant beyond the .001 level. Similar
analysis of the remaining data yielded similar
results.


DISCUSSION 


In order to predict behavior with reasonable
accuracy, at least two attitudes, Ao and As,
as well as their cognitive interaction, must be
considered. The findings in regard to Ao and
As considered separately as predictors of be-
havior are not atypical of results from studies
in which only one class or type of attitude is
used to predict behavior. In the present study,
the correlation between the AoAs configuration
and behavior is statistically significant in six
of the seven subsamples, and it is larger than
the correlation between either Ao and behavior
or As and behavior in every analysis
undertaken. These findings thus suggest a
sensible explanation for the generally disappoint-
ing findings of many previous investigations
concerning the relationship between attitudes
and behavior.

LaPiere (1934), for example, measured an
attitude toward object (Chinese), but he did
not take into account the situation or the
attitude toward the situation in which the
object was encountered. The motel, hotel, and
restaurant owners had attitudes not only
toward Chinese, but also toward
managing their establishments in a profitable
manner. In the face-to-face situation, they per-
haps did not want to cause a scene that
other customers might witness, or embarrass
LaPiere, or offend the Chinese couple whom
LaPiere describes as “skillful smilers” speak-
ing perfect English. In the non-face-to-face
situation, however, the managers’ attitude
toward the Chinese interacted in a very dif-
ferent way with their attitude toward their
business establishment, and the former attitude
may have been a more important determinant
of behavior than it was in the face-to-face
situation.

Other studies that have reported discrep-
ancies between expressed attitudes and overt
behavior may be viewed in a similar manner.
In Minard’s (1952) study, for example, the
behavior of the white coal miners toward
Negroes can reasonably be expected to be
very different in the work situation of the
mine, where the safety of all depends upon a
close working relationship, than outside the
mine, where the social situation activates
many other predispositions.

Rather than hypothesizing that attitude is 
one determinant of behavior and the situation 
is another, we have applied here the same 
universe of discourse to both determinants. 

The situation is psychologically represented
as an As. By doing so, we are now able to
suggest, instead of behavior being a function
of attitudes and situations, that behavior is a
function of at least two attitudes, one con-
cerning the object and the other concerning
the situation. By representing the situation as
an As, the study of attitudes as determinants
of behavior becomes more truly social psycho-
logical in nature rather than remaining merely
personological in nature. Instead of lamely
explaining away disappointing attitude-behavior
correlations with “apparently, the
situation is more important than attitude as
a determinant of behavior,” it is possible to
specify in the present study the extent to
which Ao and As predict behavior.

The major implication of the present con-
ceptualization and findings concerning the
attitude-behavior relationship is that both Ao
and As, as well as their cognitive interaction,
should be assessed in order to maximize the
accuracy of predictions to behavior. The in-
vestigator’s task is to identify the relevant Ao
and As when making predictions about a
specific behavior, and this identification will
often be a matter of intuition or educated
guess. In the present study, for instance, it
was assumed that in predicting the frequency
of class cuts, the most relevant attitudes would
probably be those concerning the instructor
of a particular course (Ao) and those concern-
ing the general activity of attending class
(As). There are, no doubt, other attitudes
toward objects and situations that may be
relevant that have been ignored here, for
example, attitude toward other students in
class, attitude toward the textbooks employed,
attitude toward the teaching assistant, atti-
tude toward the room in which classes are held,
attitude toward that long walk from dormi-
tory to classroom, etc. Future research may
well include more than one Ao and As in
predicting behavior.

Over and above the identification of the
most relevant attitudes, it is also necessary,
as we have shown, to assess the relative
importance of these attitudes. Measures of
(w) and (1 − w) were obtained in this study
by asking the subject to rate the relative
importance of Ao and As in the context of
AoAs. It is perhaps possible to assess the rela-
tive importance of the two attitudes by other
methods and, perhaps, on theoretical rather
than empirical grounds. Rokeach and Roth-
man (1965) have previously suggested that
the comparison of the relative importance of
the two attitudes will take place within the
general framework of one’s total belief sys-
tem, wherein all beliefs, and thus all attitudes
(which are organizations of beliefs), are ar-
ranged along a central-peripheral dimension
of importance. Conceptualizations about the
different types of beliefs and attitudes which
may be found along this dimension of impor-
tance allow for at least some informed guesses
to be made about which of the two attitudes,
Ao or As, is likely to be more important in a
given context.

In regard to the relative importance of Ao
and As, it might be suggested that As will often
be the more important of the two. Previously
cited studies, for example, those of LaPiere
(1934), Lohman and Reitzes (1954), Minard
(1952), and Kutner et al. (1952), all empha-
size the importance of the situation within
which an attitude object is encountered.
Findings of the present study also show the
somewhat greater relative importance of As
than of Ao, even though these findings are not
altogether consistent on this point. The impli-
cation for future studies of the relationships
between attitudes and behavior is clear. The
importance of situational attitudes and their
interactions with attitudes toward objects can
no longer be neglected in research concern-
ing the relationship between attitudes and
behavior.

Perhaps the most obvious limitation in the 
present study is that behavior was not actu- 
ally observed. The subjects were asked to 
estimate from memory the number of their 
class cuts in each course. There is no reason 
to assume, however, that these estimates are 
not reasonably accurate or that the subjects 
systematically distorted their recall of such 
behavior. Nonetheless, observation of be- 
havior would have been more desirable, if 
less practicable, and is suggested in future 
research. 

The fact that we relied on the subject’s 
self-report to obtain measures of behavior and 
attitudes leads to the possibility that these 
measures were not entirely independent of one 
another and, thus, spuriously determined the 
outcome. It is conceivable, perhaps, that sub- 
jects who reported large numbers of cuts in a 
particular course would, for the sake of con- 
sistency, also report lower evaluations of the 
instructor in that course. Such a consistency 
interpretation, however, would not fit the fol-
lowing two findings: (a) that self-reports con-
cerning Ao and As are generally uncorrelated
with one another and (b) that each of
the two attitudes is nevertheless consistently
correlated with behavior.

Implicit in the formulation that behavior 
is a function of at least two cognitively inter- 
acting attitudes is the proposition that the 
prediction of future behavior is based on prior 
knowledge of Ao and As and of their relative
importance with respect to one another. In 
the present study, subjects were asked for 
their evaluations of the objects and of the 
situation after the actual behavior had taken 
place. No measures of possible attitude 
change over time were obtained. It does not
seem unreasonable that such changes in evaluation,
especially of instructors, might have
occurred.

One additional aspect of the behavioral 
measure should be noted. Although the sub-
jects were instructed to disregard factors such
as illness, accident, and unusual weather con-
ditions when estimating the number of times
they had cut a particular course, there is
no way of knowing to what extent these in-
structions were actually followed. The cor-
relations between the attitudinal and behav-
ioral measures, as well as the reliability and
validity of the behavioral measures, would be
reduced to the extent that these instructions
were not kept in mind when estimating Bos.

The possibility of a difference among sub-
jects in the meaning of the Ao and As measures
should also be noted. In evaluating the in-
structors, subjects were asked to what extent
they “liked” each particular instructor. Liking
can be interpreted in a number of different
ways. To some subjects it may have meant
“good lecturer,” to others a “nice person,” and
so on. Differences in interpretation of “the
general importance of attending class” are
also possible. These variations in meaning
undoubtedly introduced error variance and,
thus, probably reduced the size of the
correlations obtained.

To summarize the limitations of the present
study, it should be noted that some aspects
of the study may have affected the reliability
and validity of the measurements. These
include the following: (a) The behavioral mea-
sures represent estimates provided by the
subjects after the occurrence of the actual
behavior; (b) attitude change may have oc-
curred; (c) factors other than Ao and As may
have affected class-cutting behavior; and (d)
the meaning of the behavioral and attitudi-
nal measures may have differed among the
subjects.

In conclusion, then, results of the present
study lend support to the general hypothesis
that behavior can be more accurately pre-
dicted than is widely supposed from a knowl-
edge of verbally expressed attitudes. The more
specific hypothesis that prediction of behavior,
with respect to an object within a situation,
will be more accurate when based on knowl-
edge of the cognitive interaction between Ao
and As than when based on knowledge of
either attitude alone is also supported by
the results.
Ninety subjects saw slides portraying scenes in each of which a white police-
man was attacking a black civilian, while 90 others saw the same slides with
all aggressive content masked out. All subjects made evaluative ratings of the
policeman and the “victim” who appeared in the final slide of the series. In
each group, half of the subjects gave ratings for only the experimenter’s use
(no-consequence condition), and half gave ratings to be forwarded ostensibly
to an investigative “Commission” that might affect the fates of the depicted 
persons (consequence condition). For the no-consequence condition, it was 
predicted that observers of aggression would employ victim derogation in 
their evaluations; in the consequence condition, observers of aggression were 
expected to enhance the victim’s qualities. Devaluation of the attacker was 
predicted for the consequence ratings. Findings support the predictions for 
ratings of the victim figure, particularly when subjects perceived the attack 
as unjustified and when the victim was rated before the attacker. Findings 
were less consistent for the evaluations of the attacker. Implications for con- 
sidering Lerner’s “just world” hypothesis are discussed. 


Imagine that you witness the following 
incident: On a public street, you see a man 
receiving a severe beating from another man. 
Assume, also, that you are powerless to affect 
the course of events. You are merely a passive 
observer of this fait accompli; you can only 
form an impression of the event and carry it 
with you in some recess of your mind. What 
impression will you tend to form? Will your 
sympathies go out to the victim or to the 
attacker? 

This is not a trivial question. Although it 
is unlikely that you personally will witness 
acts of extreme violence, you frequently ob- 
serve violent incidents through the medium of 
television, newspapers, or picture magazines. 
Almost daily you may see such scenes. And, 
even if you turn away in horror, you make 
implicit judgments about the qualities of 
both the victim and the attacker. The ques-
tions, then, should not be dismissed: What
will your impressions be? Do you remain
entirely neutral, or do you favor one party
over the other?


1 This paper was adapted from a master’s thesis
by the first author under the guidance of the second
author. We are indebted to Alice Eagly for her
helpful suggestions as a member of the thesis com-
mittee, and to Ivan Steiner and Elaine Walster for
their comments on an earlier draft of this paper.

2 Requests for reprints should be sent to Alan
Lincoln, Department of Sociology, University of
Massachusetts, Amherst, Massachusetts 01002.

Conventional wisdom might simply answer
this question as follows: You decide who is
“right” and who is “wrong”; if the victim is
being beaten “unjustly,” your sympathies will
flow out to him, and your anger will rise at
the attacker; or, if the attacker has a “just”
reason, then you will identify with the at-
tacker and blame the victim. Conventional
wisdom might assert that the observer bases
his judgments on the facts, and nothing but
the facts. But what are the relevant “facts”?

A social-psychological line of analysis might
begin at the same starting point, but arrive
at a different set of answers. Social-psycho-
logical theory would confirm that the naive
observer finds it important to maintain justice
in the world he perceives around him (Adams,
1965; Homans, 1961; Lerner & Simmons,
1966). Yet, in the situation described above,
there are some psychological responses that
are likely to be ignored by the conventional
wisdom. For one thing, the perception of
justice or injustice may be the consequence
rather than the origin of the perceiver’s
cognitive operations. The observer’s judgment
of right and wrong will depend on the charac-
teristics that he attributes to the parties in
the incident. If he sees the victim as “good”
and the attacker as “bad,” he is likely to
judge that the victim has suffered an injus-
tice. Alternatively, the observer may choose to
believe that the victim is a bad person; in
that event, the victim will have deserved his
fate, and the observer can rest comfortably
in the belief that justice has triumphed.

Pertinent data for such a psychological 
analysis have been provided by Lerner and 
his colleagues (Lerner & Mathews, 1967; 
Lerner & Simmons, 1966). In their experi- 
ments, subjects tended to reject or devalue 
the victim in incidents where they were un- 
able to alter his fate. On the basis of these 
experiments, Lerner has suggested that to 
retain their faith in a “just world,” people 
will derogate a helpless victim; they wish to 
believe that he deserves his fate. This sort of 
belief adjustment is part of a more general 
cognitive balancing process. People create
justice out of perceived injustice or stability out
of apparent instability to bring order into
their perceived world.

Adams (1965) has approached the study of 
justice and injustice on the basis of Homans’ 
(1961) exchange-theoretic notions. Homans’ 
concept of “distributive justice” implies that 
a person compares his own outcomes and 
investments with those of some referent other 
and decides whether he is receiving equitable 
returns. The outcome-input model derived by
Adams yields predictions parallel to those of 
Lerner’s “just world” hypothesis, and it pro- 
vides a general framework for viewing situa- 
tions of equity and inequity. 

The problem for the present study arose,
in part, out of Adams’ model and Lerner’s
earlier research. At the conclusion of their
empirical study on observers’ reactions to
“innocent victims” (where they found that
experimental subjects tended to devalue the
suffering victim), Lerner and Simmons (1966)
wrote the following:


The most compelling question raised by these 
data is under what conditions will a person whose 
suffering derives from altruistic motives be reacted 
to with compassion and admiration rather than 
rejection [p. 210]. 


The present purpose was to investigate ob-
servers’ reactions to scenes of violent en-
counter between policemen and Negroes:
incidents in which a black victim is attacked
by a white policeman. Would there be derogation
of the victim in the eyes of the observer?
Or might there be an alternative response,
that is, the enhancement of the Negro victim
and/or the derogation of the white agent?

If a subject believes that an injustice has
been committed, he will find this belief to be
unpleasant. Such perception of inequity ap-
pears to produce distress (Adams, 1965;
Walster, Berscheid, & Walster³). Berscheid
and Walster (1969) proposed that, if the sub- 
ject himself is the “harm-doer,” he most prob- 
ably will use one of two alternative means 
of restoring justice: (a) justification of the 
harmful act or (b) compensation of the 
harmed victim. They pointed out that justifi- 
cation is particularly likely to occur when 
adequate restitution is unavailable (Berscheid 
& Walster, 1969, p. 21). However, if a per- 
son has available means for making adequate 
restitution to the victim of a harmful act, he 
is fairly likely to use compensation as his 
method of restoring justice. 

In the present study, analogous reasoning 
was employed to predict reactions of subjects 
who were merely the observers of aggressive 
incidents. We speculated that if a subject
were given the opportunity to make only 
private ratings of the victim and the attacker 
in an aggressive incident—ratings that would 
have no consequence for altering the fate of 
either party—he would tend to justify the 
attack by derogating the victim, as occurred 
in the study by Lerner and Simmons (1966). 
Having no means of compensating the victim 
for his injury, observers would downgrade 
his personal qualities in order to rationalize 
the justice of the victim’s fate. (The same 
logic would assert that these same observers 
would also upgrade the personal qualities at- 
tributed to the attacking policeman. How- 
ever, it seemed unlikely that the average col- 
lege student observer would respond sym-
pathetically to an aggressing policeman.)

What would happen if subjects believe that
their evaluations of both the victim and the
attacker may have a noticeable effect on these
parties? Would their reactions be different
if their ratings were to be transmitted to an
agency that subsequently might influence the
fates of the persons concerned? We predicted
that such an eventuality would alter the sub-
ject’s reaction, since now he would have the
opportunity to at least partially compensate
the victim and retaliate against the attacker.


3 E. Walster, E. Berscheid, and G. W. Walster.
Recent advances in equity research. Manuscript in
preparation.

The following hypotheses were formulated. 
Hypothesis 1 follows from the findings of 
Lerner and Simmons (1966). 

1. If his evaluation has no consequence for 
altering a victim’s fate, an observer will tend 
to devalue the characteristics of the victim 
of an unjustified attack. 

Hypotheses 2 and 3 are derived from the 
predictions of Berscheid and Walster (1967, 
1969). Note that these authors’ research in- 
vestigated only the responses of harm-doers 
themselves, and obtained no evidence about 
the behavior of outside observers. 

2. If his evaluation does have consequences 
for affecting the fate of the victim of an un-
justified attack, the observer will tend to
enhance the characteristics of the victim. 

3. If his evaluation does have consequences 
for affecting the fate of the attacker in an 
unjustified attack, the observer will tend to 
devalue the characteristics of the attacking 
agent. 

It was expected that subjects would differ
considerably in the extent to which they per- 
ceived the occurrence of an actual injustice. 
The more unjust the incident, the greater 
should be the subject’s need to restore justice. 
More extreme responses should be made 
by those subjects who perceive greater de- 
grees of injustice. Therefore, the following 
hypothesis was proposed: 

4. The greater the perceived injustice of the
aggressive act, the more extreme will be the 
subject’s “justice-restoring” response. 


METHOD 
Design and Overview 


The hypotheses were tested via a 2 × 2 factorial
design (Aggression versus Nonaggression [Control] ×
Consequence versus No Consequence). Subjects were
randomly assigned as observers either of unjustified
aggression or of nonaggression. Across both aggres-
sion conditions, the two stimulus persons in the
evaluated photo were the same policeman and Negro
“protester”; the nonaggression observers served as
the control group. Within each level of aggression,
half of the subjects were assigned to a no-consequence
condition and half to a consequence evaluation
condition. The former subjects were told that
their ratings of the pictured people were solely for
the use of the experimenter; the latter were in-
formed that their ratings would be used to [...]
a fictitious Interracial Commission for the Investigation
of Public Behavior, which had ostensibly
supplied these materials, and that this Commission
would forward their ratings to the stimulus persons [...]

Subjects experienced the experiment in the follow-
ing sequence: (a) They received an orientation about
its ostensible purpose. (b) They looked at one of
two series of 21 slides, depicting either a set of
aggressive or of nonaggressive incidents. (c) In the
aggression condition, the subjects rated the perceived
injustice of the incident shown in the final slide.
(d) All subjects received either the consequence or
no-consequence instructions. (e) Finally, the subjects
made their ratings of the two stimulus persons.


Subjects


In the main study, the subjects were 191 Caucasian
males and females enrolled in introductory psychol-
ogy classes at the University of Massachusetts. To
facilitate analysis, the scores of 11 subjects (no more
than 4 from any experimental cell) were randomly
deleted. Exactly 45 subjects remained in each of the
four experimental conditions.

Before these subjects were recruited, pretests were
conducted to ascertain the amount of campus-wide
sympathy for Negroes compared to whites. Was it
possible that subjects, because of pro-Negro sym-
pathies, would resist giving negative evaluations to
the Negro stimulus persons? Or, conversely, might
subjects be biased against a Negro? Verbal descrip-
tions, similar to those to be later employed in the
main study, of “positive” and “negative” persons
were presented to 44 Caucasian students. Half of the
positive persons were described as Caucasians and
half as Negroid; negative stimulus persons were simi-
larly identified. The dependent measure was identical
to the measure later employed in the main research;
it consisted of 15 pairs of bipolar evaluative adjec-
tive scales identical to those used by Lerner and
Simmons (1966). There were no significant differ-
ences among the ratings attributable to the race of
the stimulus person.⁴


Materials 


Aggression stimuli. Aggressive acts were depicted
by 21 photographs reproduced from weekly periodi-
cals, processed into transparent slides. All slides por-
trayed white uniformed policemen attacking black
males, generally in the presence of other persons.
Slide 21, in Figure 1, is an example. All 21 slides
were drawn from a larger pool of magazine pictures
depicting violent police-black incidents, which had
earlier been rated by 24 undergraduates. Each slide
retained for this study depicted an incident rated as
unjust, that is, above 4.00 on a 6-point scale. The
final set of 21 slides had a mean injustice score of
4.60; Slide 21 had an injustice rating of 4.91 (see
Figure 1, upper).


4 A full description of the pretest procedures and
results, as well as a verbatim account of all other
procedures and communications to the subjects, is
contained in Lincoln (1969).

Nonaggression stimuli. For control purposes, the
aggression slides were edited with masking tape to
depict Caucasians and Negroes socially interacting in
public places without any visible aggressive cues. To
explain the masking of the slides, the subjects were
informed that the Commission was interested only in
some of the persons in each of the scenes they were
about to see. Where possible, both the corresponding
“victim” and the “attacker” figures were displayed;
when this was impractical, nonaggression slides por-
trayed onlookers from the surrounding crowd. In
Slide 21, the evaluated scene, a vicious police dog
and certain other aggressive cues were masked out,
but the policeman and Negro were fully in view
(see Figure 1, lower).


Procedure 


1. Orientation. Subjects were brought into the 
experimental room in groups ranging from 15 to 24. 
Fig. 1. Aggressive (upper) and nonaggressive (lower) versions of Slide 21.


The experimenter seated them and presented them 
with an experimental booklet. Tape-recorded instruc- 
tions were then presented which indicated that the 
study was designed to “investigate awareness and 
memory of complex stimuli.” Subjects were told 
also that we were interested in “the effect of inter- 
vening tasks on recall.” Aggression subjects were 
informed that they were to see “high-arousing 
complex material”; nonaggression subjects were told 
they would be exposed to “low-arousing complex 
material.” All subjects were told that the materials 
had been loaned by the Interracial Commission for 
the Investigation of Public Behavior and that this 
material was being used because it satisfied our 
“demanding” requirements. A brief description of 
the functions of the fictitious Commission was given, 
indicating its nonpolitical concerns with societal 
problems. 

2. Exposure to stimulus material. Subjects in the 
aggression and nonaggression conditions each received 
a communication from the same “authoritative” 
Commission. The two communications were similar 
in length and format, but differed in theme. In the 
aggression condition, the aggressive behavior was 
portrayed as unjustified; the message stressed that 
the victims were peaceful demonstrators, civil rights 
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workers, and the like. The nonaggression message 
described the stimulus persons in a similar manner, 
but did not mention either aggression or suffering. 

In Slide 21, the two stimulus persons were 
described as follows: 


The young man in this picture is 23 years old. 
He is married and has no children. He is a gradu-
ate of Jefferson High School in Louisville, Ken-
tucky. He is currently employed as a salesman 
in an automobile dealership. At the time this pic- 
ture was taken, he was involved in a peaceful 
demonstration to call attention to poor city 
maintenance in certain areas of the city. 

The police officer is 30 years old. He is married 
and has one child. He has been with the force 
for 9 years. When this photograph was taken, 
he was the officer in charge of a detail of police. 


3. Rating of perceived injustice (aggression condi- 
tion only). Immediately after their exposure to the 
stimulus slides, all subjects were required to respond 
to several questions “designed to measure attention 
and recall.” In addition, the aggression subjects
were asked to rate the degree of injustice (or justice)
of the attack portrayed in Slide 21. On a 6-point
scale (where 6 = totally unjustified), they answered
the question “How justified was the police action?”

4. Consequence-no-consequence manipulation. Sub- 
jects in both the aggression and nonaggression condi- 
tions were assigned randomly to either the conse- 
quence or the no-consequence rating condition by 
either the inclusion or the omission, respectively, of 
the following statement. If included in the rating 
booklet, the statement was located just prior to the 
point at which the subject was asked to give his 
evaluation of the stimulus persons: 


We have agreed with the Commission for the
Investigation of Public Behavior to report our
subjects’ opinions to them. The Commission will
make this information available to the people
involved in the slides you have seen.


5. Ratings of stimulus persons. Each stimulus 
person was rated on a set of 15 scales. This instru- 
ment was identical to that employed by Lerner and 
Simmons (1966); it consisted of 15 pairs of bipolar 
adjectives on 9-point scales. 

Subjects were instructed to make their judgments 
on the basis of all their information concerning the 
persons in Slide 21. Following this instruction, the 
slide was displayed on the screen again while the 
subjects made their ratings. In each condition, half 
of the subjects rated first the policeman and then 
the victim, while the remaining subjects rated these 
people in the opposite order. 


RESULTS 
Consistency of Evaluative Scales 


Victim evaluation. To determine whether
the 15 rating scales would comprise a single
measurement scale, interitem correlations
were computed. The mean interitem correla-
tion coefficient, computed by Fisher’s z trans-
formation, was .26 (N = 180, p < .01); there
were no negative correlations. The mean cor-
relation of each item with the total score was
.55 (N = 180, p < .01). In view of these con-
sistently positive intercorrelations, further
analyses of the victim evaluation used the
total scores across all items.

Agent evaluation. The same 15 scales used
for evaluating the agent were more highly
correlated. The mean interitem correlation
was .52 (N = 180, p < .01). The mean cor-
relation of the items with the total score
was .75 (N = 180, p < .01). All subsequent
analyses therefore were based on the total
scores.


Subjects’ Awareness


even subjects, about equally divided
among experimental conditions, indicated that
they thought the purpose of the study was
not primarily to investigate “memory,” but
to determine attitudes toward Negroes or
police. However, no subject indicated either
that he was aware of the existence of more
than the single experimental condition to
which he himself was assigned or that he had
any notion of the experimental predictions.
The data of these “aware” subjects were
therefore retained.


Devaluation and Enhancement of the Victim 


Hypothesis 1 stated that the victim would 
be devalued, compared with the nonvictimized 
protester, if the subjects believed that their 
evaluation would be unavailable to the victim 
(no-consequence condition). In contrast, Hy-
pothesis 2 predicted that the victim’s traits
would be enhanced, compared with those of
the nonvictimized protester, if the subjects
believed that their evaluations would have
consequences for the rated persons. An analy-
sis of variance was computed to assess the
effects of the two independent variables of
aggression (aggression and nonaggression)
and consequentiality of evaluation (conse-
quence and no consequence). In this 2 × 2
design, the four cell variances were relatively
homogeneous (Fmax = 1.12, a = 4, n = 45).


If, as predicted, the no-consequence victim
ratings of aggression observers were less posi-
tive than those of nonaggression observers,
while the opposite trend occurred for the
consequence evaluations, a significant inter-
action would be found. The predicted interac-
tion between level of aggression and evalua-
tion was indeed highly significant (F = 9.21,
df = 1/176, p < .001).

Table 1 shows that in the no-consequence 
condition, the subjects observing aggression 
gave the victim a mean rating of 91.78, while 
those who observed nonaggression gave him a 
more positive rating of 99.42. This difference 
was significant in the predicted direction 
(t = 2.76, df = 88, p < .01). In the conse-
quence condition, the subjects who observed
the aggression scene gave the victim a mean
evaluation of 94.69, and those observing non-
aggression rated him 90.09. The variance of
the consequence ratings was much smaller,
and this difference was highly significant in
the predicted direction (t = 5.85, df = 88,
p < .001).

It had been expected that the no-conse- 
quence ratings of the victim would be more 
negative than the corresponding consequence 
ratings. Although Table 1 indicates a tend- 
ency in that direction, the effect was not 
significant if one considers the ratings of all 
90 subjects exposed to aggression (t = 1.21,
df = 88, p > .20). However, if one considers 
only those subjects who indicated that they 
believed the aggression to be unjust, then the 
victim’s mean no-consequence rating was 
89.46, while his mean consequence rating was 
97.07. This adjusted difference is significant 
(t = 2.31, df = 72, p < .02). 

Thus, Hypotheses 1 and 2 were supported. 
Aggression observers who considered the at- 
tack to be unjust tended to devalue the 
victim when their evaluation appeared to have 
no consequences for him, but they enhanced 
their ratings of the victim when instructed 
that their evaluation would be made available 
to him or, at least, to a public agency that 
might be instrumental in furthering justice. 
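
The crossover in the victim ratings can be made concrete with a little arithmetic. The Python sketch below is an illustration only and is not part of the original analysis. It takes the four victim means reported in Table 1 and computes the simple effect of aggression within each rating condition, then the difference between those two effects (the interaction contrast). The dictionary keys and function names are hypothetical labels chosen for this example.

```python
# Victim means as reported in Table 1 (higher = more positive evaluation).
# Keys are hypothetical labels: (rating condition, level of aggression).
victim_means = {
    ("no_consequence", "nonaggression"): 99.42,
    ("no_consequence", "aggression"): 91.78,
    ("consequence", "nonaggression"): 90.09,
    ("consequence", "aggression"): 94.69,
}


def aggression_effect(means, rating_condition):
    """Aggression mean minus nonaggression mean within one rating condition."""
    return (means[(rating_condition, "aggression")]
            - means[(rating_condition, "nonaggression")])


def interaction_contrast(means):
    """Difference between the two simple effects of aggression."""
    return (aggression_effect(means, "consequence")
            - aggression_effect(means, "no_consequence"))


# Negative value: the victim is rated lower than the nonvictimized protester.
devaluation = aggression_effect(victim_means, "no_consequence")
# Positive value: the victim is rated higher than the nonvictimized protester.
enhancement = aggression_effect(victim_means, "consequence")

print(round(devaluation, 2), round(enhancement, 2),
      round(interaction_contrast(victim_means), 2))
```

With the reported means, the no-consequence effect is about 7.64 points in the direction of devaluation and the consequence effect is about 4.60 points in the direction of enhancement. The opposite signs of the two simple effects are what the significant interaction reflects.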


Devaluation of the Agent 


Consider now the ratings of the attacker 
as compared to those of his nonaggressive 
counterpart. Hypothesis 3 predicted that, fol-
lowing their observation of aggression, sub-
jects in the consequence condition would de-
value the agent. If we compare the aggres-
sion ratings with the nonaggression ratings in
this condition, this prediction is strongly up-
held. As an aggressor, the policeman received
a mean evaluation of 64.18; as a nonaggres-
sor, he was rated 81.20 on the average (see
Table 1). This difference is highly significant
(t = 4.21, df = 88, p < .001).


TABLE 1

MEAN EVALUATIONS OF VICTIM AND ATTACKER AS A
FUNCTION OF CONSEQUENTIALITY, LEVEL OF
AGGRESSION, AND DEGREE OF
PERCEIVED INJUSTICE

                                 Stimulus figure receiving
                                        evaluation
Experimental condition      n      Victim       Agent

No-consequence rating
  Nonaggression            45      99.42a       83.98
  Aggression               45      91.78        56.13
    Unjustifiedb           36      89.46        52.63
    Justifiedb              9      96.61        70.11

Consequence rating
  Nonaggression            45      90.09        81.20
  Aggression               45      94.69        64.18
    Unjustified            38      97.07        60.39
    Justified               7      81.71        84.71

a The higher the score, the more positive the evaluation.
b “Unjustified” referred to scores of 5 or 6 and “justified” to
scores of 1 to 4 on a balanced 6-point scale from “totally
justified” (1) to “totally unjustified” (6).


However, the data in Table 1 indicate that
the predicted effect also occurred within the
no-consequence condition. If anything, the
subjects in the no-consequence condition de-
valued the agent even more than those who
rated him with some possible consequence.
Thus, the interaction between levels of ag-
gression and consequentiality of evaluation
approached significance in the opposite direc-
tion to that predicted (F = 3.28, df = 1/176,
p < .08).

Although Hypothesis 3, stated literally, re- 
ceived confirmation, the consequence subjects 
did not devalue the agent more than did 
the no-consequence subjects. Possible reasons 
for this finding will be discussed later. 


Perceived Injustice and Evaluation 


Hypothesis 4 suggested that the greater
the perceived injustice of the aggressive be-
havior, the more extreme would be the ob-
servers’ justice-restoring responses. The mean
injustice score of all aggression observers was
5.11 on the 6-point scale of perceived injus- 
tice. There was no significant difference be- 
tween the mean scores of consequence raters 
(5.02) and of no-consequence raters (5.20). 

Victim ratings. For those subjects whose 
ratings were of no consequence, there was 
a negligible correlation between their percep- 
tion of the injustice and their evaluation of 
the victim (r = .03, n = 45). This is not sur-
prising, since subjects who felt that the attack
was just were likely to rate the victim nega- 
tively, as were those who perceived injustice 
and subsequently devalued the victim. These 
two opposing tendencies would incline the 
resulting ratings in the same direction. 

In the consequence condition, however, per- 
ceived injustice was positively correlated with 
subjects’ evaluations of the victim (r = .48,
n = 45, p < .005). The 38 subjects who indi-
cated that the attack was unjustified evalu-
ated the victim far more positively (97.07)
than did those 7 subjects who perceived the
aggression as justified (81.71); this difference
was highly significant (t = 2.98, df = 43,
p < .005).

Agent evaluation. In both conditions, the
subjects’ perception of injustice and their
positivity to the attacker were inversely cor-
related. In the no-consequence condition, the
correlation was —.46 (n = 45, p < .005), and
in the consequence condition the correlation
was —.57 (n = 45, p < .001). As would be
expected, there was also a significant differ-
ence between the mean evaluations of sub-
jects who defined the attack as unjust and
those who saw it to be relatively just (by
t test, p < .02 for no consequence; p < .005
for consequence subjects). These results indi-
cate that aggression alone did not arouse con-
demnation; observers also had to perceive the
attacker as unjustly aggressive.

Hypothesis 4 received support. Observers 
who rated the attack as clearly unjustified 
(5-6) gave ratings farther in a “justice re- 
storing” direction than did subjects who indi- 
cated that the attack was either just or only 
slightly unjust (1-4). Among four separate 
analyses, the singular exception of the victim 
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ratings in the no-consequence condition was 
easily explainable. 


Effects of Order 


Going beyond our original hypotheses, we 
examined possible effects of order of rating. 
The notion tested was that more extreme 
responses of either devaluation or enhance- 
ment would occur on subjects’ first ratings 
than on second ratings. A subject’s first rating 
was made while his sense of perceived in- 
justice would have been at its peak; and, if 
the first rating had functioned to restore 
somewhat his sense of perceived justice, then 
the second rating would be made under less 
severely perceived injustice. 

Victim ratings. An analysis of variance 
assessed the effect of order of rating on the 
extremity of victim ratings in relation to their 
consequentiality and the level of aggression 
viewed. This analysis (performed after ran-
domly deleting some subjects’ scores to pro-
duce equal cells) indicated that the inter-
action of order and consequentiality of
ratings approached significance (F = [illegible],
df = 1/136, p < .07).

Consider the effects of order in the no- 
consequence and the consequence conditions 
separately. No-consequence subjects who 
rated the victim first devalued him more than 
did those who rated him second, as shown 
by a simple effects test (F = 5.38, df =
1/136, p < .025). In the consequence condi-
tion, first raters tended to enhance the victim
more than did second raters, but this effect
was only of borderline significance (F = 2.99,
df = 1/136, p < .10).

Agent ratings. In the no-consequence condi-
tion, first ratings of the attacker were more
devaluing of the attacker than were second
ratings (F = 16.53, df = 1/136, p < .001).
However, this effect did not occur in the
consequence condition (F < 1).

In general, then, order of a subject’s ratings
did seem to affect the extremity of his re-
sponse, but primarily within the no-conse-
quence condition.


DISCUSSION

In concluding their paper on observers’
reactions to injustice, Lerner and Simmons
(1966) asked under what conditions suffering
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may be reacted to compassionately rather 
than with rejection. Findings from the present 
study contribute toward an answer to that 
question. When no one who mattered would 
be likely to see their ratings, our subjects 
tended to use the mechanism of derogating 
the victim—as Lerner and Simmons’ subjects 
had done. However, if it appeared that their 
evaluations would be made available to the 
victimized protester, then they tended to rate 
him more highly than his nonvictimized 
counterpart.5 These trends were more clear
for observers who fully perceived the injus- 
tice, and for those whose ratings of the 
victim were made first. The findings were less 
consistent in regard to the ratings of the 
agent. The aggressing policeman was dero- 
gated strongly by both the no-consequence 
and the consequence subjects; if anything, 
though, the no-consequence raters devalued 
him even more than did the consequence 
raters—contrary to our initial prediction. 
The findings of this study help to extend
the applicability of Adams’ (1965) outcome-
input model to the realm of third-party ob-
servation of injustice. If the observer per-
ceives the perpetration of an injustice, he
may choose either to justify the act (i.e., ad-
just the actors’ perceived inputs), or he may
try to correct the injustice (i.e., adjust their
future outcomes). In this study, the subjects 
in the consequence condition had potential 
opportunity of affecting the parties’ outcomes; 
they adjusted their ratings so as to raise 
their evaluations of the victim. 

Admittedly, there is some ambiguity in the 
exact nature of the consequences perceived 


5 A conversation with Barry Collins was helpful in 
suggesting the labels of consequence and no conse- 
quence, earlier referred to as “public” and “private” 
(Lincoln, 1969). Collins has recently used the conse- 
quentiality concept for explaining some seemingly 
contradictory findings about certain cognitive dis- 
sonance phenomena. For instance, writing an anony- 
mous essay has been found to produce little disso- 
nance, while making a parallel counterattitudinal 
statement in public role playing does produce dis-
sonance effects. Collins has come to believe that the
anonymous essays are perceived to have no conse- 
quence for the subjects, while the role playing does 
have consequences and thus produces distinct cogni- 
tive effects (B. Collins, personal communication, 
November, 1970). 
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by the consequence raters. For example, it is 
unclear whether the subjects perceived their 
enhancement of the victim as functioning 
mainly (a) to help the victim through the 
possible intervention of the Commission, or 
(b) to try to inform him that someone cared 
about him. Yet, these two variables are gen-
erally confounded in the real world. Future
research may concern itself with their disen-
tanglement.

Also, it may be puzzling as to why no-con- 
sequence ratings of the aggressing policeman 
were more negative than his consequence rat- 
ings, or why no-consequence ratings of the 
Negro nonvictim were more positive than 
were consequence ratings in the nonaggression 
condition. This finding may be explained 
by assuming that the subjects felt freer 
to express personal biases in the no-conse- 
quence condition, and that privately they 
felt very positive toward the black civilian 
but very negative toward the white policeman. 
In the consequence condition, the raters might 
have felt inhibited from giving extreme rat- 
ings to either figure. If correct, this assump- 
tion also would explain why the variances 
were smaller for ratings in the consequence
condition. 

The findings did show that subjects who 
justified the agent’s attack rated him more 
highly than subjects who deplored the attack, 
and, in the case of consequence subjects, more 
highly even than the control ratings. Since 
perceived injustice was not experimentally 
manipulated, it is not certain whether the 
degree of perceived injustice affected the eval- 
uations causally. It is noteworthy, though, 
that the predicted effect was significantly 
stronger for whichever stimulus was rated 
first. This finding is perhaps the most convinc- 
ing evidence that cognitive adjustments op- 
erated according to our hypothetical model. 

The present study was designed to pertain 
to a current social problem. The “attacker” 
and “victim” were not the neutral figures of 
the usual laboratory experiment, but they 
were figures who might elicit the subjects’ 
preexisting prejudices. In that context, the 
observers’ tendencies to derogate the victim 
(Lerner & Simmons, 1966) were found to be 
less likely when his enhancement might make 
a difference. It seems interesting to speculate
whether, in the nonlaboratory world, indi-
viduals who believe they have some power to
affect the fate of others (or “the fate of the
world”) are less prone to derogate unfortu-
nate victims than are individuals who prefer 
to accept the world as it is. Is it plausible 
that “activists,” by virtue of their feeling 
actively responsible for events, are less prone 
to derogate the impoverished or the victimized 
than are “passivists”? Do “activists” believe 
that compensation to victims is not only more 
necessary, but also more probable? 

The initial predictions of the present study
did not generalize too well to the ratings of
the attacking agent. At least two explanations
are reasonable. One is that all subjects (who
perceived an injustice) felt strong forces to 
condemn the policeman, but subjects in the 
no-consequence condition (whose ratings 
were for the experimenter alone) felt safer to 
express their negative feelings in their trait 
ratings. 

Another possible explanation is that if a
person perceives an unjust act, but is unable
to do anything about it, he may wish to
derogate all parties to the event. His response
thus would be “a plague on both your houses.”
If this tendency were operating here, then 
both the victim and the attacker would re- 
ceive lower ratings. If this interpretation were 
correct, then subjects in the earlier Lerner- 
Simmons study might have derogated the 
agent as well as the victim of the injustice if 
they had been given a chance. This idea de- 
serves future investigation. 

Does the present study have any implica- 
tions for the strategies of social activists con- 
cerned with the maintenance or restoration of 
justice in our society? Lerner’s studies have 
implied that pointing alone—that is, merely 
showing the existence of injustice—may de-
feat the very purpose sought by the pointer.
Instead of stirring observers’ sympathies, it
may encourage perceivers actually to devalue
the objects of the injustice. Under conditions
where observers are isolated from suffering
victims and where they anticipate no future
interaction, derogation is a likely psychologi-
cal mechanism (see Footnote 3). Other studies
have outlined conditions under which harm-
doers may be inclined to compensate rather
than derogate their victims (Berscheid &
Walster, 1969, pp. 20-26). The present study
suggests that when observers of harm are of-
fered the possibility of either derogation or
compensation, they prefer the latter if there
is a chance that it may ameliorate the injus-
tice.
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alternatives. 


bution of responsibility are discussed. 


Psychological theory and research dealing 
with attribution of responsibility in person 
perception have tended to ignore the poten- 
tial impact on the process of decision making 
Nof (a) the perceiver’s decision alternatives 
}}and (b) the magnitude of consequences of his 
decisions for the target person (see Jones & 
Davis, 1965; Maselli & Altrocchi, 1969, for 
literature reviews). The model of the attribu- 
tion process proposed by Jones and Davis 
(1965), which is based on the theory pro- 
posed by Heider (1958), is essentially an 
inference model that sees responsibility attri- 
#& bution as a process based on the logical 

processing of cues about the targets’ acts, his 
ability to commit those acts, and his knowl- 
edge about the consequences of those acts. 
The “personal knowledge” model of deCharms 
(1968) implies that the attribution process is 
basically a projection by the perceiver to the 
target of his own intentions if he were to com- 
mit similar behaviors. Both models, however, 
do not deal with the many instances in which 
the perceiver is forced to make a decision 
‘about a target person knowing in advance 
that the decision will result in certain out- 


1 This research was supported by Canada Council 
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offered criticisms of earlier drafts of this paper. 
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EFFECTS OF DECISION ALTERNATIVES ON THE VERDICTS 
AND SOCIAL PERCEPTIONS OF SIMULATED JURORS? 


NEIL VIDMAR ? 


3 University of Western Ontario 


Simulated jurors were asked to read a description of an attempted robbery 
and consequent killing of a store proprietor, 
verdict on the defendant’s guilt under one of seven conditions (plus a no- 
decision control group) which varied the number and severity of the decision 
A postdecision questionnaire assessed their perceptions of the 
defendant and the trial testimony. Although jurors having at least a “mod- 
erate” penalty option seldom chose a verdict of “not guilty” (on average of 
6% in six conditions), over half (54%) of the jurors faced with only a 
“severe” penalty option chose “not guilty.” Implications for theories on attri- 


They were asked to return a 


comes for the target. Consider some exam- 
ples. A factory foreman may be put in thé 
position of filing a “violation” report on a 
subordinate. Will he be more inclined to make 
a judgment of “willful” intent if the subordi- 
nate will only receive several days of suspen- 
sion as opposed to the instance where the sub-
ordinate’s employment will be terminated?
Consider another example. Will a student’s 
PhD committee be more inclined to vote to 
pass him on comprehensive exams if they know 
a fail will result in termination of doctoral 
study than if they know a fail will merely re- 
sult in the student having to rewrite all, or 
portions, of the exam? How will these deci- 
sion makers then rationalize their judgments? 
A final example, which originally gave rise 
to speculation on these issues, is concerned 
with the decision alternatives made available 
to jurors by either the law or the judge’s 
discretion in his instructions to the jury. 
During the Detroit riots of 1967, three 
young black men were killed in the now in- 
famous Algiers Motel incident (see Hersey, 
1968). In the summer of 1969, a white police- 
man involved in that incident was brought to 
trial for the murder of one of the men. After 
both prosecution and defense had presented 
their cases, the judge in charge of the trial 
gave the jury only two choices in returning 
their verdict: guilty of first-degree murder or 
not guilty. The former alternative carried a 
mandatory sentence of life imprisonment. 
“Middle ground” verdicts of second-degree 
murder or manslaughter were not allowed, al-
though their omission from consideration is
“unusual” (but legal) in Michigan murder
trials. A number of legal sources were reputed
to feel that because of conflicting testimony
on the part of the defense, the defendant
would likely have been found guilty if the
less severe alternatives had been available.
Other legal sources disagreed and felt the
proper charge had been given to the jury
(see Lundy, 1969).

Disregarding the particulars in the Algiers 
Motel trial, discussions of the jury system 
indicate that such controversy is not unusual. 
Many sources indicate that the judge has 
considerable influence on jury outcome (see 
Winick, 1961, for a review). Even though 
some states allow the judge neither comment 
nor summary on a trial, there is no way of 
knowing how much informal influence a judge 
can exert (Kalven & Zeisel, 1966). The im- 
pact of the severity of the threatened penalty 
has also been studied. Punishment may domi- 
nate jury deliberation despite instructions to 
disregard it (Kalven & Zeisel, 1966, Ch. 21). 
At the same time, a study of instances where 
the jury was also allowed to set the penalty 
as well as deciding guilt failed to show dif- 
ferences from instances where juries were not 
allowed to set the penalty (Kalven & Zeisel, 
1966, Ch. 21). 

The broader psychological problem arising 
from the above discussion, then, may be 
summarized as follows: given instances in 
which a perceiver must make an overt judg- 
ment, or decision, about a target person, what 
effect does the number of decision alternatives 
and consequences arising from the decision 
alternatives have on the type of decision that 
is reached? An examination of the literature 
suggests that no attention has been given
directly to this question. A second question 
asks, how does the perceiver rationalize his 
decision once it is made? There is some previ- 
ous work dealing with the processes of the 
perceiver after he has made a decision about, 
or has performed some behavior having con- 
sequences for, a target person (Berscheid & 
Walster, 1967; Lerner, 1965; Lerner & Mat- 
thews, 1967; Lerner & Simmons, 1966; Roth- 
bart, 1968; Walster & Prestholdt, 1966). 
Basically, this research supports the notions 
derived from dissonance theory (e.g., Fest-
inger, 1957) that subjects will rationalize
their perceptions of the target in such a way
as to make it consistent with their decision.
In a jury trial, for example, choice of a ver-
dict with severe consequences should result in
greater attribution of responsibility to the
target person than when a less severe or “not
guilty” verdict is chosen. Similarly, jurors
should give greater weight to evidence sup-
porting their decision.

The present research limits itself to simu-
lated juror decisions in an analogue of the Al-
giers Motel trial. Suppose the evidence is
such that given no restrictions a defendant
would generally be perceived to be guilty of
manslaughter or second-degree murder. If
jurors are restricted to a limited number of
guilt alternatives, will the number of ac-
quittals increase as the severity of the conse-
quences associated with the guilt alternative
increases? One might hypothesize that in the
instance of a choice between a guilty verdict
which carries a too severe penalty or a not
guilty verdict, jurors will say “the penalty is
too severe even though the defendant is
guilty, and hence I will judge him to be not
guilty.” On the other hand a juror might de-
cide: “the penalty is too severe, but that is
none of my business because the defendant is
guilty, and I must indicate his guilt.”


METHOD 
General Strategy and Design 


An abridged transcript of a murder trial was used 
as the task. It consisted of a neutral introduction 
to the facts of the case followed by the testimony of 
four witnesses. Because order of testimony might 
have a bearing on decision outcomes, all possible 
orders of evidence (hence, 24 orders of testimony)
were used within each decision-alternative cell of the
design. After reading the facts and testimony, the
subjects were told to assume that they were mem-
bers of a jury and to indicate their verdicts about
the defendant. Following this decision they were
asked a number of questions about the defendant,
the testimony of the witnesses, and so forth. The
experimental design had eight conditions regarding
the number and kind of decision alternatives avail-
able to the simulated juror: (1) guilty of first-
degree murder or not guilty; (2) guilty of second-
degree murder or not guilty; (3) guilty of man-
slaughter or not guilty; (4) guilty of first-degree
murder, second-degree murder, or not guilty; (5)
guilty of first-degree murder, manslaughter, or not
guilty; (6) guilty of second-degree murder, man-
slaughter, or not guilty; (7) guilty of first-degree
murder, second-degree murder, manslaughter, or not
guilty; (8) control condition where no verdict was
requested.


Subjects


The subjects were 227 students taking an introductory
course in social psychology at the University
of Western Ontario. The questionnaires were administered
during class time as a demonstration
experiment in social psychology. The questionnaires
were ordered so that the necessary 192 cells (8
decision-alternative conditions × 24 orders of testimony)
would be filled. In order to keep the number
of subjects and testimony orders equal in each cell, 
the responses of the additional 35 subjects were 
discarded from the analysis. Sex of subjects was not 
well controlled in the design and was to some extent 
influenced by class seating patterns. There were 74 
males and 118 females in the final analysis. Cell 
frequencies for males ranged from only 2 males in 
Conditions 1 and 3 to 17 in Condition 4. An analysis 
of variance considering sex as a variable could still 
be attempted although not under ideal circumstances 
since the F ratio would be based on unweighted 
means and hence would be only approximate. 
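
The cell arithmetic of the design can be checked with a short sketch. The witness labels below are shorthand for the four testimonies described in the Task section; the variable names are illustrative only and are not part of the original procedure.

```python
from itertools import permutations, product

# Shorthand labels for the four testimonies used in the task
witnesses = ("brother", "mother", "customer", "defendant")
conditions = range(1, 9)  # eight decision-alternative conditions

# Every possible ordering of the four testimonies
testimony_orders = list(permutations(witnesses))

# One questionnaire (one subject) per condition-by-order cell
cells = list(product(conditions, testimony_orders))

recruited = 227
discarded = recruited - len(cells)

print(len(testimony_orders), len(cells), discarded)  # 24 192 35
```

The 192 retained subjects match the 74 males and 118 females reported for the final analysis.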


Task 


The criminal case was modified from an abbrevi- 
ated description of a murder trial that actually took 
place in Canada. The evidence of the four witnesses
was modified to range from testimony indicating
first-degree murder, through “middle ground” guilt
of second-degree murder or manslaughter, to not
guilty. A pilot study (as well as the results of the
final experiment) indicated that, given no restrictions,
the overwhelming majority of judges would choose
between manslaughter and second-degree murder.
Middle-ground guilt was necessary to test the effects
of giving only an alternative with a perceived “too
severe” punishment. Subjects were given the following
introductory instructions:


We are interested in studying the manner in
which people judge various criminal offenses. The 
following pages give an account of a criminal 
offense. Please read the case carefully because 
when you have finished reading the case account, 
you will be asked to give your personal opinions 
concerning the case. 


The general introductory facts and testimony in
the case were presented in a summary brief as follows:


Case of Charles Young, accused of murder in the
death of Yoshyuki Ono. On December 10, 1966,
the defendant, Charles Young, entered the shop
kept by the father of the victim at 502 West 4th
Ave., Vancouver, for the purpose of robbery. The
victim, 24 years old, Yoshyuki Ono, lived in the
rear of the shop with his father and mother and
[...]-year-old brother. The living area consisted of a
sitting room, kitchen, and bedrooms. The sitting
room was separated from the rest of the shop by
what is called a half-door with curtains on the
store side extending to the floor. A bell would
ring in the living area when the street door of the
shop was opened.

On the occasion of the incident, Mr. and Mrs.
Ono were sitting in the sitting room with the
deceased son and his brother. The store bell having
rung, the father left the room for some reason and
the mother entered the shop. She saw that Young
was carrying a revolver and uttered some expression
which gave a warning to the others in the
sitting room that a holdup was in progress. They
had formerly gone through a similar experience,
and the expression was understood. At about this
point, a customer, Steve Wilson, entered the
store. He quickly saw that a robbery was in
progress and dove for a counter on the opposite
side of the store. From this point on, the events
and sequence of events are unclear. A struggle
ensued between Young and the victim, Yoshyuki
Ono. Three shots were fired from Young’s revolver.
One bullet entered Yoshyuki’s hand, another
entered his arm, and the third entered his
head, causing a fatal wound. Young fled the shop
but was apprehended a short time later.

In the murder trial the question of the attempted
robbery is undisputed. However, the trial
testimonies of Young, Mrs. Ono, Yoshyuki’s
brother, and the customer, Steve Wilson, differ.


The testimony of the witnesses, ordered in de- 
creasing degree of guilt, was as follows: 

Testimony of victim’s brother. The brother states
that he and Yoshyuki were sitting in the sitting
room when they heard their mother’s warning. Both
men immediately jumped up and started toward the
door to the shop. Yoshyuki was nearer to the door
when two shots rang out, wounding him in the hand
and arm. Yoshyuki continued his rush into the store
and began to struggle with Young in order to get
the gun away from him. In the struggle, Yoshyuki
“had Young’s wrist, raising it up in the air.” Young 
managed to break away from Yoshyuki and rushed 
toward the street door. Just as he was opening the 
door, “he turned toward my brother who was just 
standing there. He aimed the gun and shot my 
brother in the head.” Young then rushed out into 
the street. 

Testimony of victim’s mother. Mrs. Ono says that
as soon as she signaled her family about the robbery,
Young went immediately toward the curtains of the
door leading into the sitting room, and, as he approached,
he fired a shot which passed through the
wooden partition at the side of the doorway. When
he got to the curtains, he fired another shot through
the curtains. The first shot wounded Yoshyuki in
the hand and the second in the arm. Yoshyuki immediately
came into the store and grappled with
Young in an effort to disarm him. Mrs. Ono says
that in the struggle Yoshyuki was attempting to
take the pistol from Young “but could not reach
because he was quite high [tall].” She added:
“Young was holding the gun, and my son grabbed
his wrist.” Defense question: “Does she mean that
Young was holding the gun in his hand?” Answer:
“Yes, and my son was doing his best, and trying to
bring it down, but he quite weak because he injured
already.” She continues that while they were struggling,
she heard a shot and afterwards did not hear
anything. After the shot, her son fell down, and
Young ran off.

Testimony of Steve Wilson, customer. Wilson 
testified that just as soon as he had gotten inside the 
store, Mrs. Ono cried out, and he then saw the rob- 
ber’s gun. Since he had not been noticed by the 
gunman, Wilson quickly decided to take cover. 
While he was scrambling to hide behind a wooden 
counter, he heard two shots but did not see any- 
thing. When he looked out from the side of the 
counter, Young and Yoshyuki “were struggling.” 
Question: “What were the positions of the two men
relative to the door?” Answer: “Mr. Ono (Yoshyuki)
was between Young and the door.” Question:
“Would you say that Ono was attempting to stop 
Young’s escape?” Answer: “I don’t know.” Wilson 
said that the struggling men moved out of his field 
of vision. He heard a third shot. After “a few 
moments,” he looked up and saw Yoshyuki on the 
floor. 

Testimony of Defendant Young. Young maintains 
that when he entered the store the revolver was in 
his pocket. Mrs. Ono entered the shop from the rear 
area and asked if she could help him. When he 
pulled the revolver from his pocket “the old lady 
started screaming.” Young asserts that the commotion
unnerved him, he hesitated, and then decided to
run. But before he could make his move, Yoshyuki
appeared from nowhere and began to struggle with
him. Young managed to push Yoshyuki away from
him but in the process was off balance, and, as he
fell against a counter, a combination of the impact
and the panic caused him to fire the revolver twice.
Yoshyuki then reengaged in the struggle, grabbing
at Young’s wrist when the gun went off accidentally,
inflicting the fatal wound. Young then ran off.
Under defense cross-examination, the arresting officer
corroborated the fact that this was what Young
told him when first arrested: “He told me he was
in a jam.” Question: “What kind of a jam?” “He
told me he took some store and said that ‘the Jap
came for me, and the gun went off.’ ”


Decision-Alternative Conditions 


After reading the testimony, the subjects were given
the following instructions:


Assume you are a member of the jury. We are
interested in your individual decision about the
defendant, Charles Young. The judge gives the
following instructions to the jury before sending
it to the jury room to make its decision: There
are four options (or three options or two options,
depending on the condition) in bringing forth your
verdict about the defendant. (a) He is guilty of
first-degree murder (sentence required by law is
25 years to life imprisonment). (b) He is guilty
of second-degree murder (sentence required by
law is 5-20 years imprisonment). (c) He is guilty
of manslaughter (sentence required by law is 1-5
years imprisonment). (d) He is not guilty.


Postdecision Questions 


Following their verdict, the subjects were to indicate
their feelings about the case on a series of
7-point semantic-differential-type scales. Question 1:
Young is guilty (7), not guilty (1). Question 2:
Young should be treated harshly (7), leniently (1).
Question 3: Describe Young on each of the following
scales: (a) violent-not violent; (b) good-bad; (c)
unlucky-lucky; (d) reckless-cautious; (e) not
responsible-responsible; (f) likable-unlikable.
Question 4 asked the subject to describe the victim on the
following adjective scale pairs: (a) good-bad; (b)
unlucky-lucky; (c) reckless-cautious; (d) not
responsible-responsible; (e) likable-unlikable.
Questions 5, 6, 7, and 8 asked the subjects to describe the
testimony of the victim’s mother, the defendant
Young, the victim’s brother, and the customer
Wilson, respectively, on each of the following
scales: (a) probable-improbable; (b) bad-good; (c)
irrational-rational; (d) accurate-inaccurate; (e)
unbiased-biased. The favorable adjective was always
anchored with the scale point of 7, while the
unfavorable adjective was anchored with the 1 scale
point.
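
As a quick check on the questionnaire layout, the number of rating scales can be tallied from the question descriptions above. This is only a bookkeeping sketch; the dictionary keys are informal labels, not wording from the questionnaire.

```python
# Number of rating scales contributed by each postdecision question
scales_per_question = {
    "Q1 guilt": 1,
    "Q2 harshness": 1,
    "Q3 defendant traits (a-f)": 6,
    "Q4 victim traits (a-e)": 5,
    "Q5-Q8 testimony ratings (4 witnesses x 5 scales)": 4 * 5,
}

total_scales = sum(scales_per_question.values())
print(total_scales)  # 33
```

The total agrees with the 33 social perception scales analyzed in the Results section.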


RESULTS 


A preliminary analysis indicated that there 
were no significant effects due to order of 
testimony. Preliminary analysis also showed 
only three marginally significant sex effects 
out of 34 tests. Since we would expect ap- 
proximately 2 out of 34 tests to yield “sig- 
nificant” results by chance, and since those 
three sex effects did not contribute to the 
understanding of the data, the remainder of 
the analyses disregarded sex as a variable. 
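
The chance expectation mentioned above can be illustrated with a small calculation. This is a sketch under two assumptions not stated in the text: that each of the 34 tests was evaluated at the .05 level, and that the tests can be treated as independent (a simplification, since the scales were rated by the same subjects).

```python
from math import comb

n_tests = 34
alpha = 0.05  # assumed significance level; the text does not state it

# Expected number of "significant" results if no true effects exist
expected_false_positives = n_tests * alpha


def prob_at_least(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return 1 - sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))


# Probability of three or more chance "hits" among the 34 tests
p_three_or_more = prob_at_least(3, n_tests, alpha)
print(expected_false_positives, p_three_or_more)
```

Under these assumptions the expected count is about 1.7, which is the "approximately 2 out of 34" cited in the text.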


Decision Alternatives and Verdicts 


Table 1 shows, by condition, the frequency
with which each alternative was chosen. It is
quickly apparent that far more (54%) not
guilty verdicts were returned in Condition 1,
which forced the subjects to choose between a
first-degree murder verdict or not guilty, than
in any other condition. To test the statistical
significance of this finding, the frequencies
were treated as either a not guilty or guilty
“to some degree” (hence, Condition 7 had 22
guilty and 2 not guilty verdicts). To test the
controversial idea suggested by the Algiers
Motel trial, Condition 1 was compared to
Condition 7, which had all four decision-alternative
possibilities. The two conditions were
significantly different at the .001 level (χ² =
11.7, df = 1). Some critics might (appropriately)
suggest that the differences between
Conditions 1 and 7 may have been due to the
greater number of decision alternatives, and
thus the opportunity to choose some verdict
other than not guilty in Condition 7, rather
than the greater severity of the guilty option
in Condition 1. To test this interpretation,
Conditions 2 and 3 were combined and compared
to Condition 1. The former conditions
were like Condition 1 in that there were only
two alternatives, but they differed in that a
guilty verdict would result in a less severe
penalty for the defendant. Condition 1 was
again significantly different (χ² = 14.3, df =
1, p < .001). Conditions 4, 5, and 6, which
had two guilt options, yielded only one not
guilty verdict out of 72 subjects. It may be
concluded, then, that under conditions of restricted
decision alternatives, the more severe
the degree of guilt associated with the least
severe guilt alternative, the greater were the
chances of obtaining a not guilty verdict.


TABLE 1
DECISION ALTERNATIVES AND FREQUENCY OF VERDICTS

                                          Condition
Alternative       1         2         3         4         5         6         7         8ᵃ
First degree      11 (46%)  -         -         2 (8%)    7 (29%)   -         2 (8%)    -
Second degree     -         20 (83%)  -         22 (92%)  -         2 (8%)    15 (63%)  -
Manslaughter      -         -         22 (92%)  -         16 (67%)  22 (92%)  5 (21%)   -
Not guilty        13 (54%)  4 (17%)   2 (8%)    0 (0%)    1 (4%)    0 (0%)    2 (8%)    -
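
The two chi-square values reported for the verdict comparisons can be reproduced from the verdict frequencies. The sketch below uses the ordinary Pearson chi-square for a 2 × 2 table without a continuity correction; that choice is an assumption, made because it reproduces the reported values. The Condition 2 counts (4 not guilty, 20 guilty) are inferred from the 17% not guilty figure and the 24 subjects per condition. The function name is illustrative.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Rows are groups; columns are (not guilty, guilty "to some degree").
# Condition 1 (13, 11) versus Condition 7 (2, 22)
c1_vs_c7 = chi_square_2x2(13, 11, 2, 22)

# Condition 1 (13, 11) versus Conditions 2 and 3 combined (4 + 2, 20 + 22)
c1_vs_c23 = chi_square_2x2(13, 11, 6, 42)

print(round(c1_vs_c7, 1), round(c1_vs_c23, 1))  # 11.7 14.3
```

Both values exceed 10.83, the critical value of chi-square at the .001 level with 1 degree of freedom.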


Note (Table 1).—Blank cells indicate that that decision alternative was not available in the condition. There were 24 subjects in each
condition. The percentage of verdicts within each cell within conditions is indicated in parentheses.
ᵃ Condition 8 was a control for the social perception question; no verdicts were requested.


Postverdict Social Perceptions


Once an individual has chosen a decision
alternative, how does he rationalize his decision?
Dissonance theory and other consistency
formulations would predict that those individuals
choosing a guilty alternative should
report the defendant to be more guilty and
indicate that he should be treated more
harshly than subjects choosing a not guilty
alternative. Subjects choosing guilty should
also downgrade testimony that is favorable to
the defendant. Hence, the perceptions of the
13 subjects in Condition 1 who chose not
guilty and those of the 11 who chose guilty
were compared with the perceptions of the
subjects in Condition 8. These latter subjects
were not asked to pass a verdict on the defendant,
and thus served as a control group
whose social perceptions should fall between
those of the guilty and not guilty decision makers in
Condition 1.

A one-way analysis of variance with three 
decision levels (guilty, Condition 1; not 
guilty, Condition 1; no decision, Condition 8) 
was conducted on each of the 33 social per- 
ception scales. Although only 7 scales yielded 
statistically significant differences, the data for 
6 of these scales yield fairly strong support for 
the hypothesis. Table 2 shows that subjects 
choosing not guilty saw the defendant as being 
less guilty, as deserving less harsh treatment, 
and as being less bad and more likable than 
subjects choosing a guilty verdict. Not guilty 
subjects also saw the victim’s mother’s and 
brother’s testimony as being more biased than 
subjects choosing a guilty verdict. The means 
for the last scale in Table 2 suggest the 
possibility that the testimony of Steve Wilson, 
the customer, may have been perceived as 
ambiguous. Both decision makers who chose
guilty and those who chose not guilty saw
his testimony as less probable than the “no
decision” subjects, and, in fact, the not guilty
choosers saw it as less probable than the
guilty decision makers.


TABLE 2
MEANS AND F RATIOS OF SOCIAL PERCEPTION ITEMS
SHOWING SIGNIFICANT DIFFERENCES BETWEEN
CONDITION 1 (NOT GUILTY), CONDITION 1
(GUILTY), AND CONDITION 8
(NO-DECISION) SUBJECTS

                                                   Condition
Items/subjects                         Not guilty   Guilty   No decision   F ratio
                                          (1)         (1)        (8)
1. Young is guilty (7)                    3.69        5.82       5.83       12.85**
2. Young should be treated harshly (7)    4.00        5.45       4.79        5.08*
3. Young is good (7)                      3.38        2.64       2.33        4.30*
4. Young is likable (7)                   3.25        1.82       3.08        4.99*
5. Mother’s testimony biased (1)          3.46        4.75       4.45        4.07*
6. Brother’s testimony biased (1)         1.79        2.73       2.15        3.26*
7. Customer’s testimony improbable (1)    5.31        5.64       6.17        3.36*

Note.—Only one adjective of the bipolar scales is listed; the
parentheses indicate the number with which that adjective was
anchored.


Discussion 


Discussion of the study first centers around 
the limitations of the specific experiment and 
then suggests some implications for the 
broader problem of attribution of responsibil- 
ity when the perceiver’s decision has direct 
consequences for the target person. 

The present data indicate that restricting
the decision alternatives available to jurors,
especially when the guilty alternative has a
consequence which is perceived to be too
severe, may increase the likelihood of obtaining
a not guilty verdict. We should note, however,
that the usual qualifications of laboratory
experiments should be applied before
generalizing to actual jury decisions as in the
Algiers Motel trial. First, the subjects knew
that the task and the consequences for the
defendant were hypothetical. Subjects’ decisions
might have been quite different if they
were faced with a “real world” situation.
Second, the subjects were the overstudied
“college sophomores,” who probably differ
from the populations typically chosen for jury
duty (e.g., in age, education, and personality).
It is worth noting, however, that
three additional experiments containing the
most essential cells of the present study
(Conditions 1, 7, and 8) have been conducted as
part of classroom demonstrations on populations
composed largely of primary and secondary
school teachers. The results replicated
the findings of the present study. Third, only
a single task was used. The experimental results
may be due to some variables (or set of
variables) associated with this specific task.3
Finally, it should be noted that the degree of
guilt and the severity of punishment were
confounded in the experiment. Different results
may have been obtained if the subjects had
been asked to make verdicts on the degree of
intent (first-degree murder, second-degree
murder, manslaughter) without considering
the “sentence required by law” or vice versa.
Additional research is needed to clarify the
relative impact of the variables of degree of
intent and severity of punishment.

The data on the postverdict social perceptions
are also subject to a qualified interpretation.
Supporting a dissonance theory explanation,
those Condition 1 subjects who
chose not guilty tended to see the defendant
in a better light and adverse testimony in a
worse light than the subjects who chose a
guilty verdict. But it is possible that instead
of reflecting a postdecision rationalizing
mechanism, the data reflect a predecision
tendency to see guilt or no guilt. That is,
after reading the facts and testimony, some
subjects were inclined to judge more harshly
and actually found little conflict in choosing
a first-degree murder verdict. Conversely, the
jurors choosing not guilty may have been
inclined in that direction prior to their decision.
The use of the no-decision control group
(Condition 8) did not alleviate these problems
of interpretation, in that the mean social
perception scores of those 24 subjects may
have reflected the mean of both “guilty of
first-degree murder” and not guilty tendencies.
In retrospect, Condition 8 subjects
should have been asked to make verdicts after
they had marked the social perception questions.
We could then have correlated the responses
to the questions with verdict outcomes
and tested the possibility that social perceptions
influence verdicts, instead of the opposing
hypothesis that verdicts influence
social perceptions. The author nevertheless
prefers the latter interpretation on a priori
theoretical grounds until future experiments
control for the alternative interpretation in
their designs.

3 In somewhat parallel research, Kalman J. Kaplan
and Roger I. Simon (personal communication,
June 1971) used a different task and varied the race
of the victim as well. These data, obtained from college
student subjects, also yielded different proportions
of not guilty verdicts, depending on the number
and severity of decision alternatives. Thus, there
is some evidence that the present results are not due
to a specific task.

As indicated in the introduction, the present 
research points out some variables that have 
been ignored in theories of responsibility at- 
tribution. In their classic paper on the effects 
of interaction goals in person perception, 
Jones and Thibaut (1958) suggested that there 
are three basic types of perceiver sets, de- 
pending on the goal of the perceiver-target 
interaction. With a value-maintenance set, the 
attribution to the target person’s motives in- 
volves relevance to the personal goals of the 
perceiver. Perceivers with a causal-genetic set 
are interested in the underlying social, physi- 
cal, and biological conditions that give rise 
to the behavior. With a situation-matching 
set, the causal locus of behavior is more or 
less irrelevant to the inference process, but 
rather involves comparing the target person’s 
behavior with regard to generalized norms 
that are perceived as relevant to the situa- 
tion. The role of juror is presumably one of 
the purest examples of this latter set (Jones & 
Thibaut, 1958). The perceiver, therefore,
should


behave as an agent of social control in dispensing
social rewards and punishments or in “passing judgment”
on the appropriateness of another’s behavior.
The actor’s main goal is to establish whether or not
the other person is behaving appropriately, whether
the person’s behavior “matches” the norms which
apply in the situation [Jones & Thibaut, 1958, p. 167].


The present experiment suggests that the
severity of consequences associated with available
decision alternatives may be a potent
factor in juror decisions. These decisions are more
likely related to the subject’s personal standards
of appropriate retribution than to
legally prescribed standards. Hence, it seems
useful to elaborate the situation-matching set
to discriminate between the “legal” norms
which are formally supposed to be applied in
the setting and the “moral” norms which the
perceiver may apply instead. Whether in the
laboratory or in the “real world,” it may be
very difficult, or even impossible, to enforce
a legal norm as the perceiver’s sole basis of
inference about a target person. The dynamics
of the situation-matching set are perhaps
more complex than Jones and Thibaut had
considered.4

The experiment also raises questions about 
the adequacy of models dealing with the 
process of attribution. Neither the Heider-based
inference model of Jones and Davis
(1965) nor the personal knowledge model of
deCharms (1968) has considered the impact
of the number and severity of decision alternatives
on the decision processes of the perceiver.
The present data suggest that these
variables may have considerable impact on
the way information is processed. Responsibility
attribution is not solely a function of
logical inference and/or projecting input from 
knowledge of what one’s own intentions would 
have been in a similar situation. 

Finally, this experiment dealt with a jury 
setting where the consequences of the per- 
ceiver’s decision have a negative impact on 
the target. One can also conceive of instances 
in which the perceiver’s decision would result 
only in positive outcomes for the target (e.g., 
award committees) or in positive outcomes 
for the target if one decision is made and 
negative outcomes if another decision is made. 
Are the dynamics behind responsibility at- 
tribution in these latter examples different 
from the instances where judgments can result 
only in negative consequences for the target 
person? In short, the present study implies
some important and fruitful avenues of research.
There are obviously a large number
of real-world instances in which responsibility
attribution judgments are made under conditions
of restricted decision alternatives.
Furthermore, some of these instances (e.g.,
juror conditions) are in areas with important
social implications.

4 In an experiment designed to test the impact of
various inferential sets on person perception, Jones
and deCharms (1958) hypothesized that instead of
the norms that had been experimentally specified in
the situation-matching set, the subjects may have
had their own ideas about appropriate norms. Those
authors, then, were also aware of the possibility of
more than one set of norms and the consequent
difficulty in experimentally manipulating a specific
set of norms.


EFFECTS OF SALIENCE OF STIMULUS INCONSISTENCY
ON IMPRESSION FORMATION


CLYDE HENDRICK 1 
Kent State University 


Previous experiments on impression formation have found few effects on 
ratings due to trait inconsistency. Typical procedures tend to make incon- 
sistency nonsalient for subjects. In the present experiment both the magnitude 
and salience of trait inconsistency were manipulated. Subjects rated their 
liking for persons described by sets of 6 traits (3 high desirable and 3 low
desirable). The high-desirable traits were consistent with the low-desirable
traits in half the sets and inconsistent in the other half. Salience was manipulated
by having some of the subjects rate the probability of association between
high-desirable and low-desirable traits just before making their liking ratings,
and by having other subjects make only the traditional liking ratings. As
predicted, when ratings were made in the traditional manner, consistent and
inconsistent stimulus persons were equally well-liked, but when salience was
high, inconsistent stimulus persons were relatively disliked.


The present study is concerned with the
effects of trait consistency-inconsistency on
impression judgments. Much of the research
on impression formation uses descriptive information
that contains a high degree of inconsistency.
How subjects cope with such
inconsistency and how it affects their impression
ratings have not received much attention.
Anderson (1968) concluded that there is little
evidence “on how people do resolve such inconsistencies,
at least in the personality impression
task [p. 742].” A review of the
literature indicates that this conclusion is still
valid. In many experiments, subjects either
(a) seemed minimally aware of inconsistency
(i.e., the inconsistency was nonsalient) or
(b) if aware, encountered some aspect of the
procedures that made the inconsistency seem
reasonable and not particularly bothersome.

1 Requests for reprints should be sent to Clyde
Hendrick, Department of Psychology, Kent State
University, Kent, Ohio 44242.

Earlier work on primacy-recency effects
(Asch, 1946; Luchins, 1957), as well as more
recent evidence (Hendrick & Costantini,
1970), indicates that with serial presentation,
inconsistency has few effects on impressions,
or at least subjects are unable to verbalize
such effects. Luchins (1957, pp. 46 and 53)
found that the majority of his subjects either
did not notice or were not bothered by inconsistency.
Asch (1946, Experiment VIII)
reported that only 2 of 24 subjects perceived
any contradiction between traits when a set
of inconsistent traits was read to them. A
number of studies using simultaneous presentation
of stimuli found no effects on ratings
due to inconsistency (Gollob, 1968; Hendrick, 
1968; Himmelfarb & Senn, 1969; Rosenbaum 
& Levin, 1969). Dustin and Baldwin (1966) 
proposed a model which assumed that trait 
consistency does have an effect, and some 
mildly corroborative evidence has been ob- 
tained (Hendrick, 1969; Schmidt, 1969). 
Also, for pairs of adjectives, Wyer (1970) 
obtained a complex interaction between degree 
of inconsistency and favorableness level of the 
adjectives. A perusal of the procedure used in 
the impression experiments which used simul- 
taneous presentation indicated that subjects 
were often told that all traits in a set were 
equally accurate and important, and that each 
trait was contributed by a different individual 
who knew the person well. In general, these 
instructions were intended to minimize redun- 
dancy or inconsistency effects. An exception 
was a study by Anderson and Jacobson 
(1965), who found that impression ratings 
of inconsistent sets did vary as a function of 
instructions concerning the relative importance
of adjectives in a set. The more common
instructions of equally important traits
contributed by different persons may create
a perception of situational variability rather
than the perception of person inconsistency.
Situational variability of behavior is quite
common, and if subjects can and do make
this attribution, they avoid having to deal
directly with the possibility of character
inconsistency localized within the person.

The present research was predicated on the
assumption that trait inconsistency will affect
impression ratings when (a) the inconsistency
is salient for subjects and (b) no easy
alternative exists for rationalizing away the
inconsistency (e.g., situational attributions).
It was assumed that inconsistency perceived 
as intrinsic to the character of a person is 
unpleasant, and that such persons will be 
relatively disliked. Therefore, it was predicted 
that when the consistency dimension was 
salient, persons possessing intrinsically incon- 
sistent traits would be evaluated more un- 
favorably than persons possessing consistent 
traits. When the consistency dimension was 
not salient, no differences in likability between 
consistent and inconsistent stimulus persons 
were expected. 

Subjects rated several persons who were 
each described by a set of six traits. Each set
contained three high-desirable and three low-desirable
traits. Set construction was such
that the high-desirable and low-desirable
traits were semantically consistent in half of
the sets and highly inconsistent in the other
half. Salience of consistency-inconsistency was
manipulated at a low level by having some
subjects make only traditional liking ratings, 
and at a high level by having other subjects 
first focus on and rate the likelihood of associ- 
ation between high-desirable and low-desirable 
traits just prior to making their liking ratings. 
The probability rating procedure should have 
(a) made the inconsistency in a set quite 
salient for subjects, and (b) in conjunction 
with the immediately following liking rating, 
may have committed them to perceiving the 
inconsistency as localized within the person. 
If so, then persons described by inconsistent 
traits should be rated more unfavorably in 
the salient condition than persons described 
by consistent traits. 


METHOD 


Stimulus Sets and Design 


The stimuli consisted of 16 sets of six traits used 
previously in a primacy study (Hendrick & 
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Costantini, 1970, p. 161). Each set consisted of thi 

high-desirable traits homogeneous in meaning (eg 
energetic, vigorous, resourceful), and three low-desir- 
able traits (e.g., withdrawn, silent, helpless). The sets 
were constructed so that a triplet of high-desirable” 
(H) traits was either consistent or inconsistent in 
meaning with a triplet of low-desirable (L) traits 
according to the following 2 X 2 design: 


Hl: (4) Hs (—) 
Heli G HiLo (+), 


where the subscripts indicate different subsets of 
adjective triplets, and (+) and (−) indicate a
consistent and inconsistent relation, respectively, be- 
tween high-desirable and low-desirable triplets. The 
interaction term in the design allows a direct evalua- 
tion of consistency versus inconsistency, while at the 
same time providing a control for specific traits used 
in forming sets.

The four sets of six traits formed in the above 
manner were considered as one stimulus replication, 
and four such replications were used with different 
traits in each replication. Each set was presented in 
both a high-desirable to low-desirable and a low- 
desirable to high-desirable order, so that altogether, 
ratings were obtained for 32 sets. 
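
The count of 32 rated sets can be checked by enumerating the design. The sketch below is illustrative only: the labels are hypothetical, and it assumes that triplets with matching subscripts form the consistent (+) pairings and mismatched subscripts the inconsistent (-) pairings.

```python
from itertools import product

REPLICATIONS = range(1, 5)        # four stimulus replications
H_TRIPLETS = ("H1", "H2")         # two high-desirable triplets per replication
L_TRIPLETS = ("L1", "L2")         # two low-desirable triplets per replication
ORDERS = ("H-to-L", "L-to-H")     # both presentation orders

def relation(high, low):
    """Assumed coding: matching subscripts are consistent (+), otherwise inconsistent (-)."""
    return "+" if high[1] == low[1] else "-"

def build_sets():
    """Enumerate every rated set as replication x H triplet x L triplet x order."""
    return [
        {"replication": r, "high": h, "low": l, "order": o, "relation": relation(h, l)}
        for r, h, l, o in product(REPLICATIONS, H_TRIPLETS, L_TRIPLETS, ORDERS)
    ]

sets = build_sets()
n_consistent = sum(s["relation"] == "+" for s in sets)
print(len(sets), n_consistent)  # 32 16
```

Half of the enumerated sets are consistent and half inconsistent, which is what allows the interaction term to isolate consistency from the specific traits used.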


Salience of Consistency-Inconsistency 


Salience of the consistency variable was manipu- 
lated by creating two versions of rating booklets 
which were given to different groups of subjects. In 
the salient condition, three traits (e.g., H1, H2, H3)
were attributed to a person, and subjects rated on
an 11-point scale the probability that the person
could possess three additional traits (e.g., L1, L2, L3).
Immediately following, the subjects were asked to 
assume that the person in fact possessed all six 
traits, and to rate on an 11-point scale how much 
they would like the person. In the nonsalient condi-
tion the probability ratings were not made. Only
the adjective set, the question pertaining to liking,
and the liking rating scale were included. 

The rating booklets were constructed with two 
adjective sets per page, each drawn from a different 
stimulus replication, and both sets were either con- 
sistent or inconsistent. The pages were randomly 
ordered for each booklet. A cover page gave rating 
instructions appropriate to the particular salience 
condition. Subjects were told that a given trait
might be used in different combinations to describe
different persons. Trait consistency was not men- 
tioned in the instructions, nor was anything said 
concerning the relative importance of the traits or 
where the traits were obtained.


Subjects and Procedure 


The subjects were college students enrolled in an 
introductory psychology course. Data were collected 
in groups separately for the salient and nonsalient
conditions. Forty-four subjects (17 males, 27 fe-
males) served in the salient condition, and 38 sub-
jects (17 males, 21 females) served in the nonsalient
condition. The experimenter explained the general
nature of impression formation to the subjects and
distributed the rating booklets. Subjects were told to
make the ratings on the basis of their impressions.


TABLE 1
MEAN LIKING RATINGS

                    Salient condition               Nonsalient condition
Stimulus
replication    HL (+)   HL (−)   Difference    HL (+)   HL (−)   Difference
     1          5.81     4.47       1.34        5.76     5.72        .04
     2          4.69     4.18        .51        5.07     5.19       −.12
     3          5.62     4.40       1.22        5.34     5.15        .19
     4          3.95     3.33        .62        3.95     4.04       −.09
   Mean         5.02     4.10        .92        5.03     5.02        .01


RESULTS 


The mean probability ratings of the 44 sub- 
jects in the salient condition were analyzed 
to check on the adequacy of stimulus set con-
struction. The mean rating was .68 for the high- and
low-desirable (+) sets and .28 for
the high- and low-desirable (−) sets (F =
159.80, p < .00001). Separate analyses indi- 
cated that the difference was significant within 
each stimulus replication as well. Thus, sub- 
jects perceived the stimulus sets appropri- 
ately. 

The mean liking ratings are shown in Table 
1, and a summary of the analysis of variance 
is shown in Table 2. The high- to low-desir- 
able and low- to high-desirable orders of 
presentation were combined since preliminary 
analyses indicated no differences due to order. 
The means in each column of Table 1 are 
averages of the two sets of each type (con- 
sistent or inconsistent) in the stimulus repli- 
cation. 

For the analysis shown in Table 2, replica- 
tions was a within-subject factor, and each 
of the 2 X 2 adjective-set designs (see Method 
section) was considered as nested under rep- 
lications. The significant effect of replications 
was due to the relatively low ratings for Rep- 
lication 4. The main effects for low-desirable 
triplets and high-desirable triplets are unim- 
portant. These effects simply indicate that 
within stimulus replications, one low-desirable 
(or high-desirable) triplet affected ratings dif- 
ferently than the other low-desirable (or high- 
desirable) triplet. 


As shown in the Method section, the inter-
action effect (Low-Desirable X High-Desir-
able Triplets) yields information about con-
sistency versus inconsistency. Since a differ-
ential effect was expected for salient and non-
salient conditions, the critical effect was the
Salience versus Nonsalience X Low-Desirable
Triplets X High-Desirable Triplets interac-
tion. This interaction was significant (F =
10.89, df = 4/320, p < .0005). The means
shown in Table 1 indicate the nature of this 
effect. The net difference in ratings between 
high- and low-desirable (+) and high- and 
low-desirable (−) sets was .92 in the salient
condition but only .01 in the nonsalient condi- 
tion. 
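
As a rough check, the net differences can be recomputed from the replication means reported in Table 1. This is only a sketch using the rounded table entries, so the results agree with the published values to within rounding; the variable and function names are illustrative.

```python
# Mean liking ratings per stimulus replication, taken from Table 1.
SALIENT = {
    "consistent": [5.81, 4.69, 5.62, 3.95],
    "inconsistent": [4.47, 4.18, 4.40, 3.33],
}
NONSALIENT = {
    "consistent": [5.76, 5.07, 5.34, 3.95],
    "inconsistent": [5.72, 5.19, 5.15, 4.04],
}

def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)

def net_difference(condition):
    """Mean liking for consistent (+) sets minus mean liking for inconsistent (-) sets."""
    return mean(condition["consistent"]) - mean(condition["inconsistent"])

salient_diff = net_difference(SALIENT)        # close to the reported .92
nonsalient_diff = net_difference(NONSALIENT)  # close to the reported .01
print(f"salient: {salient_diff:.3f}, nonsalient: {nonsalient_diff:.3f}")
```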

Inspection of the column means indicates 
that evaluations of the high- and low-desirable 
(+) sets were virtually identical in both sali- 
ent (5.02) and nonsalient (5.03) conditions. 
The mean rating of the high- and low-desirable
(−) sets in the nonsalient condition was
5.02, but this mean was only 4.10 in the sali-
ent condition. This last difference was highly
significant by the Duncan test (p < .01).
Thus, the major effect of making inconsis-
tency salient was reduced liking for persons
described by inconsistent traits.


TABLE 2
ANALYSIS OF VARIANCE OF LIKING RATINGS

Source                               df       MS        F
Between subjects
  Salience vs. nonsalience (A)        1      72.85     2.86
  Error                              80      25.52
Within subjects
  Replications (B)                    3     160.91    50.01*
  A X B                               3       3.84     1.19
  Error                             240       3.22
  Low-desirable triplets (C)          4      37.00    11.15*
  A X C                               4       5.45     1.64
  Error                             320       3.32
  High-desirable triplets (D)         4      10.11     5.36*
  A X D                               4       1.67      .88
  Error                             320       1.89
  C X D                               4      25.13    14.79*
  A X C X D                           4      18.51    10.89*
  Error                             320       1.70

* p < .0005.


DISCUSSION


The results provided strong support for the 
predictions. Based on previous research, no 
differences between consistent and inconsis- 
tent sets were expected when the consistency 
variable was not salient. When this variable 
was salient, as predicted, persons described 
by inconsistent traits were disliked more than 
persons described by consistent traits. This 
prediction was based on the assumption that 
inconsistency intrinsic to a person’s character 
traits is unpleasant, and disliking him is a 
consequence of the unpleasantness. 

The manipulation of salience presumably 
did two things. Rating the probability of trait 
association prior to rating liking certainly 
should have created strong awareness of the 
magnitude of inconsistency in a set of traits. 
Second, the active assessment of trait proba- 
bilities by the subjects may have committed 
them to viewing a certain degree of incon- 
sistency as intrinsic to the stimulus person, 
thus making other attributions difficult (e.g., 
inconsistency due to situational variability). 
The present data do not indicate whether the 
relative dislike of inconsistent persons was 
due to salience per se, or to a commitment 
effect of making the probability ratings, or 
some combination of both. It would be desir-
able in future work to vary independently
both salience and ease with which different 
kinds of attributions can be made. It may be 
that inconsistency due to varying situational 
pressures is not perceived as particularly 
bothersome even though it is highly salient. 
Such inconsistency may create dislike only 
when it is intrinsically a part of the person. 

The results of the present study provide a 
complication for models of impression forma- 
tion. In the nonsalient condition, no term in- 
volving inconsistency would be needed in a 
model. However, such a term would be neces- 
sary in the salient condition. Perhaps the best 
general solution would be to include a term
for inconsistency with a variable weight w_i.
In the nonsalient condition with w_i = 0, the
inconsistency term would effectively drop out
of the model. It remains for future research
to determine what experimental conditions,
other than salience, will yield a w_i ≠ 0 for
the inconsistency term.
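
One way to picture such a term is sketched below. The text does not specify a functional form, so everything here is an assumption made for illustration: a simple averaging of trait values, a subtractive inconsistency penalty, and the example numbers are all hypothetical.

```python
def predicted_liking(trait_values, inconsistency, w_i=0.0):
    """Hypothetical averaging model with a weighted inconsistency penalty.

    trait_values  -- scale values of the traits in a set (assumed inputs)
    inconsistency -- degree of inconsistency among the traits (assumed measure)
    w_i           -- weight of the inconsistency term; 0 removes the term
    """
    if not trait_values:
        raise ValueError("at least one trait value is required")
    average = sum(trait_values) / len(trait_values)
    return average - w_i * inconsistency

# Illustrative values only: three high-desirable and three low-desirable traits.
traits = [8, 8, 8, 2, 2, 2]
nonsalient = predicted_liking(traits, inconsistency=1.0, w_i=0.0)
salient = predicted_liking(traits, inconsistency=1.0, w_i=0.9)
print(nonsalient, salient)
```

With w_i = 0 the prediction reduces to the plain average, which mirrors the nonsalient condition; any positive w_i lowers the predicted liking for an inconsistent set.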
Subjects had either a choice or no choice about receiving either a favorable
or an unfavorable evaluation concerning how socially sensitive they were and 
had either prior information or no prior information concerning the favorability 
of the evaluation. A significant Choice X Direction of the Evaluation inter- 
action on the subjects’ ratings of their social sensitivity indicated that choice 
decreased acceptance of the favorable message and increased acceptance of the 
unfavorable message. Similar Choice X Direction interactions were obtained for 
ratings of oneself and of the “typical subject” on ease of making friends and 
for source credibility. Choice was interpreted as inducing responsibility for 
the consequences of what one has chosen, and predictions about self-concept 
change were contingent on whether these consequences were favorable or
unfavorable.


When a person encounters information 
about himself that contradicts his self- 
concept, he may either find out that he is 
better than he thought he was or that he is 
worse. Ordinarily, one would expect that a 
person would be more accepting of a favor- 
able than an unfavorable message. Not only 
does this generalization make good common 
sense, but it has been embodied in psycho- 
logical theory in concepts such as the “uni- 
directional drive upward” (Festinger, 1954; 
Pepitone, 1964). Moreover, experimental re- 
sults have supported this idea (e.g., Eagly, 
1967³; Steiner, 1968), provided that the
person who receives the message does not ex- 
pect to be evaluated again (Eagly & Acksen, 
1971). 


1 The authors wish to express their appreciation
to Samuel Himmelfarb, Ivan Steiner, and George 
Levinger for their helpful comments on an earlier 
version of the paper. This paper was presented at 
the meeting of the Eastern Psychological Association,
New York, April 1971. 

2 Requests for reprints should be sent to Alice
H. Eagly, Department of Psychology, University of 
Massachusetts, Amherst, Massachusetts 01002. 

3 Eagly (1967) found that change was greater
toward favorable than unfavorable information 
about oneself when considered in relationship to the 
base line provided by receptivity to parallel informa-
tion about another person. Most other studies (e.g.,
Maehr, Mensing, & Nafzger, 1962; Videbeck, 1960) 
are not relevant to this issue because they failed 
to insure that the favorable and unfavorable treat- 
ments presented information equally discrepant from 
the subjects’ initial positions. 
However, this generalization may only be
true when a person is passively exposed to 
information—in other words, when he has no 
choice about receiving feedback. When he has 
chosen to expose himself to an evaluation, he 
considers himself responsible for its conse- 
quences. Since an unfavorable evaluation 
usually has negative consequences (e.g., em- 
barrassment, disappointment, lowered self- 
esteem), a person who chooses it is thereby 
made responsible for something that harms 
him. Both balance theory (Heider, 1958) and 
dissonance theory (Festinger, 1957) would 
argue that choice increases the “inconsist-
ency” experienced by the recipient of a nega-
tive evaluation. From the balance theory
standpoint, choice links the person and the 
unfavorable outcome by a positive unit rela- 
tionship and thereby establishes an unbal- 
anced dyadic configuration. Dissonance theory 
suggests that choice commits (Jones & 
Brehm, 1967) a person to discrepant informa- 
tion, and, if this information is unfavorable, 
it not only is discrepant from the self- 
concept (Aronson, 1969; Nel, Helmreich, & 
Aronson, 1969), but it also induces negative 
consequences for the recipient (Cooper & 
Worchel, 1970). 

The person who has chosen to receive 
evaluative information and receives negative 
feedback faces a considerable dilemma. If we 
assume that he is motivated both to change 
the configuration that makes him responsible
for an event harmful to himself and to pre- 
serve his self-esteem, there is no single re- 
sponse that provides a completely satisfactory 
resolution. However, given that he cannot
deny responsibility for his choice, one likely
outcome is that he lowers his self-concept 
more than he would had he not chosen to 
receive the message. Then the consequences 
of the choice are less negative in the sense 
that the feedback is no longer so much worse 
than his self-evaluation. But this resolution 
is potentially threatening to self-esteem since 
it does require lowering the self-concept. 
Thus, Bramel’s (1962, 1963) work suggests 
that, in the interest of preserving self-esteem, 
evaluations of other persons may also be 
lowered so that, relative to other people, one 
is no more unfavorably placed than he was
before. 

In contrast, the person who chooses to 
receive evaluative information and receives a 
message more favorable than his self-concept
is unlikely to alter these consequences of his
decision since he is personally responsible for
choosing a surprisingly good and pleasant
outcome. Thus, insofar as the cognition of
being personally responsible for favorable in-
formation is salient, there is little pressure to
revise the self-concept.

Thus, degree of choice and the direction in
which feedback is discrepant from the self-
concept interact in their effect on acceptance
of the feedback. In the base-line situation in
which a person has little freedom to refuse
information about himself, change is greater
toward a favorable than toward an unfavor-
able evaluation. Choice tends to reverse this
effect and thereby to bring about relatively
more receptivity to the unfavorable message.

But it is possible that these effects of 
choice occur primarily when a person has 
foreknowledge concerning the direction of the
evaluation. Can a person be made to feel
personally responsible for consequences that
are unknown to him at the time of his deci-
sion? This consideration prompted the addi-
tion of a third variable: whether subjects
are informed of the favorability of the feed- 
back before they choose to receive it. This 
prior information variable has been explored 
in the dissonance literature where it has been 
claimed both that unforeseen negative conse-
quences increase dissonance (e.g., Brehm,
1959), and that knowing the implications of
one’s choice at the time of the decision is a
necessary condition for obtaining dissonance
effects (Linder, Cooper, & Jones, 1967;
Linder, Cooper, & Wicklund, 1968).


METHOD
Overview 


After completing tasks described as tests of social
sensitivity, a subject either did or did not have a
choice about receiving either favorable or unfavor-
able feedback on his performance and had either
prior information or no prior information about the
favorability of the feedback. A subject then rated
himself on social sensitivity and responded to several
other measures.


Subjects 


The subjects were 158 students from an introduc-
tory psychology course requiring participation in
experiments: 78 were male, and 80 were female, and
the sex ratio was approximately constant across the
cells of the design. Ten subjects were eliminated 
because they were either suspicious about the genu- 
ineness of the feedback (3), students of the experi- 
menter (4), or refused the feedback (3, in choice 
conditions). Data from 3 additional subjects were 
removed randomly to equalize cell size. There re-
mained 16 subjects in each of the eight condi-
tions. Seventeen subjects, who received no message,
constituted a control group. 


Procedure 


For convenience, subjects were scheduled in groups 
of three, with each group containing at least one fa- 
vorable- and one unfavorable-message subject. The 
experimenter explained that he was establishing 
norms for a number of personality tests. In the first
test, which was adapted from Aronson and Carl-
smith (1962), subjects selected “the psychologically
maladjusted person” from each of 20 sets of three 
photographs of college students presented via an 
opaque projector. In a second test, subjects wrote 
a story in response to a Thematic Apperception 
Test picture. In order to give the experimenter 
time to appear to score these two tests, the sub-
jects were then given a long questionnaire con-
taining several scales (e.g., self-esteem, need
for social approval). After 25 minutes elapsed, 
the experimenter presented each subject with a 
mimeographed note explaining that the first two 
tasks assessed social sensitivity. The note described
the tests as valuable and as yielding not just a 
general social sensitivity score but also information 
about “the more specific strengths and weaknesses 
that affect how you get along with people.” The 
note also explained that the experimenter had scored 
these tests and had feedback available. 
In the prior information conditions, the note in-
cluded the following written statement: “I should
probably tell you that your performance was
________.” The space was filled in hand-
writing by the words “very good” in the favorable-
message conditions and by “not very good” in the
unfavorable-message conditions. In the choice condi- 
tions, the note also instructed the subject to check 
yes if he wanted the feedback and no if he did not. 
The three subjects who checked no were removed 
from the analysis: one was in a prior-information, 
unfavorable-message condition, but the other two 
were in no-prior-information conditions. In the no- 
choice conditions, the note instead directed the sub- 
ject to pick up the feedback from the experimenter. 
For the control subjects, who received no message, 
feedback was not mentioned, and the choice and 
prior-information instructions were omitted. 

A subject then received favorable or unfavorable 
feedback handwritten on a mimeographed form. A 
subject read the evaluation, returned it to the ex- 
perimenter, and then responded to the dependent 
variable instruments. The self- and other-person 
rating scales and the general self-esteem measure 
were presented in a single booklet described as 
another “personality test.” The remaining measures 
(test and source evaluation; manipulation checks; 
suspicion) were presented in a subsequent booklet 
described as assessing subjects’ reactions to the tests 
and the testing situation. Thorough debriefing fol- 
lowed. 


Feedback Concerning Social Sensitivity 


The bogus evaluation consisted of a numerical 
score and a written statement. In order to insure 
that the favorable and unfavorable messages were, 
on the average, equally discrepant from the subjects’ 
self-concepts, they had been constructed on the 
basis of pretesting. To establish a base line for the 
numerical score feedback, which was described as 
the number of photographs correctly guessed on the 
photo-selection task, pretest subjects completed this 
task and then estimated their performance on it by 
indicating a score on the identical form later used 
to give the experimental subjects their numerical 
feedback. This form included bogus test norms indi- 
cating that most persons received middle-range 
scores, Since the mean of the score estimated by the 
16 pretest subjects was 11 correct on the 20 trials, 
the score of 18 was chosen as the favorable message 
and 4 as the unfavorable message. 
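
The choice of 18 and 4 can be checked against the pretest base line: both scores lie the same distance from the mean estimate of 11. A minimal sketch, with illustrative names:

```python
PRETEST_MEAN_ESTIMATE = 11   # mean score estimated by the 16 pretest subjects
N_TRIALS = 20                # photograph sets in the photo-selection task
FEEDBACK_SCORES = {"favorable": 18, "unfavorable": 4}

def discrepancy(score, baseline=PRETEST_MEAN_ESTIMATE):
    """Signed distance of a feedback score from the pretest base line."""
    return score - baseline

discrepancies = {direction: discrepancy(score)
                 for direction, score in FEEDBACK_SCORES.items()}
print(discrepancies)  # {'favorable': 7, 'unfavorable': -7}
```

Both messages are 7 points from the base line, and both remain within the 0 to 20 range of possible scores.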

To establish a base line for the written statement, 
which was described as based on performance on 
both of the social sensitivity tests, 17 pretest sub- 
jects judged each of several statements on the 
amount of social sensitivity possessed by the person 
it described, in comparison to himself. The state- 
ments chosen were each judged about 2 points, on 
a 9-point scale, distant from the midpoint, which 
was labeled “equal to myself.” The unfavorable
statement chosen was the following: 


This person could sometimes be warm and sensi-
tive if he desired; however, he is often too in-
volved in his own activities to notice others. He
would make a good casual acquaintance, but as a 
friend he could not be relied upon. When he does 
show warmth and sensitivity, it is unexpected and 
does not last too long. 


The favorable statement was the following: 


This person is exceptional in sensitivity and 
warmth shown to others, particularly those in need. 
He shows an acute understanding of human trou- 
bles, indicating an affection for his fellow man. 
Friends and even relative enemies turn to him 
when faced with problems. He is known as a person 
with no real enemies, for everyone whom he meets, 
likes him. 


Measuring Instruments 


Self-ratings. A subject rated himself on social sensi- 
tivity and ease of making friends, which were em- 
bedded among traits irrelevant to the message (e.g., 
physical attractiveness). For each trait, the subject 
indicated on a scale the percentage of undergradu- 
ates whose standing on the trait he thought he 
exceeded. 

Other-person ratings. A subject used the same 
trait scales to rate the “typical subject” taking part 
in the experiment. 

General self-esteem. To measure predispositional 
self-esteem, 18 items from Berger’s (1952) Self- 
Acceptance Scale were included in the test taken 
while the experimenter apparently scored the social 
sensitivity tests. Eighteen different items from this 
scale were administered as a postmeasure in order to 
assess self-esteem as a dependent variable. 

Evaluation of social sensitivity tests and of source 
of evaluation. Using 11-point rating scales, a sub- 
ject evaluated (on validity, accuracy, and reliability) 
the quality of the social sensitivity tests he had 
taken, and evaluated (on competence and insight- 
fulness) the skill of the experimenter as an inter- 
preter of these tests. 

Other measures. A subject indicated on an 11- 
point scale the extent to which he perceived he had 
a choice about receiving the feedback. He also rated 
the favorability of the feedback on an 11-point
scale. Finally, he completed the following statement
designed to assess suspicion concerning the manipu-
lations: “I think the purpose of the experiment
was ________.”


RESULTS 
Manipulation Checks 


Subjects in the choice conditions perceived 
greater freedom of choice than did subjects in 
the no-choice conditions (F = 41.22, df= 
1/120, p < .001). Subjects receiving the fa- 
vorable message judged the message as more 
favorable than did subjects receiving the un- 
favorable message (F = 281.27, df = 1/120, 
p < .001), and this effect was also significant
among the subjects who did not receive prior
information concerning the direction of the
message (F = 156.81, df = 1/60, p < .001).
No other significant effects were obtained in
the ratings of choice or of the favorability of
the message.


TABLE 1
MEAN CHANGE IN SELF-RATINGS ON SOCIAL SENSITIVITY

                  No prior information       Prior information
Message           No choice    Choice        No choice    Choice
Favorable           23.3        13.8           17.7         9.2
Unfavorable          4.1         9.9            7.2        16.1


TABLE 2
ANALYSIS OF VARIANCE OF CHANGE IN SELF-RATINGS
ON SOCIAL SENSITIVITY

Source of variation         df        MS         F
Direction (A)                1     1434.12     4.77*
Choice (B)                   1       22.00      <1
Prior information (C)        1        1.12      <1
A X B                        1     2128.78     7.08**
A X C                        1      760.50     2.53
B X C                        1       34.03      <1
A X B X C                    1        7.03      <1
Error                      120      300.96

* p < .05.
** p < .01.


Self-Evaluations 


Ratings of self (and of the other person) 
were scored as deviations from the mean con- 
trol group ratings. A positive score should be 
interpreted as change toward the message, 
that is, a rating on the positive side of the 
control group in the favorable-message con- 
ditions and on the negative side in the un- 
favorable-message conditions. A negative score 
indicates change away from the message—a 
rating on the opposite side of the control 
group from the message. 
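
The scoring rule can be written as a small function. This is a sketch of the rule as described, not the authors' scoring procedure; the function name and the example ratings are hypothetical, and 58.1 is the control group mean reported for social sensitivity.

```python
def change_toward_message(rating, control_mean, direction):
    """Score a rating as change toward (+) or away from (-) the message.

    For a favorable message, ratings above the control mean count as positive;
    for an unfavorable message, ratings below the control mean count as positive.
    """
    deviation = rating - control_mean
    if direction == "favorable":
        return deviation
    if direction == "unfavorable":
        return -deviation
    raise ValueError(f"unknown message direction: {direction!r}")

CONTROL_MEAN = 58.1  # control group mean for social sensitivity self-ratings

# Hypothetical individual ratings, for illustration only.
toward_favorable = change_toward_message(70.0, CONTROL_MEAN, "favorable")
toward_unfavorable = change_toward_message(50.0, CONTROL_MEAN, "unfavorable")
away_from_unfavorable = change_toward_message(65.0, CONTROL_MEAN, "unfavorable")
print(toward_favorable > 0, toward_unfavorable > 0, away_from_unfavorable < 0)
```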

Subjects’ self-ratings on social sensitivity, 
scored as deviations from the control group 
mean of 58.1, appear in Table 1. It might be 
noted that all means are positive or, in other 
words, that the favorable message produced 
an increase in self-evaluation, while the un- 
favorable message produced a decrease. The 
analysis of variance (Table 2) yielded a sig-
nificant interaction between choice and di-
rection (see Figure 1). Although the main
effect of direction was significant, the ten-
dency for a favorable evaluation to elicit more
change than an unfavorable evaluation was
confined to the no-choice conditions, where it
was highly significant by a simple effects test
(F = 11.73, df = 1/120, p < .001). Tests of
the simple effects of choice at levels of direc- 
tion indicated that, for a favorable evalua- 
tion, choice decreased the amount of change 
(F = 4.31, df = 1/120, p < .05), while for an 
unfavorable evaluation, choice increased 
change (F = 2.84, df = 1/120, p < .10). The 
weak interaction between prior information 
and direction (p < .20) indicated that prior
information tended to decrease change toward 
the favorable message and increase change 
toward the unfavorable message. 
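
The F ratios for this balanced 2 X 2 X 2 design can be approximately reconstructed from the cell means in Table 1, the cell size of 16, and the error mean square in Table 2. The sketch below assumes single-degree-of-freedom contrasts with coefficients of +1 and -1; because the published means are rounded to one decimal, the recomputed values only approximate the reported ones. All names are illustrative.

```python
N_PER_CELL = 16      # subjects per experimental cell
MS_ERROR = 300.96    # error mean square from Table 2 (df = 120)

# Cell means from Table 1, keyed by (direction, prior information, choice).
CELL_MEANS = {
    ("favorable", "no_prior", "no_choice"): 23.3,
    ("favorable", "no_prior", "choice"): 13.8,
    ("favorable", "prior", "no_choice"): 17.7,
    ("favorable", "prior", "choice"): 9.2,
    ("unfavorable", "no_prior", "no_choice"): 4.1,
    ("unfavorable", "no_prior", "choice"): 9.9,
    ("unfavorable", "prior", "no_choice"): 7.2,
    ("unfavorable", "prior", "choice"): 16.1,
}

# Contrast coding for each level of each factor.
SIGN = {"favorable": 1, "unfavorable": -1,
        "no_prior": 1, "prior": -1,
        "no_choice": 1, "choice": -1}

DIRECTION, PRIOR, CHOICE = 0, 1, 2   # positions within the cell key

def contrast_f(cells, factors):
    """F for a 1-df contrast: SS = n * L**2 / k, with L summed over k cells."""
    contrast = 0.0
    for key, cell_mean in cells.items():
        coefficient = 1
        for factor in factors:
            coefficient *= SIGN[key[factor]]
        contrast += coefficient * cell_mean
    sum_of_squares = N_PER_CELL * contrast ** 2 / len(cells)
    return sum_of_squares / MS_ERROR

def effect_f(*factors):
    """F for a main effect (one factor) or an interaction (several factors)."""
    return contrast_f(CELL_MEANS, factors)

def simple_effect_f(factor, at_factor, at_level):
    """F for one factor at a single level of another, pooled over the third."""
    subset = {k: m for k, m in CELL_MEANS.items() if k[at_factor] == at_level}
    return contrast_f(subset, (factor,))

print(f"Direction:           F = {effect_f(DIRECTION):.2f}")
print(f"Direction X Choice:  F = {effect_f(DIRECTION, CHOICE):.2f}")
print(f"Direction, no choice: F = {simple_effect_f(DIRECTION, CHOICE, 'no_choice'):.2f}")
print(f"Choice, favorable:    F = {simple_effect_f(CHOICE, DIRECTION, 'favorable'):.2f}")
```

The recomputed values fall close to the reported 4.77, 7.08, 11.73, and 4.31, which also confirms that the Table 1 entries and Table 2 entries are mutually consistent.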

On self-ratings on the related trait of ease 
of making friends, which were scored as devi- 
ations from the control group mean of 61.4, 
choice decreased change toward the favorable 
message but increased change toward the un- 
favorable message only very slightly (see 
Table 3). The interaction between choice and 
direction was significant (F = 4.89, df = 1/ 
120, p < .05), and choice yielded a significant | 
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MEAN CHANGE TOWARD MESSAGE 


No Choice Choice 
CHOICE CONDITION 


Fic. 1. Mean change in self-ratings on 
social sensitivity. 
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simple effect only for the favorable message 
(F = 8.84, df = 1/120, p<.05). Analysis 
of the self-esteem scale administered as a 


postmeasure yielded only a main effect of ` 


direction: the favorable message changed 
self-esteem toward the message, while the un- 
favorable message changed self-esteem slightly 
away from the message (F = 10.44, df = 1/ 
120, p < .005). 


Other Dependent Variables 


Analysis of the subjects’ ratings of the 
other person on social sensitivity yielded no 
significant effects. However, on ease of mak- 
ing friends (scored as a deviation from the 
control group mean of 56.6), choice increased 
change in the unfavorable-message conditions 
and decreased change in the favorable-mes- 
sage conditions (see Table 3). This effect 
yielded a significant Choice X Direction in-
teraction (F = 7.95, df = 1/120, p < .01)
and a significant simple effect of choice only
with the unfavorable message (F = 5.70, df
= 1/120, p < .025). A significant main effect
of direction (F = 7.14, df = 1/120, p < .01)
indicated greater change toward the favorable 
message. 

Means (pooled over prior information con- 
ditions) for the subjects’ evaluations of the 
message source (i.e., the experimenter) and of 
the supposed social sensitivity tests appear 
in Table 4. Each scale had a possible range 
of 1 to 11, with higher numbers representing 
more favorable evaluations. As expected, the 
favorable message elicited a more positive re- 
action to the source than did the negative 
evaluation (F = 35.09, df = 1/120, p <
.001), and this effect was significant in the
choice as well as the no-choice conditions. The
Choice X Direction interaction was also sig-
nificant (F = 5.54, df = 1/120, p < .025),
and simple effects tests indicated that choice
raised evaluations of the source when he
gave an unfavorable message (F = 5.52, df
= 1/120, p < .025), but had no effect when
he gave a favorable message. Analysis of sub-
jects’ ratings of the social sensitivity tests
yielded only a significant main effect of direc-
tion such that the tests were evaluated more
positively when they yielded a favorable eval-
uation (F = 26.36, df = 1/120, p < .001).


TABLE 3
MEAN CHANGE IN RATINGS ON EASE OF MAKING
FRIENDS FOR SELF AND OTHER (POOLED OVER
PRIOR INFORMATION CONDITIONS)

Person rated    Message        No choice    Choice
Self            Favorable        12.9       [illegible]
                Unfavorable      11.4        12.2
Other           Favorable         9.0         4.0
                Unfavorable      −3.1         4.3


TABLE 4
MEAN EVALUATION OF SOURCE AND MESSAGE (POOLED
OVER PRIOR INFORMATION CONDITIONS)

Evaluation      Message        No choice    Choice
Source          Favorable         8.7         8.3
                Unfavorable       6.1         7.2
Message         Favorable         6.3         6.6
                Unfavorable       4.2         4.6


Influence of Sex and Self-Esteem 


Additional analyses of self-ratings on social 
sensitivity included either sex or self-esteem 
as a fourth independent variable in an un- 
weighted-means analysis of variance. The 
interaction between sex and direction was sig-
nificant (F = 4.58, df = 1/112, p < .05):
males changed more toward the favorable 
than the unfavorable message, while females 
changed toward them about equally. The self- 
esteem analysis yielded a significant interac- 
tion between self-esteem and direction (F = 
5.57, df = 1/112, p < .025): high self-esteem 
subjects changed more toward the favorable 
than the unfavorable message, while low self- 
esteem subjects changed toward them about 
equally. Despite the similarity between the 
sex and the self-esteem findings, the sexes did
not differ in self-esteem (t < 1). The Self-
Esteem X Direction interaction also appeared
in self-ratings on ease of making friends (p
< .025).


DISCUSSION


This study found that favorable feedback 
raised self-evaluations and negative feedback 
lowered self-evaluations. However, giving sub- 
jects a choice about receiving feedback signifi- 
cantly decreased the upward change of the 
social sensitivity self-ratings when the feed-
back was favorable and tended (p < .10) to
increase the downward change in response to 
the unfavorable message. Self-ratings on the 
related trait, ease of making friends, showed 
a similar pattern of findings. 

The self-evaluation findings for favorable 
information are, then, quite clear-cut. Sub- 
jects made personally responsible for a favor- 
ably discrepant message by virtue of having 
chosen to receive it tended to maintain the 
goodness of this outcome by keeping it more 
positive than their self-concepts, or, in other 
words, by raising their self-concepts signifi- 
cantly less than they would if the message 
were imposed on them. Thus, the hedonically
satisfying configuration induced by choosing
to receive feedback that is discrepant in a
self-enhancing direction is relatively stable,
and, consequently, the self-concept rises less 
than it otherwise would. 

We argued, however, that a person whose 
choice made him personally responsible for
receiving an unfavorable evaluation would
tend to alter this outcome by moving his self- 
concept downward to be closer to the evalua- 
tion. Yet the tendency for choice to increase 
acceptance of the unfavorable message ap- 
peared only as a .10 effect on the social sensi- 
tivity ratings, and no significant effect was 
obtained on the ease of making friends rat-
ings. But the weakness of these self-evalua-
tion effects should be viewed in conjunction
with the clearly significant effect of choice on 
two of the other dependent variables—the 
“typical subject’s” ease of making friends 
and source evaluation. For these variables, 
both of which constituted reactions to other 
people, choice had a significant impact only 
when the message was unfavorable, while for 
ratings of oneself, choice had a significant 
impact only when the message was favorable. 
In other words, the class of responses—per- 
ception of oneself or perception of other peo- 
ple—that was more affected depended on the 
direction of the message. 

Our interpretation of this self-other com- 
plementarity is that with the unfavorable mes- 
sage, choice induced a set of cognitive realign- 
ments involving other people because the 
resolution most directly demanded by the 
message, changing the self-concept downward,
would threaten self-esteem. Although the
data are not entirely clear-cut in this respect,
they do suggest that the slight choice-induced
increase in change of the social sensitivity
self-ratings toward the negative evaluation
was cushioned by the projection of at least
one aspect, ease of making friends, onto other
people. Also, the more positive evaluation of 
the source obtained with choice as compared 
with no choice in the unfavorable-message 
conditions can be interpreted as an indirect 
(and therefore relatively nonthreatening) 
acknowledgment of the accuracy of the un- 
favorable evaluation. 

Both high self-esteem and male sex in-
creased change toward the favorable message
and decreased change toward the unfavorable 
message. The sex effect has, to our knowledge, 
not been demonstrated before, and one of us 
considers it a demonstration of male chauvin-
ism. The self-esteem effect has been noted be-
fore (Eagly, 1967) and is also similar to
Leventhal and Perloe’s (1962) experiment in 
which high self-esteem subjects were more re- 
ceptive to an optimistic, self-enhancing com- 
munication than to a pessimistic, threatening 
one, and low self-esteem subjects showed the 
reverse effect. 

Before exploring the implications of the 
present interpretation of choice, it is well to 
consider an alternative interpretation of the 
principal findings—namely, that giving people 
a choice does not affect response to the mes- 
sage but lowers self-esteem in a general way, 
perhaps by making people unsure of them-
selves, and therefore makes all ratings more
negative than they would otherwise be. Al-
though such an interpretation could indeed
account for the lower self-ratings on social 
sensitivity and ease of making friends, it is 
contradicted by the absence of choice effects 
on the self-esteem dependent variable. Since 
choice, then, lowers evaluations only on mes- 
sage-relevant dimensions, it is more reasonable 
to assume that it affects response to the mes- 
sage and does not act directly on self-esteem. 

The present analysis of choice in terms of 
inducing responsibility for the consequences 
of one’s choice is meant to encompass only 
situations in which favorable information is 
perceived as enhancing the self and unfavor-
able information as damaging the self. There
no doubt are situations in which favorable 
feedback is not self-enhancing, for exam- 
ple, because it is a cue that the evaluator 
is engaging in ingratiation. Also, an unfavor- 
able evaluation may sometimes be regarded 
as helpful and self-enhancing, at least in the 
long run. Such an unfavorable message might, 
for example, discuss personal weaknesses in a 
way that opens a route to self-improvement 
rather than give general evaluative comments 
as did the message in the present study. In 
future research, it would be important to 
manipulate the perceived utility of the feed- 
back as well as the direction in which it is 
discrepant from the self-concept. 

It is interesting that the effects of choice 
were not a function of the subjects’ prior in- 
formation concerning the direction of the dis- 
crepancy. The effect of prior knowledge is 
not well understood, and there are conflicting 
findings in the literature. However, it seems 
likely that the variable made little difference 
in the present study because, even without a 
warning about the direction of the evaluation, 
the subjects probably spontaneously consid- 
ered both the possibility of a favorable and 
the possibility of an unfavorable outcome. 

The present study has especially important 
implications for dissonance theory because it 
raises serious questions about an issue that is 
central within that framework—namely, how 
the psychological mechanisms governing atti- 
tude change differ in high- and low-choice 
situations (e.g., Brehm & Cohen, 1962). The 
study demonstrates that the pressures placed 
on our cognitions by persuasive communica- 
tions received under conditions of volition or 
commitment are not as well understood as, for 
example, Brehm and Cohen’s (1962) disso- 
nance analysis might imply. A dissonance 
choice or commitment analysis would suggest 
that choosing a discrepant message increases 
the pressure to change toward it over what 
the pressure would be in passive exposure 
situations, but the present study shows that 
pressure is actually decreased by choice if the 
message is favorably discrepant. Nor is it 
adequate, as Aronson’s (1969; Aronson & 
Carlsmith, 1962; Nel et al., 1969) self-con-
cept dissonance analysis might imply, only to
consider whether the feedback a person re-
ceives contradicts expectancies about himself.
Instead, it is crucial to consider the direction
in which information is discrepant from the 
self-concept. Also, Cooper and Worchel’s 
(1970) recent definition of forced compliance 
dissonance in terms of committing oneself to 
undesired consequences is incomplete since it 
ignores the effect of desired outcomes in atti- 
tude change. Thus, one major outcome of the 
present study is its demonstration that each 
of these special varieties of dissonance theory 
is inadequate since each is incomplete: all of 
these factors—choice, self-concept, and desir- 
ability of consequences—must be considered 
within a common theoretical framework in 
order to handle the findings of the present 
study. 

In particular, variables of the choice and 
commitment class must be conceptualized in 
a form valid for both favorable and unfavor- 
able information, and response to self-dis- 
crepant information must be understood in 
terms general to both commitment and passive 
exposure situations. The present framework
bridges commitment and passive exposure 
situations as well as favorable and unfavor- 
able information by conceptualizing choice as 
making a person responsible for an outcome. 
The valence of the outcome is crucial to pre- 
dicting what cognitive changes follow. There 
are two basic postulates from which the pre- 
dictions follow: (a) that people who are re- 
sponsible for receiving an outcome are under 
more pressure to realign their cognitions if the 
outcome is negative than if it is positive and 
(b) that people avoid lowering the self-con- 
cept. The oversimplicity of predicting response 
to self-relevant information in terms of simple 
principles such as the “unidirectional drive 
upward” (Festinger, 1954) is then clearly 
apparent. This and other studies (e.g., Eagly 
& Acksen, 1971) highlight conditions under 
which people are not more likely to be influ- 
enced by favorable than unfavorable informa- 
tion. 

The subjects received a series of personal evaluations from one or more indi-
viduals and then rated their attraction to these individuals. The number,
content, and proportion of favorable and unfavorable evaluations received 
from a given source were systematically varied in two experiments. The results 
of both studies were generally consistent with Byrne’s reinforcement theory 
of interpersonal attraction—liking for an evaluator tended to be a function 
of the proportion of favorable evaluations that were received from this 
individual. When the confederate expressed negative feelings about the subject 
and when these feelings were communicated directly to the subject (rather than 
being inadvertently overheard), they seemed to have a permanent effect in 
serving to reduce the confederate’s attractiveness. 


Which person do you think you would like 
more, someone who consistently acted toward 
you in a friendly or rewarding manner or 
someone who was initially cold and rejecting 
and gradually became warm and accepting? 
Conversely, which person do you think you 
would dislike more, someone who was con- 
sistently cold and unrewarding or someone 
who was initially friendly and gradually be- 
came more and more disparaging? 

According to Byrne’s (1969) reinforcement 
theory of interpersonal attraction, the attrac- 
tion of one person toward a second person is a 
function of the proportion of times the latter 
rewards or reinforces the former. You should 
be most attracted to someone who consistently 
acts in a rewarding manner and should be 
least attracted to someone who consistently 
acts in an unrewarding manner. 

The results of an experiment by Aronson 
and Linder (1965) were not consistent with 
reward theory. Subjects received a series of 
personal evaluations from a confederate and 
then rated their liking for this individual. In 
one condition, the evaluations changed grad- 
ually from negative to positive; in a second 
condition, the evaluations changed gradually 
from positive to negative; in a third condi- 
tion, the evaluations were uniformly positive; 
while in a fourth condition, the evaluations 
were uniformly negative. Subjects were most
attracted to the confederate when the evalua-
tions changed gradually from negative to posi-
tive and least attracted to the confederate
when the evaluations changed gradually from
positive to negative. Aronson and Linder con-
cluded that a gain in esteem may produce
more attraction than steady positive esteem,
while a loss in esteem may generate more hos-
tility than steady negative esteem.


1 Requests for reprints should be sent to Jay
Hewitt, Department of Psychology, University of
Missouri, Kansas City, Missouri 64110.

The results of follow-up studies dealing 
with the gain- and loss-of-esteem effects have 
failed to support the theory put forth by 
Aronson and Linder and instead have tended 
to support Byrne’s (1969) reinforcement 
theory of interpersonal attraction. In a study 
by Sigall and Aronson (1967), the subjects 
were required to give a series of speeches and 
were evaluated by the experimenter after each 
speech. In one condition, the evaluations of 
the subject’s performance were initially un- 
favorable and ended up being favorable. In a 
second condition, the evaluations were ini- 
tially favorable and ended up being unfavor- 
able. In a third condition, the evaluations were 
uniformly favorable; while in a fourth condi- 
tion, the evaluations were uniformly unfavor- 
able. Subjects were most attracted to the 
evaluator when the evaluations were uni- 
formly favorable and were least attracted to 
the evaluator when the evaluations were uni- 
formly unfavorable. In a study by Mettee 
(1971), the subjects (a) were asked to state 
their opinion on a certain issue, (b) received 
either praise or criticism for their answer from
an evaluator, (c) rated their liking for the
evaluator, (d) were asked to state their opin- 
ion on a related issue, (e) received either 
praise or criticism for their answer from the 
same evaluator, and (f) rerated their liking 
for the evaluator. The results presented by 
Mettee were more consistent with reward 
theory than with gain-loss theory.

The present study was designed to examine 
the effect of different sequences of personal 
evaluations on interpersonal attraction. The 
paradigm employed was similar to but some- 
what more elaborate than the paradigm em- 
ployed by Aronson and Linder (1965). 


METHOD 
Design


The subjects presented some information about
themselves to two individuals (A and B), received
either a favorable or unfavorable evaluation from 
one of these individuals, presented some additional 
information about themselves to Person B, received 
either a favorable or unfavorable evaluation of 
themselves from B, and then rated their liking for 
Person B. 


Subjects 


The subjects consisted of 160 females recruited 
from an introductory psychology class. Twenty sub- 
jects were randomly assigned to each of the eight 
conditions. 


Evaluations 


The first evaluation was either generally positive
(P₁) or generally negative (N₁), and the second
evaluation was either generally positive (P₂) or
generally negative (N₂). These evaluations are given
below.


P₁: “understanding, overcritical, good-natured,
interesting, and warm”

N₁: “intelligent, untrustworthy, neurotic, deceitful,
and unpleasant”

P₂: “kind, sincere, imaginative, overcritical, and
mature”

N₂: “insincere, intelligent, cold, shallow, and of-
fensive”


Evaluations P₁ and N₁ were tape-recorded by two
female confederates (A and B). Evaluations P₂ and
N₂ were tape-recorded by B.


Procedure 


At a preliminary session, all subjects were asked 
to fill out a self-description questionnaire and to 
write four essays on “interests,” “goals in life,” 
“people I like and dislike,” and “personal problems.” 

Upon arriving for the second session, the subject 
was seated in front of an intercom and informed 
(erroneously) that there were two other people par- 
ticipating in the experiment (Subjects A and B). It 
was pointed out that each subject was located in a
different room and that all rooms were “hooked up”
with intercoms. The subject was then told that in
the first phase of the experiment she would (a)
deliver some information about herself to Subjects
A and B, (b) receive an evaluation from one of
these individuals, (c) deliver some additional infor-
mation about herself to Person B, and (d) receive
an evaluation from Person B. After the procedure
had been explained, the subject was handed two
lists of adjectives, informed that each evaluation
would consist of five adjectives, and was requested
to put a check mark beside the adjectives comprising
each evaluation (the first list being used to record
the first evaluation, the second list being used to
record the second evaluation).

The experimenter then retired to an adjoining room 
(connected with the subject’s room via intercom) 
and asked the subject to read the first three essays 
she had written at the preliminary session (interests, 
goals in life, people I like and dislike). A series of 
lights in the subject’s room indicated that both A
and B were listening. When the subject had finished,
the experimenter asked either A or B to give her
impression of the subject. After a few seconds’ pause,
the experimenter turned on the tape recorder, and the
first evaluation was delivered. (A series of lights in
the subject’s room indicated that [illegible] subject
could hear this evaluation.) The experimenter then
asked the subject to read her essay on “per-
sonal problems” and the answers [illegible]. When the subject
had finished, the experimenter then asked Person B
to give her impression of the subject. After a brief
pause, the experimenter turned on the tape recorder,
and the second evaluation was delivered.

After the subject had received both evaluations,
the experimenter returned to the subject’s room and
asked the subject to rate her liking for B. A 7-point
scale was employed ranging in value from +3 (like
very much) to -3 (dislike very much). The subject
was then debriefed and dismissed. 


RESULTS 


A three-way analysis of variance was car-
ried out on the data. A triple interaction was
obtained between the nature of the first evalu-
ation, the nature of the second evaluation, and
the source of the first evaluation (F = 4.43,
df = 1/152, p < .05). A two-way analysis
of variance was then carried out on the data
in the condition in which both evaluations
were delivered by Person B. When both evalu-
ations were delivered by Person B, there was
more attraction to B when the first evaluation 
was favorable than when it was unfavorable 
(F = 5.52, df = 1/76, p < .05); there was 
more attraction to B when the second evalua-
tion was favorable than when it was unfavor-
able (F = 32.87, df = 1/76, p < .001), and
the interaction was not significant (F < 1). 
This is consistent with reward theory. A sec- 
ond two-way analysis of variance was carried 
out on the data in the condition in which the 
first evaluation was delivered by Person A. 
In this condition, there was an interaction be-
tween the nature of the first evaluation and
the nature of the second evaluation (F =
5.21, df = 1/76, p < .05). When the first
evaluation was delivered by Person A, there
was no difference between the negative-posi-
tive and positive-positive conditions (t =
.60, df = 38, p > .05), while there was more
attraction to B in the negative-negative con-
dition than in the positive-negative condition
(t = 2.48, df = 38, p < .05). The last finding
would not be predicted by either a simple re-
ward theory or by gain-loss theory; such a
prediction would be outside the boundary
conditions of both theories. The means are
presented in Table 1.


Source of First Evaluation in Negative-Positive
Condition


When the subject receives an unfavorable 
evaluation followed by a favorable evalua- 
tion, reward theory predicts that there should 
be greater attraction to B when only the last 
evaluation is delivered by B, while gain-loss 
theory predicts that there should be greater 
attraction to B when both evaluations are 
delivered by B. When both evaluations are 
delivered by B, the subject experiences a gain 
in esteem from B, and the proportion of 
favorable evaluations received from B is 50%. 
When only the last evaluation is delivered 
by B, the subject does not experience a gain 
in esteem from B, and the proportion of 
favorable evaluations received from B is 
100%. The results were consistent with re- 
ward theory. When the subject received an 
unfavorable evaluation followed by a favor- 
able evaluation, there was greater liking for 
B when only the last evaluation was delivered 
by B than when both evaluations were deliv-
ered by B (t = 2.12, df = 38, p < .05).


TABLE 1
Mean Liking for B

                          Evaluations from B
Sequence                  Both        Last          t
Positive-positive         1.80        2.20          ns
Negative-positive         1.40        2.05          2.47*
Positive-negative          .50        -.70          2.73**
Negative-negative         -.35        [illegible]   ns

* p < .05.
** p < .01.


Source of First Evaluation in Positive-Negative
Condition


When the subject receives a favorable eval-
uation followed by an unfavorable evaluation,
reward theory predicts that there should be
more attraction to B when both evaluations
are delivered by B, while gain-loss theory 
predicts that there should be less attraction 
to B when both evaluations are delivered by 
B. When both evaluations are delivered by 
B, the subject experiences a loss of esteem 
from B, and the proportion of favorable eval- 
uations received from B is 50%. When only 
the last evaluation is delivered by B, the pro- 
portion of favorable evaluations received from 
B is 0%, and there is no loss of esteem from 
B. The results were consistent with reward 
theory. When the subject received a favorable 
evaluation followed by an unfavorable evalu- 
ation, there was more attraction to B when 
both evaluations were delivered by B than 
when only the last evaluation was delivered 
by B (t = 2.73, df = 38, p < .01).
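
The two sets of predictions can be made concrete with a short sketch. The Python below is an illustration only and is not part of the original analysis; the function names and the coding of each sequence as a pair of valences are assumptions introduced here. For each sequence and each source of the first evaluation, it computes the proportion of favorable evaluations received from B (the quantity on which reward theory bases its prediction) and whether B's expressed esteem showed a gain, a loss, or no change (the quantity on which gain-loss theory bases its prediction).

```python
from fractions import Fraction

# Hypothetical coding: valence of the first and second evaluation.
SEQUENCES = {
    "positive-positive": ("P", "P"),
    "negative-positive": ("N", "P"),
    "positive-negative": ("P", "N"),
    "negative-negative": ("N", "N"),
}


def proportion_favorable_from_b(sequence, first_source):
    """Proportion of the evaluations delivered by B that were favorable.

    first_source is "A" or "B"; the second evaluation always comes from B.
    """
    first, second = SEQUENCES[sequence]
    from_b = [second] if first_source == "A" else [first, second]
    return Fraction(from_b.count("P"), len(from_b))


def esteem_change_from_b(sequence, first_source):
    """Return "gain", "loss", or "none" for the change in B's expressed esteem."""
    first, second = SEQUENCES[sequence]
    if first_source == "A" or first == second:
        return "none"  # B spoke only once, or B did not change her evaluation
    return "gain" if second == "P" else "loss"


for name in SEQUENCES:
    for source in ("B", "A"):
        share = proportion_favorable_from_b(name, source)
        change = esteem_change_from_b(name, source)
        print(f"{name:18s} first from {source}: "
              f"{float(share):4.0%} favorable, esteem change = {change}")
```

For the negative-positive sequence the sketch gives 50% when B delivers both evaluations and 100% when B delivers only the last, and for the positive-negative sequence 50% and 0%, which are the percentages stated in the text.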


EXPERIMENT II 


In order to examine the generality of the 
obtained results, portions of the first experi- 
ment were replicated under slightly different 
conditions. In the first experiment, the change 
from positive to negative and vice versa was 
rather abrupt, and it is possible that the gain- 
and loss-of-esteem effects may only be ob- 
tained when the change is more gradual—as 
it was in the Aronson and Linder (1965) 
experiment. As a consequence, the number of 
evaluations in the second experiment was 
increased from two to four. The content of 
the evaluations was also modified. In Experi- 
ment I, the evaluations consisted of a rather 
terse summary, while in the Aronson and 
Linder (1965) study, they consisted of a 
lengthy description. In Experiment II, each 
evaluation consisted either of a lengthy de- 
scription (the same evaluations as those em- 
ployed by Aronson and Linder) or a short 
overall summary (a condition analogous to 
that of Experiment I). 


TABLE 2
MEAN LIKING FOR THE EVALUATOR

                          Type of evaluation
Sequence                  Lengthy         Short
                          description     summary
Positive-positive         1.00            1.30
Negative-positive          .90             .30
Positive-negative         1.20             .60
Negative-negative         -.60            -.60


Method
Subjects 


The subjects consisted of 80 females recruited 
from an introductory psychology class. Ten subjects 
were randomly assigned to each of the eight experi- 
mental conditions. 


Evaluations 


Each subject received four evaluations from the 
same individual (a different confederate from that 
employed in Experiment I) and then rated her liking 
for this person. Four sequences of evaluations were 
employed: positive-positive, negative-positive, posi- 
tive-negative, and negative-negative. 

For half of the subjects, each evaluation consisted 
of two words (short summary condition). For the 
positive-positive sequence, the confederate’s impres-
sion of the subject was always “moderately favor-
able.” For the negative-negative sequence, the con-
federate’s impression of the subject was always
“moderately unfavorable.” For the negative-positive 
sequence, the four evaluations were “moderately un- 
favorable,” “slightly unfavorable,” “slightly favor- 
able,” and “moderately favorable.” These four evalu- 
ations were expressed in the reverse order in the 
positive-negative sequence. 

For the remaining subjects, each evaluation con- 
sisted of a series of comments (lengthy description 
condition). The following is an example of one of 
the four evaluations in the negative-negative condi- 
tion. 


She’s sort of a boring person, not really icky but 
just not one of those people you want to see many 
times . . . seems like one of those girls who just 
kind of gets along and doesn’t bother anyone but 
doesn’t make a very favorable impression either. 


Except for some minor changes, the evaluations em- 
ployed in the lengthy description condition were 
identical to those employed by Aronson and Linder 
(1965). 


Procedure 


The procedure employed in Experiment II was 
similar to that employed in Experiment I. Upon
arriving for the experiment, the subject was seated 
in a small soundproof cubicle (connected to an 
adjoining room via an intercom) and informed that 
the purpose of the experiment was to study person 
perception: how she perceived others and how others
perceived her. The subject was then handed a sheet
that summarized the procedure. The subject would
deliver some information about herself to one other
person (via an intercom), and at various intervals the
other person would be asked to give her impression
of the subject. The subject would be asked to discuss
the following four topics: (a) her interests, (b) her
goals in life, (c) the kind of people she liked and
disliked, and (d) the kind of person she aspired to
become (and how she fell short of this). Each topic
was to be discussed for 3 minutes.

This procedure was then carried out. After the
subject had discussed a given topic for 3 minutes,
the experimenter asked the confederate to give her
impression of the subject. As in Experiment I, all
evaluations were tape-recorded, and the subject was
able to hear each evaluation over her intercom.
When all four evaluations had been delivered, the
subject was asked to rate her liking for the con-
federate on a +3 to -3 scale, after which she was
debriefed and then dismissed.


Results 


The results were again consistent with re-
inforcement theory. A three-way analysis of
variance was carried out on the data. The
only significant effects were those
for the first and second evaluations. There
was more attraction to the confederate when
the first evaluation was favorable than when
it was unfavorable (F = 14.56, df = 1/72,
p < .001), and there was more attraction to
the confederate when the second evaluation
was favorable than when it was unfavorable
(F = 7.28, df = 1/72, p < .01). The means
are presented in Table 2.

Although the results of the analysis of
variance were consistent with reinforcement
theory, inspection of the data in Table 2
reveals that reinforcement theory was only
fully supported in the short summary condi-
tion. The greatest attraction was found in
the positive-positive condition, the least at-
traction in the negative-negative condition,
with intermediate degrees of attraction in the
negative-positive and positive-negative condi-
tions. In the lengthy description condition,
although the least attraction occurred in the
negative-negative condition, there appeared to
be little difference among the negative-posi-
tive, positive-negative, and positive-positive
conditions. This is inconsistent with both
gain-loss theory and reinforcement theory
of interpersonal attraction. Taken as a whole,
however, the results of Experiment II appear
to be more consistent with the reinforcement
theory than with the gain-loss theory of
interpersonal attraction.
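
The ordinal comparison described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not an analysis reported in the study: the means are the Table 2 values (the two negative-positive entries are taken to be .90 and .30, an assumption about the printed figures), the dictionary layout and function name are hypothetical, and the check is purely ordinal, so it says nothing about statistical significance.

```python
# Mean liking for the evaluator (scale from -3 to +3), keyed by sequence:
# PP = positive-positive, NP = negative-positive,
# PN = positive-negative, NN = negative-negative.
TABLE_2 = {
    "lengthy description": {"PP": 1.00, "NP": 0.90, "PN": 1.20, "NN": -0.60},
    "short summary": {"PP": 1.30, "NP": 0.30, "PN": 0.60, "NN": -0.60},
}


def fits_reinforcement_ordering(means):
    """True when PP is highest, NN is lowest, and both mixed sequences fall between."""
    mixed = (means["NP"], means["PN"])
    return means["PP"] > max(mixed) and min(mixed) > means["NN"]


for label, means in TABLE_2.items():
    print(f"{label}: ordering fits = {fits_reinforcement_ordering(means)}")
```

Under these assumptions the ordering holds for the short summary condition and fails for the lengthy description condition, where the positive-negative mean exceeds the positive-positive mean.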


DISCUSSION 


All predictions made by reward theory in 
Experiment I were confirmed. When both 
evaluations were delivered by Person B, there 
was more attraction to B when the first 
evaluation was favorable than when it was 
unfavorable, and there was more attraction 
to B when the second evaluation was favor- 
able than when it was unfavorable. When 
the first evaluation was unfavorable and the 
second evaluation was favorable, the subjects 
were more attracted to B when B delivered 
only the last evaluation than when B 
delivered both evaluations. When the first 
evaluation was favorable and the second 
evaluation was unfavorable, the subjects 
were more attracted to B when B delivered 
both evaluations than when B delivered only 
the last evaluation. In Experiment II, the 
results tended to be more consistent with 
reward theory than with gain-loss theory.
In the short summary condition, the subjects
were most attracted to the evaluator in the
positive-positive condition and least attracted
to the evaluator in the negative-negative 
condition. In the lengthy description condi- 
tion, the subjects were least attracted to the 
evaluator in the negative-negative condition, 
although there appeared to be little difference 
among the remaining three conditions. 

In the light of the results of the present 
series of studies and the results presented 
by Sigall and Aronson (1967) and Mettee 
(1971), it would appear that reward or re- 
inforcement effects are more general and per- 
haps more powerful than the effect produced 
by a gain or loss in esteem. Byrne’s (1969)
reinforcement theory of interpersonal attrac- 
tion has received wide support, and it is not 
surprising that the theory should be sup- 
ported in a situation involving a sequence of 
personal evaluations. The conditions under 
which a gain or loss of esteem effect will be 
obtained appear to be somewhat limited. 

One condition that may be crucial is 
whether the evaluations are communicated 
directly to the subject or whether they are 
inadvertently overheard. In the present series 
of studies and in the studies by Sigall and
Aronson (1967) and Mettee (1971), the 
evaluator told the subject how she felt about 
her. In the Aronson and Linder (1965) study, 
the evaluations were inadvertently overheard. 
The evaluator communicated her feelings 
about the subject to a third party and was 
presumably unaware that the subject could 
hear her comments. Although it may be per- 
missible to hold a negative attitude toward 
someone, it is usually considered both rude 
and tactless to express these feelings directly 
to the person in question. When an evaluator 
tells a subject that she is untrustworthy, neu-
rotic, deceitful, and unpleasant, not only is
she giving the subject negative reinforcement,
but she may also be giving the subject some
information about herself: that she is rude
and tactless. This would give the subject two 
reasons to dislike the evaluator: the evaluator 
holds a negative opinion of the subject 
(negative reinforcement), and the evaluator 
is a rude and tactless individual. If the 
evaluator subsequently changes her mind and 
comes to view the subject in a favorable light, 
this may remove one source of the subject’s 
dislike for the evaluator but not the second— 
the evidence remains that the evaluator has 
been and could be a rude and tactless indi- 
vidual. The degree to which prior negative 
evaluations have a permanent effect in serving 
to reduce the evaluator’s attractiveness may 
well depend on whether the evaluations are 
communicated directly to the subject or 
whether they are inadvertently overheard. 
Subsequent research is needed to investigate 
this possibility. 

Two experiments employing white subjects examined the effects of anonymity,
expected retaliation, race of target, and a campus racial disturbance on deliv-
ered and anticipated aggression (electric shock). Prior to statistical treatment,
the data were subjected to principal components analyses, with three aggression
components being identified: general direct aggression, extremes in direct
aggression, and indirect aggression. In Experiment I, it was found that less
direct and more indirect forms of aggression were delivered to black than to 
white targets when there was opportunity for the target to retaliate. When 
retaliation was unlikely, the subjects delivered more direct forms of aggression 
to black than to white targets. Following a campus racial disturbance, there 
were increases in direct forms of aggression toward black targets, with such 
aggression now being less dependent on the opportunity for retaliation (Ex- 
periment II). In both experiments more direct aggression was anticipated 
from black than from white targets. The results support the conclusion that 
white persons have learned to fear black retaliation, but that this fear acts 
only to inhibit direct forms of aggression in certain defined situations. 


Among the most salient characteristics of 
the current black movement is the use of vio- 
lence or the threat of violence to produce 
equality between black and white people. 
Black authors (e.g., Baldwin, 1963), as well
as certain black militant leaders, have empha-
sized that the black man is determined to get
equal rights and that white society must 
change or face reprisals. It is not surprising 
then to find some anecdotal evidence which 
suggests that white people have learned to 
fear black retaliation. A good example is an 
excerpt from a letter to the editor published 
by the Northern Illinois University student 
newspaper, The Northern Star: 


We are going to ask that our names be withheld 
from this letter. Don’t bother writing in to sneer, 
‘what'sa matter whitey, you scared?’ Yeah, we are. 
You militants may feel free to gloat. Those of us,
both black and white, who want peace: sorry but
once again we must hang our heads in shame. It’s a
sad time when one cannot stand and say, “I wrote
that” without fear of attack [March 22, 1969].


1 This study is based on a thesis presented by the
first author to the Department of Psychology at
Northern Illinois University in partial fulfillment of
the requirements for the MA degree. It was reported
at the annual meeting of the Midwestern Psycho-
logical Association, Cincinnati, April 1970. The first
and second authors are now at Florida State Uni-
versity.

2 Requests for reprints should be sent to Seymore
Simon, Department of Psychology, Northern Illi-
nois University, DeKalb, Illinois 60115.


Another is an account by a Chicago Con-
spiracy 7 juror, Kay S. Richards, as reported
by the Chicago Sun-Times:


The point is, I was very much afraid of Bobby
Seale . . . I am afraid of Black Panthers. And all
through the first part of the trial I had a fear that
when I got out of there my life wouldn’t be my own
if I convicted Bobby Seale. I would be very much
afraid to convict Bobby Seale [February 24, 1970].3


Given that white people have acquired a 
fear of black retaliation, it is interesting to 
speculate about the effect this fear might have 
on interracial relations, and particularly on 
interracial aggression. Berkowitz (1962) has 
suggested that the level of aggression which an
individual administers to a target varies di-
rectly with both the strength of the instigation
to aggress and the aggressive cue value of the
target, and inversely with the intensity of
inhibitions against aggression. A number of
studies bearing on this conceptualization have
found that one variable that functions as an
effective inhibitor of aggressive acts is the
expectation of punishment (e.g., Bandura,
1965; Bandura, Ross, & Ross, 1963; Edwards,
1967). In the present context, the results of
these studies suggest that fear of black retali-
ation by white subjects should generally act
to reduce aggression toward black targets.
More specifically, the level of aggression di-
rected toward black targets should be low
under situational conditions designed to maxi-
mize this fear and high under those condi-
tions designed to reduce it.

3 Reprinted with permission from the Chicago
Sun-Times.

Some support for this notion comes from a 
preliminary study conducted by the present 
authors. White male subjects were provided
with an opportunity to aggress against either 
a black or white target using electric shocks 
of differential intensities. Each subject in the 
first experimental group was introduced to 
his target openly, the target being brought 
into the experimental room to meet the sub- 
ject. At this time, the experimenter clearly 
indicated that the subject would be the person 
delivering the shocks. For these subjects then, 
the target could potentially retaliate, and any 
fear of such action by blacks was presumably 
high. Each subject in a second group viewed 
the target over what was purported to be a 
one-way closed-circuit television system. Since 
no camera was present in the experimental 
room and the experimenter made no mention 
of a subsequent meeting between the subject 
and the target, a subject in this condition 
could reasonably believe that the target could 
not identify him as the aggressor. For these 
subjects, the opportunity for retaliation was 
thus minimized, and fear of such action by 
blacks was presumably lower than for subjects 
in the first group. Consistent with expecta- 
tion, an analysis of the high-intensity shocks 
revealed that subjects paired with black 
targets administered reliably more intense 
shocks when they were led to believe that 
they were aggressing anonymously than when 
they were known to their targets. The sub- 
jects shocking white targets showed no differ- 
ences in aggressiveness as a function of the 
same situational conditions, their perform- 
ances being essentially the same under the 
anonymous and face-to-face conditions and 
comparable to those subjects shocking black 
targets under anonymous conditions. Unfortu-
nately, since the subjects in the two experi-
mental groups differed with respect to both
target anonymity and proximity, a completely
unambiguous interpretation of the results was
precluded.

Under improved design conditions, it was of 
interest in the current study (a) to establish
experimentally that white persons have 
learned to fear black retaliation and (b) to 
examine some of the variables related to this 
fear and how they might operate to facilitate 
or inhibit aggression toward blacks. As in the 
initial experiment, white subjects were pro- 
vided with an opportunity to aggress against 
either a black or a white target under condi- 
tions designed to either maximize or reduce 
the subject’s expectancy for counteraggres- 
sion. This expectancy was manipulated in two 
independent ways. First, as in the preliminary 
study, the anonymity of the aggressor relative 
to the target was varied. The subjects in one 
experimental group were led to believe that 
the target against whom they were to aggress 
could identify them, while the subjects in a 
second group were assured that this individual 
would never become aware of their identities. 
However, since the anonymity variable could 
only be related to the opportunity to retaliate 
outside of the experimental situation, a varia-
ble having more immediate consequences was
also introduced. Half of the subjects under
each anonymity condition were told prior to
aggressing that they would subsequently
switch roles with the target, while the other
half of the subjects were given these instruc-
tions after they had aggressed. Finally, to
obtain a direct measure of perceived differ- 
ences in counteraggressive intent, and hence 
presumed fear of such action, all subjects were 
asked to indicate the levels of shock they 
anticipated receiving from the target. 

On the assumption that fear of retaliation 
would be relatively high when aggressing 
against a black target, it was predicted that 
lower levels of aggression would be delivered 
to black targets under conditions favoring the 
opportunity for retaliation than under condi- 
tions minimizing such opportunity. Moreover, 
since fear of retaliation was assumed to be 
higher for black than for white targets, the 
same situational conditions were expected to
have a lesser effect in differentially influencing
aggression toward white targets. It was also 
expected that, independent of delivered ag- 
gression, subjects would anticipate higher 
levels of counteraggression from black than 
from white targets. 


EXPERIMENT I 
Method 


Subjects. The subjects were 80 white male intro- 
ductory psychology students at Northern Illinois 
University who volunteered to participate for extra 
course credit for “an experiment in learning.” 

Apparatus. A modified Buss (1961) “aggression 
machine” was used. The front panel consisted of 
eight buttons numbered in consecutive order. Under
Buttons 1, 2, and 3 was the word LOW; under 4 and
5, the word MEDIUM; and under 6, 7, and 8, the
word HIGH. At the side of the machine was a
single reward button. A compatible unit in a control
room, including a Hunter Klockounter, enabled the 
experimenter to monitor which one of the buttons 
was pressed as well as the duration of pressing. A 
Sony videorecorder, monitor, and camera were used 
for preparing and showing the target tapes. 

Procedure. The experimental procedure comprised
four phases: (a) general introduction, (b) anonymity
instructions, (c) role-switch instructions and admin-
istration of aggression to the target, and (d) antici-
pation of aggression from the target.

General introduction. The subject was ushered into
a room that was arranged to appear as though 
various behavioral and physiological measures could 
be monitored. An effort was made to assure the sub-
ject that the experimenter would be busy recording 
a number of these measures while the subject was 
occupied with his task. The subject was then told 
that the experiment was concerned with the effects 
of reward and punishment on paired-associate learn- 
ing, and the study-test method of learning paired 
associates was described. It was further explained 
that the experimenter was concerned not only with 
the rate of learning, but also with the various 
physiological correlates of reward and punishment, 
and that since numerous instrument readings had to 
be taken during testing, additional assistance was 
needed in carrying out the experiment. Introductory 
psychology students were therefore being asked to 
operate some of the apparatus. It was noted that 
the other student who would be learning the paired 
associates had arrived earlier and was in another 
room and that the subject’s role would be to operate 
the reward and punishment equipment by delivering 
a light (reward) each time the learner made a 
correct response and an electrical shock (punish- 
ment) each time he made an error. 

Anonymity instructions. Under the pretext of
checking to see if the learner was ready to begin,
the videomonitor was turned on, and the subject was
led to believe that he was viewing the other student
over closed-circuit television. At this point the
subject was placed into one of two experimental
conditions. In the nonanonymous condition a camera
was placed facing both the experimenter and the
subject, and a tape of either a black or white male
college student (target) was shown. The target was
shown wearing earphones and having an electrode
attached to the palm of his right hand. The tapes
were made so that the experimenter appeared to be
communicating with the target. The subject was
introduced by the experimenter to the target, who
responded and noted that he could see both the
experimenter and the subject very clearly. The target
was then told by the experimenter that the subject
would be delivering the shock and controlling the
light during the experiment. The videotape was then
turned off, and an explanation was made that “the
other person” might be distracted from learning if
it were left on.

Instructions and procedures in the anonymous
condition were similar to those in the nonanonymous
condition. However, there was no videocamera in the
room, the experimenter did not introduce the subject
to the target, and the experimenter explicitly told
the target that he would not know who was deliver-
ing the shock and light to him either during or
after the experiment. The reason given to the sub-
ject for this latter precaution was that nonanonym-
ity distracted the other student from learning. It
was further emphasized that since the target would
be returning to participate in another, but related,
phase of the experiment, it would be necessary for
the subject to leave several minutes before the target.

TABLE 1
COEFFICIENTS AND VARIANCES OF THE THREE PRINCIPAL COMPONENTS FOR ALL SUBJECTS ON
DELIVERED AND ANTICIPATED AGGRESSION SCORES (EXPERIMENTS I AND II)

                       Component 1             Component 2             Component 3
Measure            Delivered Anticipated   Delivered Anticipated   Delivered Anticipated
Mean intensity       .6881     .7023        -.1281    -.0647        -.7142    -.7089
Mean duration        .2756     .1450         .9567     .9880         .0939     .0534
Sum high shocks      .6712     .6969        -.2614    -.1403         .6936     .7033
% total variance      62        61            31        33             7         6

Role-switching instructions. The subject was then
taken to another room in which the aggression ma-
chine was located, and its use was explained. He was
told that when the learner made an incorrect re-
sponse, he could deliver any one of eight different
intensities of shock, and that shock increased from
low shocks of 1, 2, and 3 to high shocks of 6, 7,
and 8. He was further told that when the other
student made a correct response, he was to push a
reward button which lit a bulb on the learner’s
panel. A list of 48 CVC (consonant-vowel-consonant)
nonsense syllables of medium meaningfulness value
(Noble, 1961), presumably representing the order of
correct nonsense syllable responses on the test list,
was then handed to the subject. Following an expla-
nation of this list, the subject was placed in one of
two role-switching conditions. In the prior-role-
switch condition, he was told that after completing
the present phase of the experiment, he and the
other student would be switching positions such
that he would then be learning some paired associates
and the other student would be controlling the light
and delivering shocks. In the subsequent-role-switch
condition, the subject was not informed that he
and the target would switch positions until after he
had completed the light and shock trials. For all
subjects, the experimenter then left the room and
played an audiotape of the target attempting to re-
call the correct paired-associate responses. The tape
was constructed so that there were exactly 6 correct
responses and 10 incorrect responses (all intralist
intrusions) in each of three blocks of 16 words, a 
total of 30 shock measures being obtained from each 
subject. 
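
As a quick arithmetic check on the trial structure just described, the following Python sketch counts the reward and shock trials implied by three blocks of 16 words with 6 correct and 10 incorrect responses each. The function and variable names are illustrative only and do not come from the original report.

```python
# Trial structure of the audiotaped "learner" responses (values from the Method).
BLOCKS = 3                # blocks of 16 words
CORRECT_PER_BLOCK = 6     # responses followed by the reward light
INCORRECT_PER_BLOCK = 10  # responses followed by a shock


def trial_counts(blocks=BLOCKS, correct=CORRECT_PER_BLOCK,
                 incorrect=INCORRECT_PER_BLOCK):
    """Return (words per block, total words, reward trials, shock trials)."""
    words_per_block = correct + incorrect
    return (words_per_block, blocks * words_per_block,
            blocks * correct, blocks * incorrect)


words_per_block, total_words, reward_trials, shock_trials = trial_counts()
print(words_per_block, total_words, reward_trials, shock_trials)  # 16 48 18 30
```

The 48 total words agree with the 48-syllable list handed to the subject, and the 30 shock trials agree with the 30 shock measures obtained from each subject.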

Anticipation of aggression. After the subject had 
delivered shock to the target and was aware that 
he would be switching roles, it was explained that 
before proceeding, the experimenter was interested in 
knowing what levels of shock the subject expected 
to receive from the other student. The subject was 
told that each time a light flashed, he was to imagine 
he had made an error and indicate the shock level 
he expected to receive by pushing one of the eight 
shock buttons. The experimenter then returned to 
the control room and signaled a total of 30 consecu- 
tive simulated errors. Following the last error, the 
experimenter waited 3 minutes and returned to 
explain that since the other student had asked to 
leave, the subject was to leave immediately. 

The final design reflected a 2 X 2 X 2 X 2 factorial, 
with race of target (black and white), anonymity 
(nonanonymous and anonymous), role-switching 
instructions (prior and subsequent), and target repli- 
cation (two targets nested within each race) treated 
as factors. There were five subjects randomly assigned 
to each of the 16 cells, with assignment being made 
prior to a subject’s appearance for the experiment. 
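
The design described above can be enumerated in a few lines of Python. This is only a sketch of the cell structure: the target labels "A" and "B" are hypothetical placeholders for the two targets nested within each race, and the error degrees of freedom assume a fixed-effects analysis with no covariate.

```python
from itertools import product

# Factor levels as named in the Method; target labels are hypothetical.
RACES = ("black", "white")
TARGETS_PER_RACE = ("A", "B")          # two targets nested within each race
ANONYMITY = ("nonanonymous", "anonymous")
ROLE_SWITCH = ("prior", "subsequent")
SUBJECTS_PER_CELL = 5


def design_cells():
    """List every cell of the 2 x 2 x 2 x 2 design as a dictionary."""
    return [
        {"race": race, "target": f"{race} {target}",
         "anonymity": anonymity, "role_switch": role_switch}
        for race, target, anonymity, role_switch
        in product(RACES, TARGETS_PER_RACE, ANONYMITY, ROLE_SWITCH)
    ]


cells = design_cells()
total_subjects = len(cells) * SUBJECTS_PER_CELL
error_df = total_subjects - len(cells)  # within-cell degrees of freedom
print(len(cells), total_subjects, error_df)  # 16 80 64
```

The 16 cells of 5 subjects account for all 80 subjects, and the 64 within-cell degrees of freedom are consistent with the df = 1/64 reported for the analyses of variance in Experiment I.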


Results 


The major dependent measures were mean 
shock intensity, mean shock duration, and the 
sum of high-intensity shocks (Response But- 
tons 6, 7, and 8). Since a multivariate re- 
sponse system was of primary interest, the 
total correlation matrices of both the delivered 
and anticipated aggression data were first 
subjected to separate principal-components 
analyses. Since the principal components ob- 
tained in this experiment were highly com- 
parable to those found in an independent 
analysis of Experiment II, the data from both 
experiments were pooled in defining the new 
variates. Analyses of variance were then con-
ducted on the three (orthogonal) principal-
component variables. (A rationale for such
an approach can be found in Dempster,
1963.) To eliminate negative values, a con-
stant of 5 was added to all transformed
scores.4
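
The transformation can be illustrated with the delivered-aggression coefficients from Table 1. The sketch below checks that the three coefficient vectors are orthonormal, as expected of principal components, and projects a hypothetical subject's standardized measures onto them before adding the constant of 5. It assumes that scores are a plain weighted sum of standardized measures; the report does not state whether the scores were rescaled further, and the example subject is invented for illustration.

```python
import math

# Delivered-aggression coefficients as read from Table 1
# (order: mean intensity, mean duration, sum of high shocks).
DELIVERED = {
    1: (0.6881, 0.2756, 0.6712),    # general direct aggression
    2: (-0.1281, 0.9567, -0.2614),  # indirect aggression
    3: (-0.7142, 0.0939, 0.6936),   # extremes in direct aggression
}
SHIFT = 5.0  # constant added to eliminate negative values


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def component_scores(z, coefficients=DELIVERED, shift=SHIFT):
    """Project standardized measures z onto each component, then add the shift."""
    return {k: dot(w, z) + shift for k, w in coefficients.items()}


# Unit length on the diagonal, orthogonality off the diagonal
# (up to rounding in the published four-decimal values).
for i, u in DELIVERED.items():
    for j, v in DELIVERED.items():
        expected = 1.0 if i == j else 0.0
        assert math.isclose(dot(u, v), expected, abs_tol=1e-3)

# Hypothetical subject: one SD above the mean on intensity and on
# high shocks, average on duration.
scores = component_scores((1.0, 0.0, 1.0))
print(scores)
```

For this invented subject the first component score is well above 5 while the third stays close to 5, since the third component contrasts the high-shock and mean intensity measures.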

The principal components of both the an- 
ticipated and delivered shock data are pre- 
sented in Table 1. Since the structures are 
very similar, the corresponding components 
may be interpreted together. The first prin- 
cipal component accounts for approximately 
62% of the total variance and is dominated 
by intensity-related measures. Since differ- 
ences in intensity are associated with easily 
discriminable buttons which ostensibly pro- 
duce differentiable shocks, it seems reason- 
able to label this component general direct 
aggression. The second principal component
represents an alternative measure of aggres- 
sion, with duration contributing heavily to 
this factor. Inasmuch as differences in dura- 
tion are likely to be less discriminable by the 
subject than differences in intensity, this com- 
ponent may be thought of as reflecting a type 
of aggression that is somewhat less direct 
than that measured by the first component. 
Approximately 32% of the total variance is 
accounted for by this component, which may 
be labeled indirect aggression. The third com- 
ponent provides a contrast between the values 
of the standardized high-shock and mean 
intensity measures. High scorers on this com- 
ponent tended to deliver either very high-level 
or very low-level shocks, or a combination of 
these. Low scorers tended to use predomi-
nately buttons within the intermediate range.
Approximately 6% of the total variance is
accounted for by this component, which may
be characterized as a measure of extremes in
direct aggression.

4 For interested readers, analyses of variance results
on the raw scores can be found in an appendix of
Donnerstein’s (1969) thesis.

Fig. 1. Mean general direct aggression delivered to
black and white targets as a function of aggressor
anonymity (Experiment I). [Horizontal axis: anonymity conditions.]

Analyses of variance, with race of target,
anonymity, role-switch instructions, and
target replication treated as factors were first
performed on the delivered aggression com-
ponents.5 The analysis on the first component
revealed significant effects for anonymity
(F = 8.00, df = 1/64, p < .01), Race X Ano-
nymity (F = 4.65, df = 1/64, p < .05), and
Race X Instructions (F = 7.96, df = 1/64,
p < .01). The interactions are presented in
Figures 1 and 2. As is clear, the subjects
shocking a black target delivered significantly
lower levels of general direct aggression when
they were not anonymous than when they
were anonymous, and when they received
prior-role-switch instructions as contrasted
with subsequent-role-switch instructions. The
subjects shocking a white target showed no
differences in aggression as a func-
tion of the same situational conditions. In
both figures the differences between black
and white targets under comparable treat-
ment conditions were statistically reliable. It
should also be noted that the tendency for
subjects paired with black targets to deliver
the lowest levels of aggression under condi-
tions of both nonanonymity and prior-role-
switching and the highest levels under anony-
mous and subsequent-role-switch conditions
is contained in the nonsignificant Race X Ano-
nymity X Instructions interaction (p < .20).

5 The targets were obtained without any attempt
to sample randomly from populations of black and
white targets. A fixed-effects model was therefore
employed in all data analyses since inferences are
limited to the particular targets chosen.

The analysis on the second principal com-
ponent revealed significant effects for race
(F = 4.18, df = 1/64, p < .05), anonymity
(F = 4.09, df = 1/64, p < .05), and Race X
Anonymity (F = 4.02, df = 1/64, p < .05).
The interaction is presented in Figure 3. As
can be seen, the subjects paired with a black
target delivered a higher level of indirect
aggression under nonanonymous than anony-
mous conditions, while the subjects shocking
a white target did not exhibit differential
levels of this kind of aggression over anonym-
ity conditions. The analysis on the third prin-
cipal component did not reveal any significant
sources of variance.

Fig. 2. Mean general direct aggression delivered to
black and white targets as a function of locus of
retaliation instructions (Experiment I). [Vertical axis: mean general direct aggression; horizontal axis: locus of retaliation (prior-role-switching, subsequent-role-switching).]

Separate analyses of covariance were next 
conducted on the first and second principal 
components of the anticipated aggression 
data. Due to marked heterogeneity of regres- 
sion, an analysis of variance was performed 
on the third component. In the covariance 
analyses, delivered scores were used as the 
covariate, and the reported means are cor- 
respondingly adjusted. The analysis on the 
first component revealed only a significant 
effect for race (F = 35.77, df = 1/63, p <
.001), with subjects shocking a black target
anticipating a higher level of general direct 
aggression than subjects shocking a white 
target. Respectively, the transformed means 
were 4.76 and 3.13; the within-groups regres- 
sion coefficients were .74 and .66. 

The same analysis on the second component 
yielded a significant main effect for race 
(F= 5.57, df = 1/63, p < .05), and a reli- 
able Anonymity X Target Replication inter- 
action (F = 3.21, df = 2/63, p< .05). The 
interaction is primarily due to subjects 
anticipating a higher level of indirect ag- 
gression from one of the white targets 
(M = 4.87) than from one of the black 
targets (M = 3.21) under nonanonymous con- 
ditions. All other means were comparable 
and averaged 4.09. The regression coefficients 
for black and white targets were .98 and .67, 
respectively. 

The analysis of variance on the third com- 
ponent revealed a marginally significant ef- 
fect for race (F = 3.60, df = 1/64, p < .10) 
and a significant Race X Anonymity interac- 
tion (F=4.58, df=1/64, p< .05). The 
interaction is attributable to the fact that 
subjects expected lower levels of extremes in 
direct aggression from white (M = 3.75) 
than from black (M = 4.12) targets under 
anonymous conditions; under nonanonymous 
conditions, the means were comparable and 
averaged 3.95. It was of additional interest
to demonstrate that the foregoing difference
was independent of delivered aggression and
primarily due to changes over anonymity
conditions in anticipated aggression from
white but not black targets. Accordingly,
analyses of covariance were also conducted on
the third component scores for black and
white targets separately. For white targets,
there was only a significant main effect for
anonymity (F = 9.71, df = 1/31, p < .01),
the adjusted means being 4.11 and 3.74 for
nonanonymous and anonymous conditions, re-
spectively. There were no reliable effects for
black targets, the corresponding means being
4.10 and 4.12. The regression coefficients
were .04 and .87 for black and white targets,
respectively.

Fig. 3. Mean indirect aggression delivered to black
and white targets as a function of aggressor ano-
nymity (Experiment I). [Vertical axis: mean indirect aggression; horizontal axis: anonymity conditions (nonanonymous, anonymous).]


EXPERIMENT II 


Approximately 2 weeks after Experiment I 
had been completed, a racial disturbance
occurred on the Northern Illinois University 
campus. An estimated 200 black students 
were reported to have participated in what 
resulted in minor injury to a number of 
whites and damage to university property. 


Rumors were also spread of possible open 
conflict between black and white students. 
This type of disorder had never occurred on 
the campus before. 

The purpose of Experiment II was to ex- 
amine the effects that these events might 
have had on interracial aggression. Since the 
violence associated with such a disturbance 
could be considered analogous to those events 
initially producing fear of black retaliation, 
one hypothesis might be that an increase in 
this fear followed the disturbance. If so, then 
under conditions favoring the opportunity for 
retaliation, for example, nonanonymity, the
subjects paired with black targets should be
expected to deliver lower levels of more direct 
kinds of aggression after the disturbance than 
before it. Based solely on increases in fear, 
no change in aggression would be predicted 
when the subjects aggressed anonymously, 
however, since fear of retaliation is assumed 
to be minimal under these conditions. It 
would also be expected that anticipated ag- 
gression from black targets would increase 
over time. 

An alternative hypothesis might be that 
generalized anger and hostility were aroused 
by the disturbance, and that this had the 
effect of strengthening the tendency to aggress 
against black targets. Under this assumption, 
the anonymity variable should be less effec- 
tive in controlling behavioral aggression 
toward blacks, with the result that higher 
levels of direct aggression would be delivered 
to black targets after the disturbance than 
before it. 


Method 


Subjects. The subjects were 28 white male intro- 
ductory psychology students at Northern Illinois 
University. None of these subjects had served in 
Experiment I. All subjects participated between 2 
and 5 days after the disturbance. 

Procedure. Since it was important to conduct the 
study as soon after the disturbance as possible, the 
experimental design constituted only a partial repli- 
cation of Experiment I. Twenty subjects were ran- 
domly assigned to one of two black targets, with 
10 subjects each aggressing under either anonymous 
or nonanonymous conditions. As a control for gen- 
eral rather than specific increases in arousal or fear, 
the remaining 8 subjects were randomly assigned to 
one of two white targets, with 4 subjects under 
each of the two anonymity conditions. All subjects
were without prior knowledge that they would
switch roles with the target after they aggressed. In
all other respects, the general procedure was identical
to that employed in Experiment I.


Results 


Unweighted-means analyses of variance,
with time (before or after the disturbance),
race of target, anonymity, and target replica-
tion treated as factors were first performed
on the principal components of the delivered
aggression data. The before-level data were
taken from those subjects under subsequent-
role-switch instructions in Experiment I. The
analysis on the first component revealed only 
a significant effect for race (F = 7.99, df= 
1/52, p<.01), with subjects shocking a 
black target (M = 4.62) delivering higher 
levels of general direct aggression than sub- 
jects shocking a white target (M = 3.57). 
While the Time x Race (p = .22) and Time 
x Race X Anonymity (p = .24) interactions 
were not found to be significant, it should be 
noted that these tests would be sensitive only 
to relatively large interaction effects because 
of the small number of white targets employed 
in the replication. A more powerful procedure 
for assessing the effects of the riot on possible 
changes in aggression toward blacks would be 
to conduct a Time X Anonymity X Target
Replication analysis of variance for those sub- 
jects aggressing against black targets alone. 
The results of this analysis indicated signifi- 
cant effects for time (F = 5.56, df = 1/32, 
p<.05) and anonymity (F = 5.68, df = 
1/32, p < .05), with subjects delivering more 
general direct aggression to black targets after 
(M = 5.16) than before (M = 4.09) the riot, 
and under anonymous (M = 5.17) than non- 
anonymous (M = 4.08) conditions. Also, the 
Time X Anonymity interaction was margin-
ally significant (p = .11), with differences in
aggressiveness between anonymous and non-
anonymous conditions tending to be smaller
after the riot (DM = .34) than before it
(DM = 1.82). Contrary to the changes noted
for black targets, the mean general direct
aggression scores for white targets before and
after the riot were virtually identical.

The unweighted analysis on the second 
component revealed significant effects for 
anonymity (F = 7.71, df = 1/52, p < .01)
and target replication (F = 4.32, df = 1/52,
p < .05). The subjects delivered a lower level
of indirect aggression under anonymous
(M = 3.63) than under nonanonymous condi-
tions (M = 4.22). Additionally, a higher
level of indirect aggression was administered 
to one of the black targets (M = 4.46) than 
to both white targets (M = 3.80), the dif- 
ference between the other black target 
(M = 3.63) and the white targets not being 
reliable. While the Time X Race X Anonym- 
ity interaction in the unweighted analysis was 
not significant (p= .15), the Time X Ano- 
nymity interaction was reliable in the analy- 
sis conducted for black targets alone (F = 
4.20, df=1/32, p< .05), with subjects 
aggressing under nonanonymous conditions 
delivering lower levels of indirect aggression 
to black targets after the riot (M = 4.16) 
than before it (M = 4.76); under anonymous 
conditions, the subjects showed little change 
in this kind of aggression toward black targets 
over time (M = 3.67). For white targets, the 
means before and after the riot were similar 
and averaged 3.81. 
The unweighted analysis on the third com-
ponent indicated a significant Time X Race
interaction (F = 6.01, df = 1/52, p < .05),
which was due to increases in extremes in
direct aggression toward black targets and
decreases toward white targets after the riot.
The before and after means for black targets
were 3.93 and 4.23, respectively; for white
targets, they were 4.10 and 3.83.
Unweighted-means analyses of covariance 
were also conducted on the anticipation data. 
The analysis on general direct aggression 
yielded only a significant effect for race 
(p< .05), this again reflecting the fact 
that subjects paired with a black target 
(M = 4.50) anticipated a higher level of 
aggression than subjects paired with a white 
target (M = 3.24). The regression coefficients 
for black and white targets were comparable 
and averaged .60. The analyses on the second 
and third components did not reveal any 
significant or marginally significant sources 
of variance.

DISCUSSION


The present results are generally consistent
with the hypothesis that white people have
learned to fear black retaliation. Evidence
supporting this conclusion was provided by 
the anticipation data which indicated that in- 
dependent of delivered aggression, the subjects 
paired with black targets expected higher 
levels of more direct kinds of aggression than 
the subjects paired with white targets. Given 
the component interpretations, the finding 
that aggressive intent of a relatively direct 
nature was attributed to blacks would seem 
to be in accordance with the aims of black 
militant leaders who have argued that wide- 
spread violence and the fear of its occurrence 
is the only means of achieving racial equality. 
Whether such perceptions with respect to ag- 
gressive behavior might generally influence 
interracial relationships, such as by providing 
a basis for lessened prejudicial attitudes 
toward blacks, is of course a critical question 
which remains to be answered. 

The anticipation data of Experiment I also 
revealed that subjects expected higher levels 
of indirect aggression from one of the white 
targets, and more extremes in direct aggres- 
sion from both white targets, under non- 
anonymous than anonymous conditions. A 
possible explanation of these results may be 
found in Berkowitz’s (1962) formulation that 
persons associated with aggressive behavior 
are likely to take on aggressive cue value. 
Thus, white subjects openly (nonanony- 
mously) aggressing against their particular 
(white) targets might have perceived them- 
selves as becoming more specific and justifi- 
able targets for counteraggression than the 
subjects aggressing under anonymous condi- 
tions. Since it is unclear why the same results 
were not obtained for either of the white 
targets on the first component, however, this 
(post hoc) interpretation should be accepted 
with caution pending replication. 

Additional evidence that blacks have suc- 
ceeded in instilling fear of retaliation in whites 
was provided by the delivered aggression data 
in Experiment I. It was found that white 
subjects delivered significantly less general 
direct aggression to black targets under condi- 
tions favoring the opportunity for either im- 
mediate (prior-role-switching instructions) or 
future (nonanonymity) counteraggression than 
under conditions minimizing such opportunity.
Moreover, the level of aggression administered
to black targets under prior-role-switch and
nonanonymous conditions was reliably lower
than that delivered to white targets, to whom
no differential aggression was given as a func-
tion of the same situational conditions. These
results are consonant with a large number of
studies showing that fear of punishment gen-
erally serves to inhibit aggressive behavior,
and they agree well with a recent report that 
subjects who believe that their targets will 
counteraggress with a painful shock select 
lower shock levels themselves than do sub- 
jects who expect counteraggression to be un- 
pleasant but not painful (Shortell, Epstein, 
& Taylor, 1970). In the same study, a sig- 
nificant negative correlation was found be- 
tween ratings of an opponent’s aggressiveness 
and a subject’s shock settings (r = -.67).

In discussing conceptions of equity in human 
interaction, Walster, Berscheid, and Walster 
(1970) cite research which indicates that 
anticipated contact (namely, nonanonymity) 
with a (white) victim generally serves to dis- 
courage the use of justification techniques for 
aggressive actions by harm-doers. Specifically,
subjects expecting to see their victims again 
tend to derogate them less than when no 
further contact is expected. Derogation is also 
minimized when subjects anticipate retaliation 
from their victims. In the latter instance, the 
results are attributed to the equity-establish- 
ing effects of anticipated counteraggression, 
while in the former, to the decreased likeli- 
hood of the harm-doer being able to provide 
a credible basis for his actions when publicly 
confronted by his victim. On the assumption 
that the same pattern of results would be ob- 
tained with black subjects as targets, this 
research, together with the present findings, 
suggests that the variables of nonanonymity 
and expected retaliation might act to promote 
racial equality by minimizing aggressive be- 
havior by white persons initially and/or by 
arresting justification techniques after aggres- 
sive action has terminated. 

The present data also indicate, however, 
that subjects exhibit a less direct form of 
aggression when there is opportunity for the 
target to retaliate. It was found that subjects 
paired with black targets delivered higher 
levels of indirect aggression under nonanony-
mous than anonymous conditions, and higher levels
of this kind of aggression to black than to
white targets. As restated by Berkowitz
(1962), the notion of response substitution seems
to account for this finding reasonably well.
According to this formulation, the net strength
of an aggressive response is taken as the
difference between the levels of instigation and
inhibition, with direct aggression being greater
than indirect aggression when inhibition is
low, for example, anonymity. However, when
inhibition is high relative to instigation, as
under conditions of nonanonymity in the cur-
rent study, a heightened form of indirect
aggression is likely to be displayed.

That alternate forms of hostility were
resorted to with black but not with white
targets obviously implies differences in the
predispositions of white college students to
aggress against the two racial groups. A
large body of research on the displacement
of aggression toward outgroups (Harding,
Proshansky, Kutner, & Chein, 1969), as well
as several additional findings in the present
study, would appear to support this conclu-
sion. The marked increases in general direct
aggression from nonanonymous to anonymous
and from prior-role-switch to subsequent-role-
switch conditions for black but not white
targets can be taken as supporting evidence
in the current data. Also pertinent is the fact
that the latter results were independent of
black target replication even though the par-
ticular targets represented on the videotapes
were quite unalike. While one depicted a well-
dressed college student with short hair who
seemed a bit exuberant at times, the other
portrayed a student with a more natural Afro-
American look. The latter target spoke in a
very low and deep voice and could be charac-
terized as being somewhat “militant” in ap-
pearance. That subjects showed higher levels
of aggression to both of these targets than
to both white targets under anonymous and
subsequent-role-switch conditions, as well as
lower levels under nonanonymous and prior-
role-switch conditions, provides evidence for
a generalized negative stereotype of black
persons as constituting a particular group
against which to display hostility on the one
hand and as being highly aggressive or
militant on the other.

Experiment II constituted a partial replica- 
tion of Experiment I and was conducted im- 
mediately following a campus racial disturb- 
ance. It was of interest to determine (a) 
whether anticipated and delivered scores after 
the disturbance would be comparable to the 
levels observed in Experiment I and (b) 
whether any observed changes would be con- 
fined to black targets. Consistent with notions 
relating generalized aggression to a particular 
target’s association with observed violence 
(Berkowitz, 1965), the results showed sig- 
nificant changes in delivered aggression for 
black but not white targets, with no reliable 
changes in anticipated aggression being found 
for either target race. Specifically, higher 
levels of direct aggression and reduced levels 
of indirect aggression were delivered to black 
targets after the disturbance than before it. 
Although contrary to the hypothesis that 
increases in fear of black retaliation would 
follow the disturbance, the findings agree well 
with a joint disinhibition and elicitation 
interpretation based on the vicarious instiga- 
tion of anger. Thus, while still anticipating 
more counteraggression from black targets, 
the subjects now displayed comparable de- 
grees of hostility toward blacks under anony- 
mous and nonanonymous conditions, with the 
overall levels of direct kinds of aggression 
being higher than that delivered to the 
same targets under anonymous conditions in 
Experiment I. 

Baldwin (1963) has suggested that the fear 
of black power to retaliate is the only means 
of achieving racial equality. While suggesting 
that white persons have indeed learned to 
fear black retaliation, the present data reveal 
that this fear influences only the form of 
aggressiveness toward blacks and not simply 
its occurrence. It was also found that direct 
forms of aggression were instigated by general- 
ized violence. Although the short-term effects 


® An alternative interpretation might be that the 
results of Experiment I were unreliable. However, 
the findings of both the preliminary study and other 
research in our laboratory discount this possibility. 
of such violence therefore seem not to have
been salutary, it may well be true that the 
very occurrence of conflict and the need for 
its resolution can, in the long run, provide an 
important impetus for producing acceptable 
social change. 


REFERENCES 


Baldwin, J. The fire next time. New York: Dial,
1963.

Bandura, A. Influence of models’ reinforcement con-
tingencies on the acquisition of imitative responses.
Journal of Personality and Social Psychology, 1965,
1, 589-595.

Bandura, A., Ross, D., & Ross, S. A. Vicarious
reinforcement and imitative learning. Journal of
Abnormal and Social Psychology, 1963, 67, 601-
607.

Berkowitz, L. Aggression: A social psychological
analysis. New York: McGraw-Hill, 1962.

Berkowitz, L. The concept of aggressive drive:
Some additional considerations. In L. Berkowitz
(Ed.), Advances in experimental social psychology.
Vol. 2. New York: Academic Press, 1965.

Buss, A. H. The psychology of aggression. New
York: Wiley, 1961.

Dempster, A. P. Stepwise multivariate analysis of
variance based on principal variables. Biometrics,
1963, 19, 478-490.

Donnerstein, E. I. Variables in interracial aggres-
sion: Anonymity, expected retaliation, and a riot.
Unpublished master’s thesis, Northern Illinois Uni-
versity, 1969.

Edwards, N. L. Aggressive expression under threat
of retaliation. Unpublished doctoral dissertation,
University of Iowa, 1967.

Harding, J., Proshansky, H., Kutner, B., & Chein,
I. Prejudice and ethnic relations. In G. Lindzey
& E. Aronson (Eds.), The handbook of social
psychology. Vol. 5. (2nd ed.) Reading, Mass.:
Addison-Wesley, 1969.

Noble, C. E. Measurement of association value (a),
rated associations (a'), and scaled meaningfulness
(m') for the 2,100 CVC combinations of the
English alphabet. Psychological Reports, 1961, 8,
487-521.

Shortell, J., Epstein, S., & Taylor, S. P. Instiga-
tion to aggression as a function of degree of defeat
and the capacity for massive retaliation. Journal
of Personality, 1970, 38, 313-328.

Walster, E., Berscheid, E., & Walster, G. W. The
exploited: Justice or justification? In J. E.
Macaulay & L. Berkowitz (Eds.), Altruism and
helping behavior: Social psychological studies of
some antecedents and consequences. New York:
Academic Press, 1970.


(Received March 26, 1971) 


Journal of Personality and Social Psychology
1972, Vol. 22, No.
ON INTERPERSONAL ATTRACTIVENESS 1
It was proposed that the influence of a pratfall on a superior-ability person’s
attractiveness might depend on the ability level of the “beholder.” Specifically,
it was thought that only persons of average and, hence, noncomparable ability
would find a superior-ability person more attractive as a result of a pratfall.
Conversely, persons of superior ability, because they are comparable, might feel
threatened by a superior-ability person and as a result would perhaps use a
pratfall as an excuse to derogate their ability peer. Two experiments were
conducted to test this notion. The first experiment showed that subjects of
superior intellectual ability disliked a superior-ability person significantly more
if he committed a pratfall, whereas an average-ability person was reacted to
indifferently regardless of whether or not he committed a pratfall. The dislike
shown for the superior-ability-pratfall person was so intense that it fell sig-
nificantly below the liking shown for the average-ability-pratfall person. In
the second experiment, subjects of average intellectual ability tended to dero-
gate a person of average ability if he committed a pratfall and was berated by
a third person, while the same conditions brought about a slight increase in a
superior-ability person’s attractiveness. These results are discussed in terms of
the general pattern that emerged, showing that superior-ability subjects reacted
toward a superior-ability person much as average subjects reacted toward an
average-ability person.


Aronson, Willerman, and Floyd (1966) 
found that in the absence of a pratfall a 
superior- and an average-ability person were 
liked about equally. When a pratfall was com- 
mitted, liking for the superior person in- 
creased, while liking for the average-ability 
person dropped sharply. Aronson et al. were 
concerned with the somewhat surprising, 
albeit nonsignificant, finding that a display 
of human fallibility increased liking for an
otherwise highly competent person. The nega-
tive effect of the pratfall (i.e., the drop in
liking for the average-ability person) was 
highly significant but relatively neglected; 
the present study focuses on this negative 
effect. 

Obviously, when a relatively incompetent
person displays further evidence of his in-
competency, some decrease in liking is to be
expected. However, in the Aronson et al.
study, the magnitude of that decrease seemed
disproportionate to the increase in manifest
incompetency. In the no-pratfall condition,
the large competence differential between the
average and superior persons had a negligible
effect on liking. Given this, it seemed unlikely
that the average-pratfall person was dero-
gated significantly more than the average-
nonpratfall person simply because of the
former’s slightly greater incompetence.


1 A paper by P. C. Wilkins supervised by D. R.
Mettee and based on Experiment I of the present
article was awarded Yale’s 1969 Angier prize. The
research was supported in part by National Science
Foundation Grant GS-2749 to the senior author.

2 Requests for reprints should be sent to David R.
Mettee, Department of Psychology, Yale University,
333 Cedar Street, New Haven, Connecticut 06520.


If the pratfall’s negative effect was not due 
to greater incompetence, what was there about 
the pratfall that lowered an average per-
son’s attractiveness? College subjects in the
Aronson et al. study were listening to another 
college student answer difficult questions that 
they themselves probably could not answer. 
As Morse and Gergen (1970) have shown, 
comparison with an other whose perform- 
ance surpasses one’s own can reduce self- 
esteem, so perhaps these subjects found them- 
selves in a negative comparison situation that 
threatened their self-esteem. Now, a threat to
self-esteem probably arouses negative feelings
toward the source of threat, and, hence, threat-
ened subjects may have been motivated to
derogate the comparable other. However,
derogating an other would seem to have ag-
gressive overtones (Sherif & Sherif, 1953),
and, as Berkowitz (1962) has documented,
the desire to aggress does not necessarily re-
sult in actual aggression; rather, the latter
seems to depend on other factors that func-
tion to release aggressive intents. Perhaps
then, in the Aronson et al. study, a pratfall
indirectly lowered an average person’s attrac-
tiveness because it provided for the release
of derogation against a source of threat to
self-esteem.

This line of reasoning raised two critical 
questions. First, “Why should this negative 
comparison situation have eventually led to 
derogation of the average-ability person, but 
not to derogation of the superior-ability per- 
son?” Second, “What is there about a pratfall 
that would function to trigger derogation?” 

It was thought that derogation was directed
toward the average- and not the superior-
ability person because, paradoxically, the
average-ability person’s performance repre-
sented a greater threat to the subjects’ self-
esteem. As was indicated above, the subjects
heard the quiz questions, undoubtedly guessed
at the answers, and probably found that they
were not doing very well compared to the
superior- or perhaps even the average-ability
person. However, the ability of the randomly
chosen Minnesota sophomore was probably 
much more similar to that of the average- 
ability person, who correctly answered 15 dif- 
ficult questions, than it was to the superior- 
ability person. According to Festinger (1954), 
this should have made the average-ability 
person a much more salient and relevant 
standard for comparison than the superior- 
ability person, who was in effect noncompa- 
rable since his informational talent verged 
on the superhuman (46 of 50 difficult ques-
tions answered correctly). If the average-
ability person’s “comparable” performance
exceeded the subject’s own attempts to an- 
swer the questions correctly, then, according 
to our rationale, the subject’s self-esteem 
would have been threatened. Here, a strong 
tendency to derogate the average performer
might be expected since he would be the source 
of threat to self-esteem. Conversely, the per- 
formance of the intellectually noncomparable 
superior person, though objectively surpass- 
ing the average person’s performance, would 
not threaten the average college student’s 
self-esteem (i.e., what a “brain” does is ir- 
relevant to the average student’s academic 
performance) and therefore there would be 
little desire to derogate. In more general 
terms, as Festinger’s theory of social com- 
parison stipulates, it is the similar other who 
provides the individual with a viable social 
referent for self-evaluation, which in turn 
makes him a potential threat, since this accu- 
rate evaluative information may be negative. 
The “dissimilar” other is not a relevant com- 
parison because, whether higher or lower in 
ability, he is so far removed from one’s own 
operating level that comparison with him fails 
to provide accurate and meaningful self- 
evaluative information. Since the dissimilar 
other is not a source of viable evaluative in- 
formation, either good or bad, he is not a 
potential threat.3

Assuming that it is an ability peer’s per- 
formance that threatens self-esteem, why 
should a pratfall trigger derogation? Will any 
kind of blunder precipitate the wrath of one’s 
peers, or is there something about a pratfall 
in particular that brings out the vindictiveness 
in our compatriots? It is instructive here to 


3 Examples of the greater comparison relevance of 
similar than dissimilar others are numerous. For 
instance, Arnold Palmer is not a relevant comparison 
for the average golfer. The discrepancy in talent is 
just too great for comparison to be meaningful. 
Moreover, if Arnold errs and tops a drive, such a 
display of human weakness might make him more 
attractive. On the other hand, an average golfer does 
provide another average golfer with a meaningful 
and salient standard of golfing skill. Comparison with 
one’s peers provides a rather refined means of deter- 
mining how one measures up. If a peer gets “hot” 
and shoots par golf for nine holes, the average golfer 
may feel some personal distress. His peer’s perform- 
ance would have salience for him in the form of 
strong negative implications for his competence as a 
golfer. In this case, a topped drive off the tenth tee 
might trigger at least momentary derogation since 
it would psychologically “put down” one’s peer, 
thus mitigating the damage and threat to self-esteem. 


examine Aronson et al.’s (1966) description of 
their pratfall: 


the stimulus person clumsily spilled a cup of coffee 
all over himself. On the tape this blunder was 
accompanied by a good deal of noise and clatter . . . 
and the . . . anguished statement, “Oh my goodness. 
I’ve spilled coffee all over my new suit [p. 227].”


Aronson et al. make a clear distinction be- 
tween the blunder and the “hoopla” accom- 
panying it. The pratfall consisted of both 
these elements, and it is quite possible that 
the hoopla made the event humorous. More- 
over, humor may be the ingredient that 
distinguishes an ordinary blunder from a 
full-blown pratfall, and it is precisely this 
humorous element that might be expected to 
allow for derogation. Previous research 
(Landy & Mettee, 1969; Singer, 1968) sug- 
gests that humor allows for the guilt-free 
expression of aggressive impulses, presumably 
because it cloaks or masks the aggressive 
nature of the behavior. Thus, it may be that 
a pratfall is capable of triggering derogation 
because it allows the person to enshroud 
ridicule and derision with laughter, thereby 
disguising the aggressive self-serving nature 
of the behavior. This suggests that not just 
any blunder is capable of triggering the wrath 
of one’s peers. Rather, the blunder may 
have to evoke laughter before it can evoke 
derogation.

The rationale presented above provides a 
basis for contending that in the Aronson et al. 
study, the average-ability-pratfall person was 
highly derogated for reasons other than 
greater incompetency. Nevertheless, the aver- 
age stimulus person who committed a social 
blunder (the pratfall) did manifest the 
greatest degree of incompetency. It is possible 
that this, rather than threatened self-esteem, 
produced the strong derogation. To obtain 
firm support for our rationale, it was neces- 
sary to demonstrate that a person of high 
competency would be derogated if he (a) was 
perceived as a comparable other, (b) outper- 
formed the “comparer,” and (c) committed 
a humorous blunder (pratfall). Accordingly, 
intellectually superior subjects were exposed 
to a superior- or an average-ability person 
who committed or did not commit a non-
humorous or humorous blunder (i.e., a prat-
fall). It was thought that superior subjects
would regard the superior-ability person as an
appropriate standard for assessing their own
ability and hence would compare themselves
with him rather than with the average-ability
person. Consequently, for the superior subjects,
only the superior-ability person would be a
threat to self-esteem and, in the event of a
pratfall, an object of derogation. This would
occur even though the superior-pratfall person
obviously would be more competent than the
average-pratfall person.

In the present study, the pratfall condition
referred to above involved a stimulus person
who simply committed a pratfall with an
interviewer who reacted in a neutral manner.
A second pratfall condition was run in which
the interviewer was hostile, displaying disgust
and exasperation toward the stimulus person
for committing the pratfall. This was done to
determine if differences in interviewer behav-
ior influence subject reactions. No firm pre-
dictions were made in regard to this
condition, because it was thought that an
interviewer’s open denigration of a source of
threat might have two essentially opposite
effects. It might serve to make the other less
of a threat to self-esteem (i.e., the other had
erred and negatively impressed the inter-
viewer), thereby reducing subject derogation,
or the interviewer’s behavior might serve as
a model for aggressive behavior (Bandura,
Ross, & Ross, 1961), thereby facilitating
subject derogation.

Finally, in a second experiment, sub-
jects known to possess average intellectual
ability were exposed to average- or superior-
ability persons in the two pratfall conditions
and a no-pratfall control. This enabled us (a)
to determine if our superior-ability person
would be derogated only by subjects who
were also superior (i.e., of comparable ability)
and (b) to see if subjects known to be
average behave like Aronson et al.’s subjects
and derogate an average-ability person (i.e.,
a peer) in a situation involving a pratfall.4


4 We did not specify which of our pratfall condi-
tions might replicate Aronson et al. since our
duplication of their procedure was impossible. The
tape-recorded manipulations had been destroyed, and
thus, there was no way of checking on the precise
nature of the interviewer’s response to the pratfall.

METHOD


Overview


The present study consisted of two experiments.
In the first study, 150 Yale undergraduates listened
to one of 10 tape recordings (n = 15 per group) of a
stimulus person and then filled out a questionnaire
indicating their reactions to the stimulus person.
In the second experiment, 76 male undergraduates
from Southern Connecticut State College went
through the above procedure. They listened to one of
6 tape recordings (ns ranged from 10 to 14) selected
from the 10 recordings heard by Yale subjects.

For the most part the subjects were run in groups, 
with each subject being seated in an individual 
cubicle and supplied with earphones. The tape 
recording was presented as a taped interview of “a 
student from another university being interviewed 
for his school’s College Quiz Bowl team.” On half 
of the tapes, the stimulus person was of superior 
ability, and on the other half he was of average 
ability. 

At the end of the interview, the average- and
superior-ability persons either (a) ended the inter-
view without incident—no blunder control; (b)
spilled a little coffee on the table—a minimal
blunder 5; (c) spilled coffee all over himself—a large
blunder designed to be void of humor but equated
in terms of magnitude of mistake with a pratfall;
(d) spilled coffee all over himself and became ex-
tremely agitated—a blunder with humor, that is, a
pratfall; (e) same as d, except that the interviewer,
who showed mild concern for the stimulus person’s
welfare in the other blunder conditions, acted in a
hostile manner toward the stimulus person. Southern
Connecticut State College subjects were not exposed
to Endings a or b.

After listening to one of the tapes, the subjects
gave their impressions and opinions of “John,” the
stimulus person. Subjects then left the room, con-
cluding the experiment. As they left, the experimenter
made inquiries about subject suspicions.
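
The cell counts implied by this overview can be checked with a short sketch. It is an illustration only, not part of the original procedure: the condition labels are shorthand for the endings listed above, and the names `ABILITIES`, `ENDINGS`, and `design_cells` are hypothetical.

```python
from itertools import product

# Shorthand labels for the conditions described in the overview (illustrative).
ABILITIES = ["superior", "average"]
ENDINGS = {
    "a": "no-blunder control",
    "b": "minimal blunder",
    "c": "equated blunder (no humor)",
    "d": "pratfall",
    "e": "hostile pratfall",
}

def design_cells(ending_keys):
    """Cross the two ability levels with the chosen interview endings."""
    return [(ability, key) for ability, key in product(ABILITIES, ending_keys)]

# Experiment I: both ability levels crossed with all five endings.
yale_cells = design_cells("abcde")
# Experiment II: Endings a and b were not presented.
scsc_cells = design_cells("cde")

# n = 15 per group in Experiment I.
yale_total = len(yale_cells) * 15

print(len(yale_cells), yale_total, len(scsc_cells))
```

Crossing two ability levels with five endings gives the 10 tapes and 150 Yale subjects reported above; dropping Endings a and b leaves the 6 tapes used in the second experiment.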


Procedure 


Subjects were recruited to participate in an ex-
periment on “impression formation.” Yale subjects
received experimental credits toward their introduc- 
tory psychology requirement; Southern Connecticut 
State College subjects were paid $2. Two of the five 
interview endings and 15-25 subjects were run per 
experimental session. The sessions took place in a 
language laboratory, with each subject being assigned 
an individual cubicle and provided with earphones.
To minimize subject interaction, the subjects were
seated at every other cubicle, and the experimenter
maintained silence during the session. Because of
subjects not showing, 2-4 subjects per cell were 


5A blunder was defined as a mistake without 
humor involving personal clumsiness, whereas a prat- 
fall was defined as a blunder with humor, that is, 
a humorous mistake. 
run individually. The responses of these subjects did
not differ from the subjects in the same condition 
run as a group. 

Instructions and cover story. The experiment was 
described as one on impression formation. The 
instruction sheet given to each subject read as 
follows: 


This is an experiment in impression formation. 
You will . . . listen to a tape of a student from 
another university being interviewed for his 
school’s College Quiz Bowl team. For convenience 
this student will hereafter be referred to as John. 
In all tapes, John’s actual name and address have 
been edited out... . 

The tape you will hear is one of hundreds made 
as records of student interviews. These interviews 
were informal and conducted by members of 
a student committee elected ...to select the 
university’s College Quiz Bowl team. Both the 
interviewer and the student being interviewed 
knew that their conversation was being recorded. 
They did not, however, know that it would be used 
for anything other than a record of the interview 
and the student’s responses to 50 difficult questions 
asked during the interview. In the course of several 
hundred interviews, however, fortuitous events 
occurred. These unplanned variations made the 
interviews especially valuable as stimuli for study-
ing impression formation.


Besides giving instructions, the instruction sheet 
had two additional functions. The first was to pro- 
vide a cover story, that is, a straightforward impres- 
sion formation experiment which involved measuring 
the subjects’ impression of a real stimulus person. 
The second was to increase the credibility of the 
tapes to which the subjects listened by (a) stating 
that John’s anonymity would be preserved thereby 
increasing John’s credibility as a real person and (b) 
stating that hundreds of interviews had been 
recorded and that “fortuitous events occurred.” This 
was done to reduce the novelty of the blunders and 
pratfalls, which if not logically explained, would 
have seemed unreal to the normally suspicious psy- 
chology subject. We wanted to insure that the prat- 
fall was perceived as a real unstaged event, and 
although we were interested in their reactions to 
the event, we had no special interest in that 
particular event per se. 

Recording the tapes. Both the superior- and 
average-ability interviews and the various endings 
were recorded many times, and the best ones were 
selected for duplication and use in the experiment. 
The superior- and average-ability interviews were 
duplicated five times, and each interview ending was 
duplicated twice. Electronic instrumentation insured 
that all tapes were equated on recording level, noise, 
and other incidental variables so that there would 
be no discontinuity between the interview and the 
endings spliced onto them. 
Ability Manipulation


Superior ability. John answered 46 of 50 difficult 
questions correctly. Furthermore, in response to 
inquiry by the interviewer, John modestly said that 
in high school he was editor of the yearbook, a 
member of the school dance band, and captain of 
the track team, that his high school grades were very 
good, and that he was doing very well in college. 

Average ability. John answered 15 of the 50 ques-
tions correctly. Furthermore, he said that in high
school he was on the yearbook staff as a proof- 
reader, in the band, and “on and off” the track 
team, that his high school grades were “all right,” 
and that in college he was doing about average. 

In addition to the number of correct answers, 
the superior-ability person generally answered more 
rapidly and with greater confidence and authority 
than did the average-ability person. 

In both the superior- and average-ability inter- 
views, the interviewer kept a neutral tone, implying 
neither praise for correct answers nor disapproval 
for incorrect answers. Whenever the stimulus person 
did not know an answer or was incorrect, the 
interviewer stated the correct answer. 


Interview Endings or Blunder 
Manipulation 


Control. The tape ended without a blunder or 
pratfall. 

Minimal blunder. The minimal mistake or blunder
went as follows:

Interviewer: Oh, by the way, take your coffee cup
with you and leave it on the sink
outside.

John: Sure. (Small clatter of coffee cup and
saucer.) I’m sorry. I spilled some
coffee.

Interviewer: O.K. I’ll clean it up when you leave.
Don’t worry. Just send the next guy
in, please.

John: O.K.

This blunder was run to determine if the degree
or size of the mistake per se had any effect on
attraction.

Equated blunder. The equated blunder went as
follows:

[Loud clatter of cups and saucers]

John: Oh! Gosh! I spilled coffee all over
my new suit.

Interviewer: Oh. Listen, I’ll clean it up in here.
You can wash up in the sink outside
and then send the next guy in.

John: O.K. Goodbye.

Interviewer: Goodbye.

The extent of the mistake or blunder in this ending
is greater than in the minimal blunder condition
and equated to the blunder in the pratfall endings;
that is, John spills coffee all over his new suit. The
difference is that unlike the pratfall endings, John
is not “anguished” or unduly upset by his blunder.
This was intended to void the blunder of humor.

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Pratfall ending. The pratfall ending went as `% 
follows: 
[Loud clatter of cups and saucers] 
John: Oh, no! The coffee . . . all over my
new suit. All over. It’s a new suit!
Interviewer: How bad is it?

John: Oh! It’s soaking through. I can feel
it soaking through. And the whole
suit, it’s just ruined. It’s all down the
pants . . . and all . . . And on . . .
Interviewer: Listen . . .
John: Ugh!!!


Interviewer: Listen, I'll clean it up in here. You 
can wash up in the sink outside and 
then send the next guy in. 

John: O.K.

Here, the extent of the mistake was objectively the 
same as in the equated blunder ending. However, 
humor was added by having John react with emotion
to the incident. His voice was definitely “anguished,”
and he was clearly upset. There was considerable
confusion.⁶

Hostile pratfall ending. The hostile pratfall ending
went as follows:

[Loud clatter of cups and saucers] 


John: Oh, great! The whole pot of coffee 
... all over... and my new suit. 
Hey, I’m sorry. 

Interviewer: Hey, ah, I'll clean it up. 

John: Ah, I’m really sorry. 

Interviewer: Don’t worry. I'll clean it up.

John: I didn’t mean to do this.

Interviewer: Hey, all right, all right!

John: Really, I’m sorry.

Interviewer: I'll . . . just send the next guy in.
I'll take care of it all right! Come
on, just send the next guy in!!!

In this ending, the interviewer, in contrast to his
behavior in the other endings, acted in a hostile
manner toward John. The interviewer was irritated,
and he was obviously “down” on John. From the
script, the stimulus person appears more apologetic
in this ending than in the pratfall ending. While
this is true to some extent, the written script is
misleading. Because of the noise and general confusion,
and because John and the interviewer often
speak at the same time, what actually came across
to the subject was the tone and mood of the stimulus
person and the interviewer rather than the specific
content of what was said. When asked by the
experimenter what they remembered of the pratfall
ending, the subjects uniformly replied that “John”
spilled coffee all over his new suit and came
“unglued.” In answer to inquiry about the hostile
pratfall ending, the subjects said the same thing,
adding that the interviewer was really irritated
with “John.”

6 It is possible that the hoopla and confusion surrounding
the mistake not only added humor but also
increased the subjective size of the mistake. While
this is possible, the difference in mistake size between
the equated-blunder and pratfall conditions is
much more apparent in the written script than it is
on tape. The same mistake occurs, but then, in the
pratfall condition, there is attendant confusion which
tends to obscure any additional information about
mistake size.


Measures of Interpersonal Attractiveness— 
Liking and Respect 


The subjects responded to 14 attraction measures on 
15-point scales. Seven questions known to be highly
intercorrelated (Goldberg & Mettee, 1968) tapped the
subjects’ personal feelings of liking for the stimulus
person. Seven other questions highly related to each
other but not to the liking questions were intended 
to measure how much the subjects respected the 
stimulus person. 

There were two additional questions. A check on 
the ability manipulation asked the subjects on a 
15-point continuum, “How intelligent would you 
say John is?” On a separate page, a check was made 
to see if the two pratfall endings were perceived 
as being humorous. Subjects were first asked, “Did 
anything unusual happen during the taped interview 
which you heard?” and then, “If something unusual 
did happen, how would you rate this occurrence 
on the following scales?” The first and third of these 
scales were fillers; the second was an 11-point 
continuum ranging from “extremely humorous” (1) 
to “not humorous at all” (11). Following completion 
of the questionnaire, the subjects went into an 
adjoining room where the experimenter conducted 
a group debriefing and probed for suspicion. 

Data were also collected in regard to the ability 
of Yale and Southern Connecticut State College sub- 
jects to answer the “quiz” questions used to ma- 
nipulate ability. Since the subjects exposed to the 
interview heard the questions answered, we could 
not use them for this purpose. A separate group of 
Yale students (n = 15) were randomly selected from
the Yale population and asked to take the 50-question
quiz used in the interview. Similarly, a
separate group of Southern Connecticut State
College students (n = 20) were recruited to answer
the 50 questions. 


RESULTS AND DISCUSSION 
Ability Characteristics of Subjects 


Data from the 15 Yale and 20 Southern 
Connecticut State College students who an- 
swered the “college quiz” questions confirmed 
the expected. On this specific task, the Yale 
students were indeed brighter than the stu- 
dents from Southern Connecticut State Col- 
lege (this is also true in general; Verbal Col- 
lege Board means for the sophomore classes 
from which most of the subjects were drawn 
equaled 690 and 500, respectively). On the
average, Yale students answered twice as
many questions correctly as Southern Connecticut
State College students (X̄s = 20
versus 10; t = 4.11, p < .001). Moreover,
these “quiz” students were asked to indicate 
“tip of the tongue” questions—questions that 
they were not able to answer correctly but 
were certain that they could have if they had 
had more time. When this figure was added 
to the number correct, Yale students had an 
average score of 29, and Southern Connecticut 
State College students had an average of 19. 
That is, Yale and Southern Connecticut State 
College students considered 29 and 19 of 
the 50 questions, respectively, as asking for 
information they possessed. These latter fig- 
ures were thought to provide some idea of 
who Yale and Southern Connecticut State 
College subjects would perceive as an appro- 
priate person for ability comparison. 

These same Yale students who thought that 
they “knew” 29 of the answers also thought 
that superior Yale students (upper 10% of the 
class) would get 32 questions correct, that 
average “Yalies” would correctly answer 24 
questions, and that the average student at 
a state university would answer only 16 ques- 
tions. Southern Connecticut State College 
students who thought they “knew” 19 of the 
answers estimated that the average student 
at a state university would answer 19 ques- 
tions correctly and that a superior Yale stu- 
dent would answer 35 questions correctly. 
Obviously, what the Southern Connecticut 
State College student “knew” corresponded 
closely with his estimate of the “average” stu- 
dent’s performance. What the Yale student 
knew fell in between his estimate of an average 
and superior Yale student’s performance. In 
total, these quiz data show that relative to the 
average college student, Yale students believe 
that they possess superior intellectual ability, 
while Southern Connecticut State College students
see themselves as “average” college
students.⁷ Moreover, what Southern Connecticut
State College quiz students knew (19)
marginally surpassed the average-ability person’s
performance (15), which suggests that
Southern Connecticut State College subjects
would view the average-ability person as an
appropriate standard for ability comparison.
Although the performance of the superior-ability
person (46) surpassed what Yale
students knew (29), this latter total was on
the borderline of what Yale students considered
to be a very superior class (the upper
10% of Yale students). This suggests that
Yale subjects saw the superior-ability person
as a more appropriate standard for ability
comparison than the average-ability person.
Moreover, threat to self-esteem should be
intense when a superior individual discovers,
as Yale experimental subjects did, that another
person possesses superiority of a very
high order.

7 Southern Connecticut State College experimental
subjects reported that they obtained “average” grades
(2.5 on 4.0 system) just as their fellow Southern
Connecticut State College students who took the quiz
reported that their informational knowledge matched
their conception of what the average student would
know. Yale experimental subjects reported that they
obtained grades between superior and average, relative
to the Yale population (3.1 when converted to a 4.0
system) which corresponds to the ability self-perceptions
of their Yale counterparts who took
the quiz.


TABLE 1

ANALYSES OF VARIANCE OF YALE SUBJECTS’ LIKING
AND RESPECT RATINGS OF THE
STIMULUS PERSON

Source                      SS      df      MS        F
Liking
  Ability (A)              12.38     1     12.38     1.87
  Interview ending (B)    134.25     4     35.56     5.07**
  A × B                   147.68     4     36.92     5.58**
  Error                   926.98   140      6.62      -
Respect
  Ability (A)             353.03     1    353.01    83.61**
  Interview ending (B)     67.51     4     16.88     4.00*
  A × B                    87.19     4     21.80     5.16**
  Error                   507.73   140      4.22      -

*p < .01
**p < .001


Experiment I (Yale) Data 


Manipulation checks. The superior-ability 
person was rated more intelligent than the 
average-ability person across all interview- 
ending conditions (F = 190.90, p < .001, for 
ability main effect). The humor manipulation 
was also successful, with both pratfall end- 
ings receiving uniformly high humor ratings 
and the blunder-only endings receiving uniformly
lower humor ratings (F = 12.08,
p < .001, for the interview ending main
effect).

Attraction data—liking and respect. Spe- 
cific measures of respect were highly inter- 
correlated (average r = .78) and were com- 
bined for analysis. The same procedure was 
followed with the specific liking measures 
(average r = .85). Respect and liking measures
did not intercorrelate highly (average
r = .34).

Analyses of variance computed on the 
combined liking and combined respect scores 
are presented in Table 1. There was a pro- 
nounced ability main effect favoring the 
superior-ability person on the respect measure 
(F = 83.60) but not on the liking measure 
(F = 1.87). On both measures there was a 
significant interview-ending effect and a sig- 
nificant interaction. Overall, liking and re- 
spect were lowest in the pratfall condition 
and highest in the hostile-pratfall condition; 
but, as Figure 1 shows, these variations oc- 
curred only in the superior-ability condition. 
Yale subjects were almost totally indifferent 
to the nature of the average-ability person’s 
end-of-interview behavior. Regardless of what
he did, liking for the average-ability person 
hovered just below the neutral point, while 
respect for him was uniformly low. Appar- 
ently, whether the average-ability person did 
or did not commit a social blunder was of 
little importance to an intellectually superior 
person. 
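
The F ratios in Table 1 follow the usual fixed-effects logic: each mean square is SS/df, and each F is an effect mean square divided by the error mean square. The short Python sketch below recomputes the liking Fs from the tabled sums of squares and degrees of freedom. The function and variable names are illustrative only and are not part of the original analysis.

```python
def f_ratios(effects, error):
    """Return {effect name: F} for a fixed-effects ANOVA table.

    effects: dict mapping effect name -> (SS, df)
    error:   (SS, df) for the within-cells error term
    """
    ms_error = error[0] / error[1]
    return {name: (ss / df) / ms_error for name, (ss, df) in effects.items()}


# SS and df for the liking ratings, as given in Table 1.
liking_effects = {
    "Ability (A)": (12.38, 1),
    "Interview ending (B)": (134.25, 4),
    "A x B": (147.68, 4),
}
liking_error = (926.98, 140)

liking_f = f_ratios(liking_effects, liking_error)
for name, f in liking_f.items():
    print(f"{name}: F = {f:.2f}")
```

Rounded to two decimals, the three ratios agree with the F column reported for liking (1.87, 5.07, and 5.58).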

Conversely, Yale subjects’ reactions to the
superior-ability person varied dramatically as
a function of the interview ending. In the
control, simple blunder, and equated blunder
conditions, liking was just above neutral, but
in the two pratfall conditions there was sharp
divergence from this neutral response. The
superior-ability person was derogated so severely
in the pratfall condition that he was
disliked more than his average-ability-person
counterpart (t = 3.08, p < .01) and was
rated far below the equated-blunder superior-ability
person (t = 4.23). In the hostile-pratfall
condition, this pattern was reversed: a
superior-ability person berated by the interviewer
was liked better than his average-ability-person
counterpart (t = 3.06) and
liked somewhat better than the superior-ability
person in the equated-blunder condition
(t = 1.61, p < .15; if all nonpratfall superior-ability-person
groups are combined, this difference
is significant, t = 1.98, p < .05). Respect
ratings of the superior-ability person
were uniformly above neutral except in the
pratfall condition, where the superior-ability
person was accorded no more respect than his
average-ability-person counterpart and significantly
less respect than any of the other superior-ability
persons (all ts > 4.00).

As predicted, these results were very dif- 
ferent from those obtained by Aronson et al. 
(1966). In the eyes of an ability peer, a pratfall
led to a superior person’s downfall rather
than to enhanced attractiveness, whereas an
average person who committed a pratfall was 
reacted to indifferently rather than being 
derogated. Apparently, subjects of high intel- 
lectual ability saw the superior-ability person 
as a relevant comparison for their own abil- 
ity, and, since his performance was superb, 
he threatened their self-esteem (cf. Morse & 
Gergen, 1970). This, in turn, led to severe 
derogation of the threat source (the superior- 
ability person) when humor was present. 
Thus, only a stimulus person regarded by sub- 
jects as a peer (i.e., the superior-ability per- 
son) was derogated and then only when the 
blunder was humorous.

Fig. 1. The degree of liking and respect shown for the superior- and average-
ability persons by Yale subjects.


TABLE 2

MEAN LIKING AND RESPECT SHOWN BY ALL SOUTHERN
CONNECTICUT STATE COLLEGE SUBJECTS AND BY
SOUTHERN CONNECTICUT STATE COLLEGE
“HUMOR” SUBSAMPLE SUBJECTS (PRATFALL
AND HOSTILE-PRATFALL CONDITIONS)

                           Liking               Respect
Condition             Superior  Average    Superior  Average
                       person    person     person    person
Whole sample
  Equated blunder        8.50      9.04      10.46      8.50
    n                   11        14         11        14
  Pratfall               8.92      9.54      10.99     10.67
    n                   14        13         14        13
  Hostile pratfall       8.00      7.86       9.83      6.41
    n                   14        10         14        10
Subsampleᵃ,ᵇ
  Equated blunder        8.50      9.04      10.46      8.50
    n                   11        14         11        14
  Pratfall               8.53      9.84      10.21      8.20
    n                   10         7         10         7
  Hostile pratfall       9.45      7.07      10.92      5.07
    n                   10         6         10         6

Note.—Neutral point on the scale was 8.00; an analysis of
variance was computed with the harmonic mean adjustment
for unequal Ns.

ᵃ Ability × Interview Ending interaction F for liking = 3.60,
df = 2/52, p < .05; main effect Fs < 1.00; MSe = 4.67.

ᵇ Ability main effect F for respect = 28.14, df = 2/52, p <
.025; the ending effect was not significant; MSe = 5.10.
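
The note to Table 2 mentions a harmonic mean adjustment for unequal Ns. One common reading of that phrase is an unweighted-means analysis, in which effects are computed from the cell means and every cell is treated as if it contained the harmonic mean of the actual cell sizes. The Python sketch below applies that reading to the humor-subsample liking means. The choice of procedure and all names in the code are assumptions made for illustration; the report does not spell out the computation.

```python
from statistics import harmonic_mean

# Humor-subsample liking means and cell sizes from Table 2.
# Rows: superior-ability person, average-ability person.
# Columns: equated blunder, pratfall, hostile pratfall.
means = [[8.50, 8.53, 9.45],
         [9.04, 9.84, 7.07]]
ns = [[11, 10, 10],
      [14, 7, 6]]
MS_ERROR = 4.67  # error mean square given in the table note


def unweighted_means_f(means, ns, ms_error):
    """F ratios for rows, columns, and interaction, computed from cell means.

    Assumption: each cell is weighted as if it held n_h observations,
    where n_h is the harmonic mean of the actual cell sizes.
    """
    r, c = len(means), len(means[0])
    n_h = harmonic_mean([n for row in ns for n in row])
    grand = sum(map(sum, means)) / (r * c)
    row_m = [sum(row) / c for row in means]
    col_m = [sum(means[i][j] for i in range(r)) / r for j in range(c)]

    ss_rows = n_h * c * sum((m - grand) ** 2 for m in row_m)
    ss_cols = n_h * r * sum((m - grand) ** 2 for m in col_m)
    ss_int = n_h * sum(
        (means[i][j] - row_m[i] - col_m[j] + grand) ** 2
        for i in range(r) for j in range(c)
    )
    return {
        "n_h": n_h,
        "F_ability": ss_rows / (r - 1) / ms_error,
        "F_ending": ss_cols / (c - 1) / ms_error,
        "F_interaction": ss_int / ((r - 1) * (c - 1)) / ms_error,
    }


result = unweighted_means_f(means, ns, MS_ERROR)
for key, value in result.items():
    print(f"{key} = {value:.2f}")
```

Under these assumptions the harmonic mean cell size is about 8.9, the interaction F comes out near 3.6, and both main-effect Fs fall below 1.00, in line with the table note. Small discrepancies are to be expected because the tabled means are rounded to two decimals.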


The hostile-pratfall condition was seem- 
ingly an exception to the above statement. 
Here, a humorous blunder committed by an 
intellectual peer did not lead to derogation. 
However, rather than an exception, this result 
may provide support for the idea that threat- 
ened self-esteem mediated derogation. It is 
possible that the interviewer’s “put down” of 
the superior person reduced the superior- 
ability person’s perceived superiority, so that 
he was no longer a threat to self-esteem and 
hence was not derogated. If this were the case, 
the absence of derogation in this condition 
would be further evidence that threatened 
self-esteem was a mediating link in the chain 
of events leading to derogation. 

DAVID R. METTEE AND PAUL C. WILKINS

Aronson et al.’s (1966) results, though essentially
the opposite of those found here, can
be accounted for with the threatened self-esteem
rationale simply by assuming that
Minnesota sophomores viewed the average-ability
person rather than the superior-ability
person as their peer, or relevant comparison
person. Given this, the average-ability person,
and not the objectively much better superior-ability
person, would have been the source of
self-esteem threat. To check this analysis we
turn to the Southern Connecticut State College
data.


Experiment II (Southern Connecticut State 
College) Data 


Manipulation checks. Southern Connecticut 
State College subjects rated the superior-abil- 
ity person as more intelligent than the aver- 
age-ability person (F = 41.96). This effect 
was weaker than it was for Yale subjects and 
was due entirely to the Southern Connecticut 
State College subjects rating the average- 
ability person higher than did Yale subjects 
(the superior-ability person was rated equally 
high by both subject groups). 

The humor manipulation was, on the whole,
unsuccessful, both in a relative and an absolute
sense. The “pratfall” endings were not
more humorous than the equated blunder ending
(F < 1.00). Moreover, mean humor ratings
clustered around six, or neutral, on the 11-point
humor scale. Southern Connecticut State
College subjects apparently did not see anything
funny about the pratfall endings, although
they did find the average-ability person’s
behavior to be even less funny than the
superior-ability person’s behavior (F = 4.53,
p < .05).

There is no obvious reason for the Southern 
Connecticut State College subjects not find- 
ing the “pratfall” endings humorous since the 
“humor” stimuli were identical for both sub- 
ject groups. However, even though there was 
no overall effect, there was great variation in 
the humor ratings of the pratfall endings, 
with a considerable number of Southern Con- 
necticut State College subjects finding them 
quite humorous. Hence, some of the analyses 
to follow will include only those subjects who 
rated the pratfall endings above neutral on the 
humor scale.

Southern Connecticut State College attrac- 
tion data. No significant liking effects were 
found with the Southern Connecticut State 
College sample as a whole. The top half of 
Table 2 shows that directionally, the average-ability
person was liked slightly more than the
superior-ability person in the equated-blunder
and pratfall conditions; this difference was
reversed in the hostile-pratfall condition primarily
because of a drop in liking for the
average-ability person.

The respect data for the Southern Connec- 
ticut State College sample showed significant 
effects with the superior-ability person being 
respected more than the average-ability person
(F = 22.53, df = 1/70, p < .001), especially
in the hostile-pratfall condition (interaction
F = 3.72, df = 2/70, p < .05). The
superior-ability person was accorded moderately
high respect across all conditions, while
respect for the average-ability person was
above neutral in the equated-blunder and
pratfall conditions and decidedly lower in the
hostile-pratfall condition (t for hostile-pratfall
versus equated-blunder conditions = 2.15,
p < .05). Thus, when Southern Connecticut 
State College subjects did derogate, they “put 
down” the average- rather than the superior- 
ability person. 

Southern Connecticut State College attraction
data: humor subsample. In order to compare
Yale subjects with Southern Connecticut
State College subjects who were operating
under the same humor conditions, Southern 
Connecticut State College subjects were se- 
lected who rated the pratfall endings above 
neutral on the humor scale (almost every 
Yale subject rated the pratfalls above neu- 
tral). The Southern Connecticut State Col- 
lege equated-blunder group was left intact 
since their humor ratings matched those made 
by Yale subjects (i.e., in both the Yale and
Southern Connecticut State College sample, 
only a few subjects rated the equated blunder 
above neutral on the humor scale). 
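
The selection rule for the humor subsample can be sketched in a few lines of Python. The sketch assumes that "above neutral" means a rating on the humorous side of the scale midpoint, that is, numerically below 6 on the 11-point scale where 1 is "extremely humorous" and 11 is "not humorous at all." The subject records are hypothetical and invented purely for illustration; they are not data from the study.

```python
NEUTRAL = 6  # midpoint of the 11-point scale (1 = extremely humorous)


def humor_subsample(subjects):
    """Keep every equated-blunder subject, and keep pratfall or
    hostile-pratfall subjects only if their humor rating is on the
    humorous side of neutral (assumed here to mean rating < 6)."""
    return [
        s for s in subjects
        if s["ending"] == "equated blunder" or s["humor"] < NEUTRAL
    ]


# Hypothetical records for illustration only.
subjects = [
    {"id": 1, "ending": "equated blunder", "humor": 8},
    {"id": 2, "ending": "pratfall", "humor": 3},
    {"id": 3, "ending": "pratfall", "humor": 7},
    {"id": 4, "ending": "hostile pratfall", "humor": 5},
    {"id": 5, "ending": "hostile pratfall", "humor": 6},
]

selected_ids = [s["id"] for s in humor_subsample(subjects)]
print(selected_ids)
```

With these made-up records, Subjects 1, 2, and 4 are retained: the equated-blunder subject is kept regardless of rating, and a pratfall subject who rated the ending exactly at neutral is dropped.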

It is obvious from inspection of Table 2
and Figure 2 that the patterns found with the
entire Southern Connecticut State College
sample were on the whole intensified with the
humor subsample. Superior-ability persons
were moderately respected under all circumstances,
and average-ability persons were
neutrally respected except in the hostile-pratfall
situation where respect dropped sharply.
The liking data yielded a significant interaction
which reflected a sharpening of the directional
differences found with the entire Southern
Connecticut State College sample. In the
equated-blunder and pratfall situations, the
average-ability person was liked somewhat
more than the superior-ability person; conversely,
in the hostile-pratfall situation, the
average-ability person was significantly less
liked than the superior-ability person (t =
2.14, df = 52, p < .05). This reversal
stemmed from a significant drop in liking for
the average-ability person in the hostile-pratfall
condition (t = 2.31, p = .025 for the
average-ability person hostile-pratfall versus
pratfall condition) and a modest increase in
liking for the superior-ability person.

This pattern of liking was very different 
from that found with Yale subjects (see Fig- 
ure 2). Relative to the equated-blunder con- 
trol, the superior-ability person was never 
derogated by Southern Connecticut State Col- 
lege subjects in the humor subsample. Further, 
liking for the superior-ability person mod- 
estly increased under circumstances that in- 
volved a pratfall. On the other hand, the av- 
erage-ability person, whose end of interview 
behavior was ignored by Yale subjects, suf- 
fered a significant decrease in liking when he 
committed a pratfall that elicited scorn from 
the interviewer. These data very clearly show 
that a specific ability person evoked quite 
different reactions from Yale than from Southern
Connecticut State College subsample subjects.
However, it is equally clear that reactions
toward ability peers were in close correspondence.
Southern Connecticut State College
and Yale subjects did not derogate their
respective ability peers under the same circumstances,
but when derogation did occur,
an average-ability person was to a Southern
Connecticut State College or average-ability
subject what a superior-ability person was to
a Yale superior-ability subject (i.e., only the
relevant comparison person was derogated).
In contrast, a nonpeer or noncomparable
ability person was never derogated.

The reactions of our average subjects in a
general sense paralleled the reactions of Aronson
et al.’s (1966) subjects, except that in
the present study a pratfall and a hostile interviewer
were required to elicit responses
brought forth by only a pratfall in the Aronson
et al. study. One possible methodological
explanation of this difference is that the Aronson
et al. interviewer may have been impatient
with the blundering stimulus person. If
this was the case, then their pratfall and our
hostile-pratfall conditions were not all that
different in terms of operations, as well as in
the results produced.

Fig. 2. The degree of liking and respect shown by Southern Connecticut
State College humor subsample subjects for the superior- and average-ability
persons; reactions of Yale subjects presented for comparison.

In regard to the effects of humor, it appears 
that humor was necessary and sufficient to 
trigger derogation of a superior subject’s 
ability peer (the superior-ability person), and 
was a necessary but not sufficient ingredient
for triggering derogation of an average subject’s
ability peer (the average-ability person).
If only a humorous blunder (i.e., a pratfall
only) was committed, then Yale subjects
derogated the superior-ability person; other- 
wise, they did not. In fact, in the case of a 
pratfall accompanied by a hostile interviewer, 
liking actually increased slightly. This ab- 
sence of derogation and the high humor rat- 
ings in the hostile-pratfall condition suggested 
that the interviewer’s hostility reduced per- 
ceived threat and that humor in itself does not 
produce derogation but only functions as a 
releasing agent. On the other hand, ability
peers severely derogated the average-ability
person only in the hostile-pratfall condition,
and only if they perceived the pratfall as hu- 
morous. Hence, the blunder had to be per- 
ceived as humorous and take place in the 
presence of an aggressive social agent before 
the average subject expressed dislike for his 
peer. 

The latter finding raises the question as to 
why the humor-hostile interviewer combina- 
tion led the average subject to derogate 
his peer (the average-ability person), but 
prompted the superior subject to like his 
peer (the superior-ability person) more. Per- 
haps average-ability subjects are more depen- 
dent on social standards than superior-ability 
subjects (cf. Hollander, 1958), and hence 
needed more than a mere pratfall as an 
excuse for derogating. Thus, the guise of 
humor plus the interviewer’s aggressive be- 
havior, which could be plausibly viewed 
as socially sanctioning aggression, may have 
provided the average subject with the excuse 
he needed for engaging in derogation of his 
ability peer. Needing less of an excuse to 
derogate, superior subjects may have been less 
predisposed to see the interviewer’s behavior 
as a sanction and hence were more prone to 
see it as evidence that the superior-ability person
was far from perfect. This, in turn, may
have made the superior-ability person less
threatening and consequently less an object of
derogation.⁸

8 The reader might also question why average subjects
liked the superior-ability person slightly more
in the hostile-pratfall condition rather than the
pratfall-only condition. Aronson et al. (1966) suggested
that such an increase is due to a pratfall’s
making a superior person seem more human in the
observer’s eyes. Perhaps being seen as more human
requires that the superior person suffer the consequences
of his act. However, in the pratfall-only
condition, the superior-ability person in a sense gets
away with his blunder, which, rather than making
him appear more human, may further emphasize his
superior position. In contrast, the interviewer’s hostile
reaction was concrete evidence that just like anyone
else, the superior-ability person had to reap the
costs of his mistake and hence was human after all.
Moreover, within the hostile-pratfall condition, it
was only when the average subject laughed at the
hostile pratfall (i.e., found it funny) that liking for
the superior-ability person was high, and laughing
at the expense of a superior person’s predicament
might be expected to make him appear more human.

Finally, recent data obtained by Helmreich,
Aronson, and LeFan (1970) are
consistent with the present data and the 
threatened self-esteem rationale. These in- 
vestigators found that chronic high-self-esteem 
subjects responded like Yale subjects and 
derogated a superior-ability person if he com- 
mitted a pratfall, while the average-ability 
person was rated equally low in both a prat- 
fall and nonpratfall condition. Moderate-self- 
esteem subjects showed the opposite pattern 
and responded like our “average” subjects. 
They derogated the average-ability person 
somewhat when he committed a pratfall, 
whereas they liked the superior-ability person 
more in the pratfall condition. Although lev- 
els of general self-esteem cannot be equated 
with levels of intellectual ability, we would 
expect both high-self-esteem subjects and Yale 
subjects to regard the superior-ability person 
as an appropriate standard for comparison. 
Similarly, we would expect moderate-self-esteem
and Southern Connecticut State College
subjects to both regard the average-ability 
person as an appropriate comparison standard. 
The reactions of the high-self-esteem and 
moderate-self-esteem subjects were consistent 
with this expectation. However, a low-self- 
esteem group run by Helmreich et al. also 
derogated the superior-ability person. This 
reaction to the superior-ability person corre- 
sponds very closely to those of our Yale sub- 
jects and appears to contradict the notion 
that it is the comparable other who will be 
derogated. However, this contradiction may 
be more apparent than real in that low-self- 
esteem individuals, unlike most people (Fes- 
tinger, 1954), may not base their assessments 
of comparability on the other’s perceived simi- 
larity. In fact, Coopersmith (1967) has shown 
that low-self-esteem children accept superior 
performance standards as being applicable to 
themselves, and the discrepancy between 
these standards and their own performance is 
one reason why they have low self-esteem. 


Thus, it may be that the low-self-esteem subjects
in the Helmreich et al. study did not
consider a superior-ability person’s performance
as noncomparable, but rather saw the
superior performance as a viable standard for
judging their own presumably poor performance
(i.e., the superior-ability person was a
comparable other in the low-self-esteem subject’s
eyes). This negative comparison would
be highly threatening, and consequently, we
would expect the threat source to be highly
derogated.

Unfortunately, the present investigation 
does not provide data pertinent to the above 
analysis, since none of our subjects were even 
roughly comparable with Helmreich et al.’s 
low-self-esteem group. The grade point aver- 
ages of our average-ability subjects were ex- 
amined in an attempt to select out a low- 
ability group. However, there was so little 
variability in grade point averages that a low- 
ability group could not be demarcated (70 of 
the 76 grade point averages in the Southern 
Connecticut State College sample were be- 
tween 2.00 and 3.00, so there simply was no 
subgroup of low-ability subjects). Obviously, 
further research is necessary to determine if 
low-self-esteem subjects seriously compare 
their performance with superior standards and 
severely derogate superior others as a result. 


CONCLUSION 


Taken as a whole, the results from the two 
experiments supported our theoretical ra- 
tionale. Relative to their reactions in nonprat- 
fall situations, superior subjects derogated 
persons of superior ability who committed 
only a pratfall but not persons of average 
ability. Similarly, average subjects tended to 
derogate persons of average ability who com- 
mitted a pratfall in the presence of a hostile 
interviewer, but did not derogate persons of 
superior ability. Apparently, an ability peer 
was a relevant standard for comparison, and 
consequently, his relatively successful per- 
formance (which was less pronounced in the 
case of the average-ability person where peer 
derogation was also weaker) threatened self-esteem
which, under circumstances involving
humor, resulted in derogation of the peer.
Conversely, a nonpeer, even a highly superior 
one, inspired indifference because he was not 
a relevant comparison, and hence, his perform- 
ance had no implications for the subject’s 
self-esteem.


EFFECT OF THE SOCIAL CONTEXT ON THE
PRINCIPLE OF BELIEF CONGRUENCE*


BERNIE I. SILVERMAN² AND RAYMOND COCHRANE³


Michigan State University 


The principle of belief congruence was tested on 43 middle-class Caucasian 
home owners living in an all white neighborhood in Lansing, Michigan. 
Twenty-two had previously signed a petition, purportedly sponsored by the 
Urban League, supporting the principles of open housing, while 21 had refused. 
Subjects evaluated hypothetical persons (who varied on race, attitude simi- 
larity, and socioeconomic class) in four social contexts (friend, neighbor, home 
purchaser, and son-in-law) and completed a measure of perceived social pres- 
sure to discriminate against Negroes. Major findings were (a) that the princi- 
ple accurately described the subjects’ responses in all social contexts; but (b) 
as perceived social pressure increased, across social contexts, the principle of 
belief congruence less accurately described the subjects’ evaluations; and (c) 
there was a significant relationship between petition signing and discrimination 
on race when the hypothetical persons were rated as home purchasers. Possible
interpretations of the findings are discussed.


It may be inferred from the principle of
belief congruence (Rokeach & Rothman,
1965) that individuals tend to like others
whom they perceive as having beliefs and
attitudes similar to their own, while disliking
those whom they perceive as holding beliefs
and attitudes different from their own. The
same inference may also be drawn from the
balance theories of Heider (1946) and Newcomb
(1963). However, one unique implication
of the principle of belief congruence is
that attitude similarity or dissimilarity is a
more potent determinant of interpersonal
attraction than is racial similarity.

Evidence for the principle of belief congruence
was first supplied by Rokeach, Smith,
and Evans (1960). They found that college
students discriminated more on belief similarity
than racial similarity when evaluating
hypothetical persons as friends. Contrary to
the findings of Rokeach et al., Triandis
(1961) found that college students discriminated
far more on racial similarity than belief
similarity when rating the desirability of
hypothetical persons in roles such as friend,
neighbor, and marriage partner.

Rokeach (1961) attempted to explain the
disparity between his own and Triandis’ results
by claiming that the belief component of
Triandis’ hypothetical persons was relatively
weak and nebulous compared to the belief
component of the hypothetical persons used
in his own study. Triandis felt the inconsistency
in results was due to a different factor.
He wrote: “Had Rokeach considered something
more than friendship, he would not have
obtained his results [Triandis, 1961, p. 186].”
Triandis was implying that the roles or situations
the hypothetical persons were placed in
determined the differential effects of attitude
and racial similarity.

Subsequent investigations explored this 
proposition. Byrne and Wong (1962) and 
Byrne and McGrew (1964) found that sub- 
jects discriminated more on attitude similarity 
than racial similarity when evaluating both 
the likableness and desirability as a work 
partner of the hypothetical persons. Rokeach 
and Mezei (1966), using overt behavior as an 
index of discrimination, concluded that col- 
lege students discriminated more on attitude 
similarity than racial similarity when choosing
someone to have coffee with. They also found
that adults, applying for jobs at state mental 
institutions, acted in a similar manner when 
selecting other job applicants as future work 
partners. 

But when subjects were asked to evaluate 
hypothetical persons in roles other than friend 
or work partner, a large racial discrimination 
effect occurred. Triandis and Davis (1965) 
found that college students discriminated more 
on race than belief when rating hypothetical 
persons as marriage partners and neighbors, 
while Insko and Robinson (1967) showed 
that southern high school students also dis- 
criminated mainly on race when the hypo- 
thetical persons were placed in the same situ- 
ations. Stein (1966), whose subjects were 
northern high school students, found that the 
race effect was greatest for “have a close 
relative marry,” “invite home to dinner,” and 
“neighbor on the street,” but only in the first 
instance was it stronger than the effect of
attitude similarity. These paper-and-pencil
manifestations of racial discrimination, as well
as overt racial discrimination in housing (Far- 
ley & Taeuber, 1968; Grodzins, 1957), ap- 
pear to challenge the validity of the principle 
of belief congruence. 

Rather than discard the principle as inaccu- 
rate, Rokeach and Mezei (1966) asked for a 
dispensation. They wrote: 


the focus of racial and ethnic discrimination is to 
be sought in society, not in the individual’s psyche. 
If a society’s constraints were altogether removed,
man would not discriminate in terms of race or 
ethnic grouping . . . but in terms of the principle 
of belief congruence [p. 171]. 


Whenever there are strong societal sanctions 
supporting racially discriminatory behavior, 
the principle is not to be applied, but when 
these sanctions are absent or innocuous, the 
principle of belief congruence should account 
for the largest portion of the variance in inter- 
personal attraction. 

Other investigators (Insko & Robinson, 
1967; Stein, Hardyk, & Smith, 1965; Triandis
& Davis, 1965) have generally accepted
Rokeach’s qualification of the principle of
belief congruence. However, in doing so
they have rendered the principle of belief
congruence immune from falsification and
therefore meaningless. In studies in which a large
race effect was obtained in specific situations, 
it was attributed to the social sanctions salient 
in those situations. Yet the presence of social 
sanctions facilitating racial discrimination in 
a situation was inferred from a large race 
effect. The reasoning is circular. Whenever 
results are in accord with the principle, social 
sanctions are not thought to be operating, and 
the principle is supported. But when results 
show that much of the variance in responses 
is due to the race of the hypothetical persons, 
social sanctions are invoked, and the principle 
remains unscathed. 

Clearly an independent measure of per- 
ceived social pressure to act in a racially dis- 
criminatory manner is required. In the present 
study such a measure was employed. The first 
hypothesis tested was that as perceived social 
pressure to discriminate against Negroes increases
across situations, the extent to which
the principle of belief congruence accurately 
describes the subjects’ responses decreases. 

From the studies reviewed it may be seen 
that support for the principle of belief congruence
rests mainly on the behavior of college
students. In order to test the generality
of the principle of belief congruence, the subjects
used in the present study consisted of
middle-class Caucasian homeowners living in 
an all white neighborhood in Lansing, Michi- 
gan. 

The almost exclusive use of students to test 
the principle of belief congruence not only 
limits the generality of the findings but also 
impairs their validity. Cook and Selltiz (1964) 
wrote: 

It seems reasonable that most respondents, pre- 
sented with tests in an academic setting . . . will 
assume that the responses which will place them in 
the most favorable light are those which represent 
them as well adjusted, unprejudiced, rational, open 
minded, and democratic [p. 39]. 

By inhibiting racial discrimination, the con- 
ventional experimental situation may provide 
built-in support for the principle of belief con- 
gruence. 

In the present study, the subjects filled out 
questionnaires in their homes in the presence 
of a graduate student. This was thought to 
attenuate, to some extent, the pressure felt by 
subjects not to discriminate on race. Even so,
it by no means guarantees that the subjects
were responding in a manner congruent with 
their racial attitudes rather than in a manner 
they believed to be socially acceptable. If the 
responses given to the paper-and-pencil instru- 
ment could be related to some sort of corresponding
overt behavior, their validity would
be further enhanced.

The behavior observed in the present study 
was that of petition signing. Each subject was 
given an opportunity to endorse a petition 
supporting the principles of open housing. 
Because the subjects were unaware that their 
responses would come under the scrutiny of a 
psychologist, it is quite probable that the 
subjects’ attitude toward selling their homes 
to a Negro accounted for a significant por- 
tion of the variance in their petition-signing 
behavior. The second hypothesis tested was 
that individuals who signed the petition would 
discriminate significantly less on the factor of 
race, when evaluating hypothetical persons as 
potential home purchasers, than those indi- 
viduals who refused to sign the petition. 


METHOD 


The study was conducted in a middle-class, ex- 
clusively white neighborhood. Within this area, five 
contiguous streets were arbitrarily chosen from 
which to draw subjects. All of the living units on 
these streets, as well as in the remainder of the 
neighborhood, were single family dwellings. Those 
who were later to serve as subjects were initially 
contacted in their homes by female students at 
Michigan State University who presented themselves 
as volunteer workers for the Urban League. Each 
individual was asked to sign a petition that read: 


A petition to be presented to the Mayor of 
Lansing, Michigan, sponsored by the Urban 
League of Lansing. We, as residents and prop- 
erty owners in Lansing, pledge not to discrimi- 
nate on race or ethnic background in the selling 
or renting of our homes. Further, we promise
to employ only those real estate agents who 
abide by the principle set forth above. 


The female students were instructed to make it 
clear to the homeowners that they were not obli- 
gated to sign the petition, but that if it accurately 
reflected their own views, their signature would be 
greatly appreciated. All homes on the designated
five streets were called on once. Individuals who
were not home obviously could neither sign nor 
refuse to sign the petition and therefore were elimi- 
nated from the possible subject pool. As a result of 
this procedure, 73 persons signed the petition, while 
73 refused.
From these individuals were selected 20 couples
who signed the petition and 20 couples that failed to 
do so. Homes at which both the husband and wife 
had either signed or refused to sign were selected, 
because in the later stages of the study it would save 
time to be able to administer questionnaires to two 
persons at once. Forty households, then, were sent 
letters in which it was explained that a study con- 
cerning the generation gap was being conducted by 
the psychology department at Michigan State. Those 
who received the letter were asked to participate in 
the study to provide an adult sample to be compared 
with the students at the university as a measure of 
the generation gap. Two weeks after the petition had 
been circulated, the experimenter began calling on 
those households that had received the letter. 

A total of 43 individuals, representing 23 separate 
households, consented to take part in the study.
Twenty-two of them had previously signed the peti- 
tion, while 21 had refused to sign. Of those who had 
signed, 12 were males and 10 were females. Of those 
who refused to sign, 10 were males and 11 were 
females. Sixteen of those who endorsed the petition
were Protestants, while 6 were Catholic. Of those 
who did not sign the petition, 17 were Protestants 
and 4 were Catholics. The average age of both 
groups was 45. The mean number of years of formal 
education for those who signed was 15, whereas for 
those who refused to sign, the corresponding figure 
was 144. 

During the first meeting with the subjects, a ques- 
tionnaire was administered in order to obtain 
demographic information and to ascertain the sub- 
jects’ attitudes toward four issues that were of 
current interest. These were: (a) our government’s 
past actions in Vietnam and its present policy; (b) 
the busing of students from one area to another to 
promote integration; (c) the manner in which 
authorities ought to handle youth protest demon- 
strations; and (d) the issue of birth control. Subjects 
were asked not only to express their position on 
each issue but to explain why they felt the way 
they did. After completing the questionnaire, all 
subjects agreed to have the experimenter return in 
about a month with several more questionnaires. 

Six weeks after subjects had completed this ques- 
tionnaire, the experimenter returned with another 
questionnaire which presented eight hypothetical 
persons for them to evaluate. Each hypothetical per- 
son represented some combinations of race (Cau- 
casian or Negro), socioeconomic status (high or 
low), and attitude similarity (similar or dissimilar). 
Operationally, attitude similarity consisted of four 
attitudes in accord with the subject’s, while attitude 
dissimilarity was communicated by four attitudes 
diametrically opposed to those of the subject. Thus, 
the attitudes of the eight hypothetical persons were 
tailor-made for each subject. 
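
The eight descriptions follow from fully crossing the three two-level factors named above. A minimal Python sketch of that crossing is given here for illustration; the function name and the dictionary keys are hypothetical and are not taken from the original materials.

```python
from itertools import product

# Levels of the three stimulus factors described in the text.
RACE = ("Caucasian", "Negro")
STATUS = ("high", "low")
ATTITUDE = ("similar", "dissimilar")


def build_hypothetical_persons():
    """Return one dict per cell of the 2 x 2 x 2 design (hypothetical helper)."""
    return [
        {"race": race, "status": status, "attitude": attitude}
        for race, status, attitude in product(RACE, STATUS, ATTITUDE)
    ]


persons = build_hypothetical_persons()
print(len(persons))  # one description per combination of the three factors
```

Each subject rated this same set of eight persons once per social context, so a subject contributed 8 x 4 = 32 ratings in all.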

In previous studies the hypothetical persons were 
conveyed to the subjects in a somewhat sketchy and 
possibly incredible manner either by means of check 
marks on attitude surveys or through brief phrases. 
An attempt was made in the present study to make
them more convincing by describing each in a substantial
paragraph containing direct quotes based on
the subject’s own responses. For example: 


John Washington Phillips is a Negro, 40 years old, 
is married, and has two children, Robert 15 and 
Risa 14. Mr. Phillips is a mailman in Detroit and 
graduated from Central High School in that city. 
As far as our involvement in Vietnam goes, he 
says “we’re wasting our money, men, and resources
over there when we have poor in our country that
need help, so I’m opposed.” “I also oppose the
busing of students to promote racial integration
since you can’t change people’s attitudes by forcing
others upon them.” With respect to demonstra- 
tions by the nation’s youth, he says, “peaceful 
protests that do not interfere with the majority’s 
rights should be allowed, but when demonstrators 
break the laws and destroy other people’s prop- 
erty, the police should deal quickly and firmly 
with them.” “I support birth control because it 
allows people to have only as many children as 
they can afford to bring up properly,” concluded 
Mr. Phillips. 


Socioeconomic status was conveyed by means of 
the occupation and educational background attributed 
to the hypothetical persons. High-status hypothetical 
persons held a graduate degree and were employed as 
either an accountant, lawyer, sales manager, or 
chemist, whereas low-status hypothetical persons had 
high school diplomas and were employed as either a 
bus driver, security guard, shoe salesman, or mailman. 
Below each paragraph describing a single hypo- 
thetical person was a scale numbered 1 to 9. By 
circling 1, the subjects could indicate extreme attrac- 
tion toward the hypothetical person. By circling 9, 
the subjects could indicate an intense antipathy 
toward the hypothetical person. As the set of eight 
hypothetical persons was presented to the subjects 
four times, four separate booklets were constructed 
that were identical except for their first page, which 
served to place the hypothetical persons in the roles 
in which the subjects were to evaluate them. The 
four roles or situations were friend, neighbor, home 
purchaser, and son-in-law. 

The second questionnaire used in this study was 
designed to tap the subjects’ perceptions of their 
friends’, neighbors’, and relatives’ feelings toward 
Negroes, since these might serve as perceived sources 
of social pressure for behaving in a racially dis- 
criminatory manner. On the first page of this ques- 
tionnaire, the subjects could indicate how they 
thought their friends would react to them if they 
befriended a Negro. Alternative responses were: 


1. It would probably raise their impression of 
me a notch or two as most of them would sort of 
admire me. 

2. They wouldn’t care one way or the other. 
Some might think a bit more of me for it, some a 
bit less, but most would feel the same toward me
as they always have.

3. Although they would still treat me as they
always have, most would probably object to my 
having a Negro for a friend and think a little less 
of me for it. 

4. They would object to my having a Negro as
a friend, and no doubt several would stop associ- 
ating with me on account of it. 

5. Certainly all my friends would feel very badly 
about my associating with a Negro, and would 
very likely devalue me and subsequently quit 
seeing me.


On the second page of the questionnaire, the subjects 
could indicate how they felt the majority of their 
neighbors would feel about having a Negro family 
as neighbors. Alternative responses were: 


1. They would prefer to live in an integrated 
community and thus would be happy to have a 
Negro on the block. 

2. It wouldn’t affect the people one way or the
other. They would think of and treat the Negroes
just as though they were white.

3. Although they would treat the family cordially,
they would probably prefer white neighbors
so that property values would be protected.

4. The people would very likely treat the Ne- 
groes coldly and feel badly about Negroes moving 
in. 

5. The people would be very unhappy about 
Negroes moving in and probably would react vio- 
lently. 


On the third and final page of the questionnaire, the
subjects could indicate how they thought their rela- 
tives would feel about a Negro marrying into the 
family. Alternative responses were:


1. They would be very happy about it as they 
have always looked up to and admired Negroes, 
both as individuals and as a race. 

2. They wouldn’t care one way or the other— 
upon marrying into the family the individual
would be treated just as any other relative.

3. Although the majority of my relatives would
treat the individual warmly, they probably would
have preferred that the family remain all Cau- 
casian for various reasons. 

4. Many of my relatives would no doubt object 
strongly to having a Negro in the family and 
would do all they could to dissuade the blood 
relative from marrying the Negro. 

5. The majority of my relatives would feel very
badly about a Negro marrying into the family 
and very likely would not associate with the Negro 
and his or her mate. 


Before they were asked to complete the question- 
naires just described, the subjects were reminded 
that the purpose of the study was to measure the 
generation gap by comparing their attitudes to those 
of the students at Michigan State. The subjects were 
then asked to evaluate the hypothetical persons, first 
as friends, then as neighbors, as home purchasers, and 
finally as sons-in-law. The hypothetical persons were
described as real individuals who had been inter-
viewed by the experimenter in the 6-week period 
since his last visit. Subjects were instructed to read 
the description of each hypothetical person as many 
times as they liked, but once having made their 
evaluation, not to refer back to it. As soon as the 
subjects finished one booklet, it was taken from 
them, and they were given another to complete. 
Upon completing their evaluations of the hypo- 
thetical persons, the subjects were given the second
questionnaire, which measured perceived social pressure
to discriminate on race. As husbands and wives
were usually filling out the questionnaires simultane- 
ously, they sat apart from one another so that the 
responses of one had no influence on the responses 
of the other. After filling out both of the question- 
naires, the subjects were thanked for participating in 
the study and told that they would be sent a pre- 
liminary report of the findings. One month later all 
subjects received a letter informing them of the 
study’s true purpose and of the results. 


RESULTS 


TABLE 1

PERCENTAGE OF SUBJECTS INDICATING VARIOUS
DEGREES OF PERCEIVED SOCIAL PRESSURE TO
DISCRIMINATE AGAINST NEGROES IN
THREE SITUATIONS

Response                      Situation
alternative     Friend    Home purchaser    Son-in-law
     1             0             0               0
     2            79             6               4
     3            17            74              14
     4             4            20              65
     5             0             0              17

Note. Responses 1 and 2 indicate no perceived social pressure
to discriminate. Response 5 indicates maximum perceived
social pressure to discriminate.


TABLE 2

ANALYSIS OF VARIANCE OF SUBJECTS’ RATINGS
OF HYPOTHETICAL PERSONS

Source                              SS      df   F              % variance
                                                                accounted for
Subjects (A)                   1123.82      42
Race (B)                        178.07       1   36.72**        1.4
  Error (A X B)                 203.58      42
Attitude (C)                   4818.76       1   241.18**       38.4
  Error (A X C)                 839.01      42
Status (D)                      182.41       1   35.35**        1.4
  Error (A X D)                 216.74      42
Situation (E)                  1099.90       3   60.60**        8.7
  Error (A X E)                 762.68     126
B X C                            64.10       1   21.58**        ...
  Error (A X B X C)             124.80      42
B X D                             8.95       1   6.08*          ...
  Error (A X B X D)              61.83      42
B X E                           395.27       3   54.22**        3.1
  Error (A X B X E)             306.19     126
C X D                            42.91       1   37.31**        ...
  Error (A X C X D)              48.49      42
C X E                           255.63       3   16.51**        2.0
  Error (A X C X E)             650.21     126
D X E                            58.32       3   7.93**         ...
  Error (A X D X E)             309.95     126
B X C X D                         1.22       1   .69            ...
  Error (A X B X C X D)          74.31      42
B X C X E                        70.39       3   17.12**        ...
  Error (A X B X C X E)         172.33     126
B X D X E                        51.44       3   [illegible]    ...
  Error (A X B X D X E)         124.39     126
C X D X E                        13.47       3   [illegible]    ...
  Error (A X C X D X E)         139.25     126
B X C X D X E                    23.75       3   8.16**         ...
  Error (A X B X C X D X E)     122.84     126
Total                         12545.22    1375

* p < .05.
** p < .01.


Table 1 shows the perceived social pressure
facilitating racial discrimination felt by the
subjects as a group in three of the four situations
in which the hypothetical persons were
evaluated. It is clear that as a group, the subjects
felt more pressure to discriminate against
Negroes as members of their families than as
potential home purchasers, while perceiving
more social pressure to discriminate against
Negroes as potential home purchasers than as
friends. With regard to subjects’ perceptions
of their friends’ reactions supposing that they
befriended a Negro, 34 (79%) chose Alternative
2, indicating their friends would think no
less of them. Apparently these persons perceive
no social pressure to discriminate
against Negroes as friends. Regarding sub-
jects’ perceptions of their neighbors’ attitudes 
toward having Negroes as neighbors, 32 
(74%) chose Alternative 3, indicating that 
although they thought their neighbors would 
treat a Negro family cordially, they would 
still prefer Caucasian neighbors. These per- 
sons no doubt perceive some social pressure 
not to sell their homes to Negroes. Finally,
concerning subjects’ perceptions of their relatives’
reactions if a member of their family
should marry a Negro, 28 (65%) selected
Alternative 4, and 7 (17%) chose Alternative
5. These individuals see their relatives as
strongly objecting to having a Negro in the
family and therefore perceive much social
pressure against having their children marry a
Negro.


TABLE 3

MEAN SUBJECTS’ RATINGS OF HYPOTHETICAL PERSONS AND PROPORTION OF
VARIANCE ACCOUNTED FOR BY VARIABLES ACROSS SITUATIONS

                        Friend             Neighbor           Home purchaser     Son-in-law
Factor         Level     Mean  % variance   Mean  % variance   Mean  % variance   Mean  % variance
Race (A)       N         4.47  .3           4.50  [illegible]  4.94  1.8          7.82  18.3
               W         4.77               4.51               4.16               5.40
Attitudes (B)  S         2.27  64.7         2.31  56.5         2.82  35.3         5.37  19.1
               D         6.94               6.69               6.29               7.84
Status (C)     HS        4.52  [illegible]  4.26  [illegible]  4.00  3.5          6.04  4.0
               LS        4.69               4.75               5.11               7.18
A X B          N-SA      2.17  [illegible]  2.38  [illegible]  3.33  [illegible]  7.18  4.4
               N-DA      6.72               6.61               6.55               8.46
               W-SA      2.38               2.25               2.31               3.56
               W-DA      7.16               6.76               6.02               7.23
A X C          N-HS      4.31  [illegible]  4.15  [illegible]  4.47  [illegible]  7.66  [illegible]
               N-LS      4.58               4.84               5.41               7.98
               W-HS      4.73               4.37               3.53               4.43
               W-LS      4.81               4.65               4.80               6.37
B X C          SA-HS     2.10  [illegible]  1.95  [illegible]  2.11  [illegible]  4.47  [illegible]
               SA-LS     2.45               2.68               3.53               6.27
               DA-HS     6.94               6.56               5.89               7.61
               DA-LS     6.94               6.81               6.58               8.08
A X B X C      N-SA-HS   1.88  [illegible]  1.88  [illegible]  2.67  [illegible]  6.95  .8
               N-SA-LS   2.46               2.88               4.00               7.41
               N-DA-HS   6.74               6.41               6.27               8.37
               N-DA-LS   6.69               6.81               6.83               8.55
               W-SA-HS   2.32               2.02               1.55               2.00
               W-SA-LS   2.44               2.48               3.06               5.13
               W-DA-HS   7.13               6.72               5.51               6.86
               W-DA-LS   7.18               6.81               6.53               7.60
Residual interactions          28                 29                 40                 35

Note. The lower the score, the more favorable the reaction. N = Negro; W = white;
S = similar; D = dissimilar; HS = high status; LS = low status.

Because over 90% of the subjects sense 
some social pressure to discriminate against 
Negroes both as home purchasers and sons-in- 
law, the principle of belief congruence need 
not accurately describe the subjects’ evalua- 
tions of the hypothetical persons in these 
situations. The validity of the principle of 
belief congruence would be seriously impaired, 
however, if it fails to accurately describe the 
subjects’ behavior in evaluating the hypo- 
thetical persons as friends, since a large ma- 
jority felt no social pressure to discriminate 
against Negroes in that situation. 

Table 2 shows the result of a four-way
analysis of variance with repeated measures
performed on the subjects’ ratings of the
hypothetical persons. All four main effects
were significant. High-status Caucasians, to
whom attitudes similar to those of the subjects
were attributed, were generally evaluated
most favorably. The situation in which
the person was evaluated accounted for 8.77%
of the total variance, second in importance
only to the attitude variable.

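
The F ratios and percentages of variance for the main effects can be checked from the sums of squares printed in Table 2. The sketch below is an illustration only; it assumes that each effect is tested against its Subjects X effect interaction (the term listed beneath it in the table) and that the percentage of variance is the effect sum of squares divided by the total sum of squares. The function and variable names are hypothetical.

```python
# Total sum of squares as printed in Table 2.
SS_TOTAL = 12545.22

# name: (SS_effect, df_effect, SS_error, df_error), all as printed in Table 2.
effects = {
    "race": (178.07, 1, 203.58, 42),
    "attitude": (4818.76, 1, 839.01, 42),
    "status": (182.41, 1, 216.74, 42),
    "situation": (1099.90, 3, 762.68, 126),
}


def percent_variance(ss_effect, ss_total=SS_TOTAL):
    """Share of the total sum of squares attributable to one effect, in percent."""
    return 100.0 * ss_effect / ss_total


def f_ratio(ss_effect, df_effect, ss_error, df_error):
    """Mean square of the effect divided by the mean square of its error term."""
    return (ss_effect / df_effect) / (ss_error / df_error)


for name, (ss, df, ss_err, df_err) in effects.items():
    print(f"{name}: F = {f_ratio(ss, df, ss_err, df_err):.2f}, "
          f"% variance = {percent_variance(ss):.2f}")
```

Under these assumptions the computed values agree with the printed ones to within rounding, which supports reading the indented rows of Table 2 as the error terms.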
Both attitude similarity and socioeconomic
status had a greater effect in determining
the subjects’ reactions to Caucasian hypothetical
persons than to Negro hypothetical
persons, as indicated by the significant Race
X Attitude and Race X Status interactions.
The Attitude X Status interaction achieved
significance because the status of the hypothetical
persons affected the ratings given
them significantly more when they espoused
attitudes similar rather than dissimilar to
those of the subjects.

The significant two- and three-way interac- 
tions involving the situation variable consti- 
tute the central focus of this study. Table 3 
shows the relative importance of the other 
variables within each of the situations. The 
principle of belief congruence (or attitude
similarity) accurately describes the subjects’
ratings of the hypothetical persons in three of
the four situations. The subjects discriminated
more on attitude similarity than racial similarity
in all instances except when evaluating
the hypothetical persons as sons-in-law. In
that situation there was considerable perceived
social pressure to discriminate against
Negroes, and the subjects did so. Yet even in
this case, attitude similarity played as important
a role as racial similarity in influencing
the subjects’ ratings.

Further examination of the results indicates 
that the data support the first hypothesis. In 
the situation of friendship, attitude similarity 
is approximately 215 times more important in 
determining preference than is racial similarity.
In the situation of home purchaser, attitude
similarity is approximately 20 times
more important than racial similarity, while
in the situation of son-in-law, the two variables
are of equal importance in determining
the subjects’ ratings. As the ratio between
variance due to attitude and variance due to
race becomes smaller, Table 1 shows that
perceived social pressure to discriminate
against Negroes increases, thus confirming the
first hypothesis.
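
The ratios quoted in this paragraph are the percentage of variance due to attitude divided by the percentage due to race within each situation (Table 3). A short illustrative sketch follows; the race entry for the friendship situation is taken to be .3, an assumption that matches the ratio of about 215 stated in the text, and the names used are hypothetical.

```python
# Percentage of within-situation variance attributed to attitude and to race.
# The friendship race value (.3) is an assumed reading consistent with the text.
variance_pct = {
    "friend": {"attitude": 64.7, "race": 0.3},
    "home purchaser": {"attitude": 35.3, "race": 1.8},
    "son-in-law": {"attitude": 19.1, "race": 18.3},
}


def attitude_to_race_ratio(cell):
    """How many times more variance attitude accounts for than race."""
    return cell["attitude"] / cell["race"]


ratios = {name: attitude_to_race_ratio(cell) for name, cell in variance_pct.items()}
for name, ratio in ratios.items():
    print(f"{name}: attitude/race = {ratio:.1f}")
```

The ratio falls from friend to home purchaser to son-in-law, the same order in which Table 1 shows perceived social pressure rising.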

A racial discrimination score, calculated 
by subtracting the sum of the ratings given 
the four Caucasian hypothetical persons from
the sum of the ratings given the four Negro
hypothetical persons, was determined for each
subject in each situation. In order to elimi- 
nate negative values, a constant of 9 was 
added to each score. Therefore a score of 9 
represented no discrimination on the factor of 
race, while scores of less than 9 indicated 
racial discrimination against Caucasians, and 
scores of more than 9 indicated racial discrimination
against Negroes. Table 4 shows
the mean racial discrimination scores in the 
four situations generated by the 22 subjects 
who signed the petition and the 21 subjects 
who refused. When evaluating the hypotheti- 
cal persons as home purchasers, those who 
signed discriminated significantly less against 
Negroes than those who refused to sign, thus 
supporting the second hypothesis. Also, those 
who endorsed the petition discriminated sig- 
nificantly less against Negroes than those who 
refused to sign when the hypothetical persons 
were evaluated as sons-in-law. But when rat- 
ing the hypothetical persons in the situations 
of friend and neighbor, there was no differ- 
ence in the amount of racial discrimination 
exhibited by those who signed the petition as 
compared to those who refused to sign. 
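
The racial discrimination score described above can be written out as a small function. This is a sketch of the arithmetic as stated in the text; the function name is hypothetical and the ratings in the examples are invented for illustration, not taken from the data.

```python
def racial_discrimination_score(negro_ratings, caucasian_ratings, constant=9):
    """Sum of the ratings given the four Negro persons minus the sum given the
    four Caucasian persons, plus the constant of 9 described in the text.
    Ratings run from 1 (extreme attraction) to 9 (intense antipathy)."""
    if len(negro_ratings) != 4 or len(caucasian_ratings) != 4:
        raise ValueError("expected four ratings per race")
    return sum(negro_ratings) - sum(caucasian_ratings) + constant


# Invented example ratings for one subject in one situation.
no_bias = racial_discrimination_score([2, 3, 7, 8], [2, 3, 7, 8])
against_negroes = racial_discrimination_score([4, 5, 8, 9], [2, 3, 7, 8])
against_caucasians = racial_discrimination_score([2, 3, 6, 7], [3, 4, 7, 8])
print(no_bias, against_negroes, against_caucasians)
```

A score of 9 marks equal treatment, scores above 9 mark less favorable ratings of the Negro persons, and scores below 9 mark less favorable ratings of the Caucasian persons.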


DISCUSSION 


This study makes it clear that neither a 
straightforward belief congruence explanation 
nor a straightforward social distance expla- 
nation is adequate to predict subjects’ evaluations
of others in various situations. The varying
importance of the race variable across
situations presents difficulties for both. 

TABLE 4

PETITION SIGNING AND RACIAL DISCRIMINATION SCORES IN FOUR SITUATIONS

                           Friend               Neighbor          Home purchaser     Son-in-law
Group               n      Mean   t             Mean   t          Mean   t           Mean          t
Signed petition    22      7.86   [illegible]   8.72   .86        10.09  2.44**      7.09          1.72
Refused to sign    21      7.71                 9.42              14.23              [illegible]

Note. The lower the score, the less discrimination against Negroes.
* p < .05.

Friendship is consistently placed near
the extreme intimacy end of a social distance
continuum (Bogardus, 1925; Triandis
& Triandis, 1965). Yet in this study less
racial discrimination was manifest in the 
friendship situation than in any other. Simi- 
larly, the social context of “neighbor” seems 
more intimate than that of “selling one’s 
home,” yet the former produced less racial 
discrimination than the latter. It appears 
necessary to take into account the per- 
ceived social pressure that the subjects felt 
toward racial discrimination in any given 
situation to understand why race was a less 
important criterion in the friendship situa- 
tion than the less intimate home purchaser 
situation. 

The principle of belief congruence fared 
somewhat better. Even in situations where 
the subjects as a group perceived consider- 
able social pressure to discriminate against 
Negroes, the attitude similarity variable ac- 
counted for a greater portion of the total vari- 
ance in responses to the hypothetical persons 
than did the race variable. In fact, this is 
the first study in which attitude similarity 
proved to be at least as important a determi- 
nant of responses as race when the hypo- 
thetical persons were evaluated as family 
members. This could be due either to the sub- 
jects’ reluctance to discriminate on race or 
their proclivity for discriminating on attitude 
similarity. The second alternative seems more 
compatible with the data, since the subjects 
showed no hesitancy to discriminate against 
Negroes in specific situations. It would appear 
that the subjects employed in the present 
study were more sensitive to the hypothetical 
persons’ attitudes than were subjects used in 
previous studies. 

The description of the hypothetical persons 
offers a possible reason for the subjects’ 
heightened sensitivity to the attitude simi- 
larity variable. Race was conveyed by a single 
word, and several short phrases provided in- 
formation from which the socioeconomic class 
of the hypothetical persons might be inferred. 
The remainder of the description was devoted 
entirely to attitudes. The large amount of 
space devoted to the hypothetical persons’ 
attitudes may have served as a demand char- 
acteristic in that it “told” the subjects that 
this was a variable of importance to the ex- 
perimenter and hence must be taken into 
account.

Fishbein (1963) suggested that an individual’s
attitude toward a stimulus person is
related not only to the favorableness of the 
beliefs attributed to the person but also to
the probability that the person in question 
actually subscribes to these beliefs. The use 
of full sentences and quotations rather than 
check marks to describe the attitudes of 
the hypothetical persons may have made it
highly credible to the subjects that the hypothetical
persons actually held the attitudes
ascribed to them. If so, then it is not sur- 
prising that large differences in attraction due 
to attitude similarity resulted. Thus, the in- 
creased credibility of the hypothetical persons 
might be a second reason that the variable 
of attitude similarity had such a great effect 
in the present study. 

Even though it was an important predictor 
of subjects’ responses within any situation, the
principle of belief congruence contains no 
inherent explanation for the differential im- 
portance of race and attitude across situa- 
tions. The principle needs to be supplemented 
with the notion of social pressure (contingent 
upon specific situations) to explain why there
is no discrimination against Negroes as friends
despite significant discrimination against
Negroes as sons-in-law. 

Tables 1 and 3 show clearly that as perceived 
social pressure to discriminate against Negroes 
increases across a situation, the importance of 
race as a factor in social choice also increases. 
This does not mean that we are in a position
to posit a causal relationship between the
tendency to discriminate against Negroes in a
given situation and social pressure to do so. 
What we have in fact is a measure of 
perceived, rather than actual, social pressure. 

Although the subjects themselves did not 
express discriminatory attitudes against Negroes,
they perceived their actual neighbors
as wanting to exclude Negroes from the neighborhood.
Possibly for this reason the subjects
indicated that they would rather sell their
homes to a Caucasian than a Negro. Super- 
ficially this suggests that perceived social 
pressure facilitates or even causes racially dis- 
criminatory behavior. This explanation seems 
questionable, however, if the subjects’ neigh- 
bors also participated in the study as subjects. 
It is as if each subject was denying prejudice
in himself while attributing it to others. Quite
possibly the subjects were not giving honest 
responses to the hypothetical persons, but
were instead projecting their own prejudices 
onto their neighbors, rendering both of the 
paper-and-pencil instruments invalid. Al- 
though this may well have happened to some 
extent, evidence reported earlier showing a 
significant relationship between the responses
to the hypothetical persons and petition sign- 
ing leads us to believe that the former have 
some validity. 

Another possible explanation of the dis- 
crepancy between the subjects’ report of their 
own attitudes and the perceived attitudes of 
their neighbors is that the sample on which 
the study is based inevitably consisted only
of those residents who agreed to take part. 
It is possible that the subjects’ neighbors 
who refused to participate in the study could 
be identified as the source of most of the 
social pressure to keep Negroes out of the 
neighborhood. If so, the nature of the sample 
would explain the apparent discrepancy. Of 
course, the nonprobability nature of the 
sample makes generalization difficult.

According to Robinson (1950), a better 
case might be made for the existence of a 
relationship between perceived social pressure 
and racial discrimination by looking at cor- 
relation coefficients between racial discrimina- 
tion scores and perceived social pressure scores 
within the three appropriate situations. The 
significance of such correlations would be a 
necessary though not a sufficient condition
for supporting a causal relationship between 
perceived social pressure to discriminate 
against Negroes and actual racial discrimination.
Unfortunately, little would be achieved
by this procedure in this study, since there
was too little variability among subjects on the
measures of perceived social pressure to calculate
a meaningful statistic. Given significant
coefficients, it would still be impossible to 
decide whether social pressure causes dis- 
crimination or whether perceived social pres- 
sure might be due to the subjects’ own atti- 
tudes rather than to the attitudes of the 
subjects’ friends, neighbors, or relatives. Sev- 
eral investigators (Campbell, 1950; Katz, 
1947; Travers, 1941; Wallen, 1943) have in 
fact found correlations between a person’s
own attitude and his estimate of the group’s
attitude in circumstances in which it was 
unlikely that the group was exerting any 
pressure for conformity. Of course, the old- 
fashioned explanation that prejudiced people 
have prejudiced associates would still remain 
plausible. A conclusive demonstration of a 
causal relationship would involve experimen- 
tally manipulating social pressure and ob- 
serving the effects on manifestations of racial 
discrimination. 
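
The distinction drawn above between correlations computed across situations and correlations computed across subjects within a situation can be illustrated with a small numerical sketch. The scores below are invented purely for illustration and are not data from this study; the situation labels and the helper `pearson` are likewise assumptions. The sketch shows that situation means can correlate perfectly even when the within-situation correlations run in the opposite direction, which is why the aggregate pattern alone cannot establish a relationship at the level of the individual subject.

```python
import math

def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical (perceived pressure, discrimination) scores for three subjects
# in each of three situations. These numbers are made up for illustration.
scores = {
    "situation_a": [(1, 3), (2, 2), (3, 1)],
    "situation_b": [(4, 6), (5, 5), (6, 4)],
    "situation_c": [(7, 9), (8, 8), (9, 7)],
}

# Correlation across subjects within each situation (individual level).
within = {
    name: pearson([p for p, _ in rows], [d for _, d in rows])
    for name, rows in scores.items()
}

# Correlation across the situation means (aggregate level).
mean_pressure = [sum(p for p, _ in rows) / len(rows) for rows in scores.values()]
mean_discrimination = [sum(d for _, d in rows) / len(rows) for rows in scores.values()]
between = pearson(mean_pressure, mean_discrimination)

# Correlation when all subjects are pooled, ignoring situation.
all_rows = [row for rows in scores.values() for row in rows]
pooled = pearson([p for p, _ in all_rows], [d for _, d in all_rows])

print("within situations:", within)
print("between situation means:", between)
print("pooled:", pooled)
```

In this contrived case the aggregate and pooled correlations are strongly positive while every within-situation correlation is negative, so the within-situation coefficients carry information that the aggregate pattern does not.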

With respect to the hypothesis concern- 
ing petition signing, it is possible to argue 
that the insignificant relationship between 
petition signing and discrimination against 
Negro hypothetical persons as friends and 
neighbors suggests that the subjects were re- 
sponding not in accord with their attitudes 
toward the paper-and-pencil stimuli but rather 
in accord with the demand characteristics of 
the experimental situation. However, the fact 
that there was no relationship is not sur- 
prising if one considers the subjects’ attitudes 
toward the several situations in which the 
Negroes were encountered. Rokeach (1968) 
has posited: “a person’s social behavior must 
be mediated by at least two types of atti- 
tudes—one activated by the object and the 
other activated by the situation [p. 455].” 
This view implies that a person’s evaluations 
of an object and behavioral intentions might 
well vary depending on the situation in which 
the object was found. For example, a subject 
might have very different reactions toward 
the stimuli “Negro friend” and “Negro home 
purchaser.” It is likely that the petition pre- 
sented to the subjects primarily called to 
mind the concept of a Negro in the role of 
home purchaser and only to a lesser extent 
the concept of a Negro in any of the other 
situations. Thus, the relationship between 
racial discrimination scores and petition sign- 
ing ought to be strongest when the hypotheti- 
cal persons were evaluated as home pur- 
chasers. Indeed, this was the case. However, 
when the hypothetical persons are evaluated 
as friends and neighbors, it seems likely that 
the salient attitude complexes are somewhat 
different from that invoked by petition sign- 
ing. Thus, the failure of the relationships to 
reach significance in these instances does not 
seem surprising. Perhaps the most puzzling
finding is that those who signed the petition
discriminated significantly less against Ne- 
groes as sons-in-law than those who refused 
to sign. By the argument given above, 
one would not expect a significant relationship 
between these measures.

This study attempted to explore some of 
the parameters affecting the importance of 
race, attitude similarity, and social status as 
factors in social choice. The results suggest 
that the situational context is critical in deter- 
mining the importance of race relative to 
attitude similarity, and that it is probably 
perceived social pressure which makes race 
salient in some situations and not in others. 
Further, the strength of a relationship be- 
tween social choice and an external criterion 
(petition signing in this case) depends 
upon the degree to which the same attitude 
components are present in both.


COMMENT ON HARVEY AND MILLS’ “EFFECT OF A DIFFICULT
OPPORTUNITY TO REVOKE A COUNTERATTITUDINAL
ACTION UPON ATTITUDE CHANGE”


JOHN C. TOUHEY ¹


University of Nevada, Reno 


Harvey and Mills’ finding that a revocable act of counteradvocacy leads to
more attitude change than an irrevocable act does not challenge Bem’s self- 
perception theory. The authors failed to report either perceived choice 
measures or simulation data in support of their argument. A discrepancy 
manipulation, for which dissonance theory and self-perception theory make 
opposite predictions, was not checked. Finally, Harvey and Mills contend that 
a significant Revocability X Extremity interaction, a post hoc analysis, refutes 
Bem’s position, but their argument rests on the untested assumption that more
internal cues are associated with extreme attitudes.


Controversies between Bem (1967, 1968) 
and advocates of cognitive dissonance theory 
(Mills, 1967) have enriched our recent litera- 
ture, but misunderstanding of the self- 
perception theory persists. Harvey and Mills 
(1971) contend that their finding of greater 
attitude change for revocable rather than ir- 
revocable acts of counteradvocacy constitutes 
a challenge to Bem’s theory. That Harvey and 
Mills have constructed a manipulation that 
appears to facilitate attitude change is not in 
question. Their principal argument against
self-perception theory, that “Bem’s behavioristic
orientation would seem to prohibit him
from assuming that a person could infer his 
attitudes on the basis of a decision which 
was not overt [p. 209],” however, invites 
confusion. 

The test of the self-perception theory is 
not whether it “seems” to be limited to overt 
behavior but whether an outside observer can 
reproduce the attitudes of the experimental 
subjects. By approaching the Harvey and 
Mills data from this standpoint, an observer 
could infer that a subject who speaks against 
student participation in university policies
and then refuses an offer to revoke his speech 
acts from conviction, while little or nothing 
can be known about the “true” attitude of a 
subject who speaks against student control at 
the behest of the experimenter. In addition, 


1 Requests for reprints should be sent to John C. 
Touhey, Department of Psychology, University of 
Nevada, Reno, Nevada 89507.


subjects who refuse an opportunity to undo 
their actions also permit the observer to at- 
tribute more perceived freedom of choice to 
their decision to speak. Harvey and Mills’ 
failure to report perceived choice measures 
for each treatment group omits crucial data 
that might support or refute their argument. 
It is possible, of course, that outside observers 
might attribute more choice to the subjects 
who refuse to revoke their actions than these 
subjects attribute to their own acts. A second 
outcome would have subjects attributing more 
choice to their actions than do observers, and 
either finding could count heavily against the 
self-perception theory. But there is also a 
third possibility, namely, that both subjects 
and observers infer more choice and, there- 
fore, more favorable attitudes. Such a finding 
would provide further evidence for a hypothe- 
sis of isomorphism between the attributions 
of actors and observers. Reporting simula- 
tion data in support of this argument, how- 
ever, would appear gratuitous in view of the 
fact that Harvey and Mills omitted all choice 
data from their paper. 

Publishing one of the few dissonance studies 
in recent years to omit choice data, Harvey 
and Mills do little to alleviate the confusion 
with their discrepancy manipulation. Half of 
the subjects were told that previous subjects 
had “[thought] of the relationship between 
their previous ideas on the issue and what 
they said in the speech [p. 204].” Dissonance 
theory predicts that increasing discrepancy 
between prior attitudes and behavior will lead 


to more attitude change; the self-perception 
theory predicts less change. The findings 
showed no difference between the discrepancy 
and no-discrepancy treatments, but since 
Harvey and Mills did not report a manipula- 
tion check, the reader can only consider two 
plausible but equally unverifiable interpreta- 
tions. On one hand, the discrepancy manipula- 
tion may have failed to raise the salience of 
the initial attitudes, and, on the other hand, 
increased salience may have been manipu- 
lated but without consequences for attitude 
change.² Since the manipulation is unchecked,
it is not possible to discriminate between 
these two interpretations. 

Although attitude change was not influ- 
enced by their discrepancy manipulation, 
Harvey and Mills (1971) reported a post hoc 
two-variable analysis including revocability 
and extremity of initial attitude as indepen- 
dent variables. They argue that the self-per- 
ception theory predicts less attitude change for 
subjects holding extreme attitudes because the 
“subjects with extreme initial attitudes should 
have had more [internal] cues [p. 208].” 
Dissonance theory, presumably, predicts the 
opposite result. Subjects with the most favor- 
able attitudes toward student control did 
show greater change, but the causal sequence 
remains unarticulated. Harvey and Mills pro- 
vided no evidence to show that extreme atti- 
tudes are more salient or important than mod- 
erate attitudes toward student control. To be 
sure, several studies of attitude change in 
field experiments have provided evidence that 
supports a positive correlation between the 
extremity of an attitude and its importance 
(e.g., Hovland, Harvey, & Sherif, 1957), but
independent specifications of salience and importance
are seldom employed in laboratory studies of attitude
change, least of all in dissonance studies. In fact,
generalization from the findings of field studies would
lead to the same prediction as the self-perception
theory, that [subjects with extreme attitudes]
change less as a consequence of counteradvocacy.


2 An unpublished study by Snyder and Ebbesen
(1971) directly manipulated the salience of initial
attitudes and found less attitude change. The finding
supports the self-perception rather than dissonance
theory and suggests that the discrepancy manipulation
employed by Harvey and Mills was not
successful.

In conclusion, Harvey and Mills have identified
an interesting variable to increase attitude
change in forced-compliance paradigms,
but the self-perception theory can also predict
that observers will attribute more favorable
attitudes to subjects who refuse an opportunity
to revoke their actions. To restate Bem’s
(1968) point, an attack on the self-percep- 
tion theory must challenge the assumption 
that subjects and observers are functionally 
equivalent. It may be trivial, then, to point 
out that the minimal requirement entails two 
sets of data (one for observers and one for
actors), but Harvey and Mills appear to be- 
lieve that they can know the predictions of 
outside observers before they are asked.


CAN SELF-PERCEPTION THEORY EXPLAIN THE FINDINGS
OF HARVEY AND MILLS (1971)?


JUDSON MILLS AND JOHN HARVEY


Harvey and Mills’ position that their findings were difficult to explain in terms
of Bem’s self-perception theory is not challenged by Touhey’s comment. Touhey
fails to provide an explanation for Harvey and Mills’ findings in terms of
self-perception theory or to mention any specific objections to Harvey and Mills’
discussion of how self-perception theory would bear on their findings.


In a recent article (Harvey & Mills, 1971),
we took the position that our findings were
difficult to explain in terms of the self-perception
theory that Bem (1967) proposed
as an alternative interpretation of cognitive
dissonance phenomena. Disagreement with
this position has been expressed by Touhey
(1972) in a comment on our article.

In our study, college men were induced to
record a speech arguing for a position that
was discrepant with their prior attitudes.
Our major finding was that subjects in our
revocable-commitment conditions who were
given an opportunity to record a substitute
speech at an inconvenient time and place
showed more attitude change toward consistency
with the action than the subjects in our
irrevocable-commitment conditions who were
not given the difficult opportunity to revoke
their counterattitudinal action.

In his comment, Touhey (1972), making
the assumption that the test of self-perception
theory is whether an outside observer can
reproduce the attitudes of experimental
subjects,² suggested that


an observer could infer that a subject who speaks
against student participation in university policies
and then refuses an offer to revoke his speech acts
from conviction, while little or nothing can be
known about the “true” attitude of a subject who
speaks against student control at the behest of the
experimenter [p. 269].


1 Requests for reprints should be sent to Judson
Mills, Department of Psychology, University of
Maryland, College Park, Maryland 20742.

2 This assumption is not made by Bem in his
recent writings. “No ‘as if’ methodology, including
the technique of interpersonal simulation, is an
adequate substitute for the intensive study of the
actual situation being modeled [Bem, 1967, p. 273;
Bem & McConnell, 1970, p. 25].”


This suggestion by Touhey overlooks the fact 
that the subjects who were given the difficult 
opportunity to revoke their counterattitudinal 
action did not refuse the offer. As we took
pains to emphasize in discussing our results, 


The overt behavior of subjects in the revocable- 
commitment conditions was identical to the overt 
behavior of subjects in the irrevocable-commitment 
conditions. In fact, no overt action was necessary at 
that time or in the future in order for subjects in 
the revocable-commitment conditions to decline the 
opportunity [Harvey & Mills, 1971, p. 208]. 


Since the subjects in our revocable-commit- 
ment conditions did not refuse the offer, 
Touhey’s (1972) suggestion that “subjects 
who refuse an opportunity to undo their ac- 
tions also permit the observer to attribute 
more perceived freedom of choice to their 
decision to speak [p. 269]” is also irrelevant. 

Not only did Touhey fail to provide an 
explanation in terms of self-perception theory 
for our finding that giving a person a difficult 
opportunity to revoke a discrepant action will 
increase attitude change in the direction of 
consistency with the action, but he did not 
mention any specific objections to our discus- 
sion of how self-perception theory would bear 
on this finding. What we said about the 
finding was as follows: 


The greater attitude change in the revocable- 
commitment conditions is difficult to explain in terms 


3 Prior to the postmeasure of attitude. After the 
completion of the postmeasure, subjects in the 
revocable-commitment conditions were asked if they 
were planning to return and record another speech, 
and subjects in the irrevocable-commitment condi- 
tions were extended the same opportunity given 
subjects in the revocable-commitment conditions and 
asked if they thought they might return. None of 
the subjects in either condition indicated they might 
return.


of Bem’s self-perception approach (1967a). The
instructions given to the subjects in the revocable- 
commitment conditions (i.e., that they could compose 
and bring in another speech on the same issue, 
saying whatever they preferred) could have provided 
them with a cue that they should not give the action 
much weight in the judgment of their attitudes. If
so, Bem’s approach would predict less attitude change 
in the revocable-commitment conditions. To the ex- 
tent that giving the subjects the opportunity to 
revoke the discrepant action increased the salience of 
the prior attitude, Bem’s position would also predict 
the opposite results. Increasing the salience of the 
subject’s prior attitude should have provided him 
with another cue which he could use in inferring his 
attitude. Since this cue should have conflicted with
the cues from the subject’s behavior, the behavioral 
cues should have had less weight in the subject's 
judgment of his attitude [Harvey & Mills, 1971, 
p. 208]. 

It might be thought that Bem’s position could 
account for the results by assuming that the sub- 
jects in the revocable-commitment conditions decided 
not to exercise the opportunity and then used this 
decision to judge their attitudes. However, since the 
overt behavior of the subjects in the revocable- 
commitment conditions was identical to the overt 
behavior of the subjects in the irrevocable-commit- 
ment conditions, the subjects who were given the 
opportunity had no more behavioral cues with which 
to judge their attitudes than the subjects who were 
not given the opportunity. Bem’s behavioristic orien- 
tation would seem to prohibit him from assuming 
that a person could infer his attitude on the basis 
of a decision which was not overt [Harvey & Mills, 
1971, p. 209].


Since we said nothing in our discussion of 
self-perception theory about perceived choice 
or the predictions of observers, Touhey’s 
assertion that we failed to report either per- 
ceived choice measures or simulation data in 
support of our argument is irrelevant.⁴

Another finding of our study was that the 
effect of the difficult opportunity to revoke 
the discrepant action was greatest for those 
subjects whose prior attitudes were most dis- 
crepant with the action. Touhey did not offer 
any explanation in terms of self-perception 
theory for this finding. He also did not men- 
tion any specific objections to our discussion 
about how self-perception theory would bear 
on this finding. What we said about it was 
as follows: 


The finding that the effect of the opportunity to 
revoke the discrepant action was greater when the
subjects’ initial attitudes were extreme than when
they were moderate is particularly difficult to explain
from Bem’s position. The subjects with extreme
initial attitudes should have had more cues to use
in inferring their attitudes which conflicted with
the cues from their behavior, and thus the cues from
their behavior should have had less weight [Harvey
& Mills, 1971, pp. 208-209].


4 We failed to report such data because they were
not collected. They were not relevant to testing the
hypotheses which our study was designed to test.


Since in our discussion we did not assume 
that extreme attitudes are more salient or im- 
portant than moderate attitudes or that more 
internal cues are associated with extreme atti- 
tudes, Touhey’s comment that we did not 
provide evidence for these assumptions is 
irrelevant. 

On one point we are in agreement with 
Touhey, namely, his suggestion that our dis- 
crepancy reminder manipulation may have 
failed to raise the salience of the initial atti- 
tude. In fact, we made this very point in our 
article. We stated in our abstract that “An 
attempt to directly manipulate the salience
of the discrepancy between the action and 
prior attitudes was unsuccessful [Harvey & 
Mills, 1971, p. 201],” and we noted in the 
discussion that “It is possible that the instruc- 
tions designed to directly manipulate salience 
may have had little impact on the subject 
[Harvey & Mills, 1971, p. 209].”


POLITICAL VIEWS AND INTERPERSONAL BEHAVIOR ¹


AMERIGO FARINA,² BARRY CHAPNICK, JASON CHAPNICK,
AND RAFAELLO MISITI ³


University of Connecticut


The present study is concerned with the effect of political beliefs on
interpersonal perception and behavior. Thirty extreme liberals and 30 extreme
conservatives were selected from a large introductory psychology class. A
confederate, in the guise of a naive subject, presented himself as a liberal or a
conservative, after which the subjects had to teach him an arbitrarily correct
set of button presses. Communication was restricted to informing the confederate
that he was wrong by shocking him. Intensity and duration of shocks
and responses to a postexperimental questionnaire were the dependent measures.
The target person was less favorably perceived and more painful shocks were
administered to him when his political views differed from those of the
subject. Other data obtained were interpreted as indicative of major differences
between liberals and conservatives in the way they perceive and behave toward
others.


A series of studies concerned with stigmatizing
conditions have revealed a number of
interesting things about the impact of such
conditions on interpersonal behavior. In one
of the earlier studies, a confederate posing
as a naive subject reported to undergraduate
males either a history of mental illness
or indicated that he was an average student
(Farina, Holland, & Ring, 1966). The real
subject was then led to believe that he had
to teach a particular pattern of button presses
to the confederate, who, meanwhile, had been
taken to another room. They could only communicate
by means of electrically connected
panels that allowed the subjects to inform
the confederate that his guess was wrong by
administering an electric shock to him. The
subject also believed he could steer the confederate
toward the correct guess by setting
the shock level at any one of 10 intensities
and varying the length of the shock. The results
were that the confederate was shocked
for longer periods when he was perceived
as stigmatized. However, there were no
differences for intensity.


1 This study was, in part, financed by research
grants from the National Science Foundation
(GS-2646) and the University of Connecticut Research
Foundation. Thanks are due Kenneth Ring
and Lloyd Stires for their helpful criticism of this
manuscript.

2 Requests for reprints should be sent to Amerigo
Farina, Department of Psychology, University of
Connecticut, Storrs, Connecticut 06268.

3 Director, Istituto Nazionale di Psicologia del
C.N.R., Rome, Italy.

The findings suggested the possibility that 
there are strong social pressures tending to 
make subjects treat stigmatized individuals 
the same as anyone else, in spite of their 
underlying feelings about them. This the 
subjects managed to do for the intensity 
measure that they had to select and, hence, 
knew clearly. But when they were less able 
to monitor their behavior, as in the case of 
shock duration, their generally unfavorable 
feelings toward the stigmatized led them to 
give longer shocks. It was hypothesized, then, 
that people may mistreat stigmatized others 
only when they are not fully aware of what 
they are doing. 

In a later study, the stigma investigated 
was degree of physical deformity (Farina, 
Sherman, & Allen, 1968). The same con- 
federate either wore a shoe that was built 
up on one side and limped slightly, or he 
came in a wheelchair and one of his legs 
appeared to be missing. This illusion was at- 
tained by means of a specially altered chair 
that had a hidden compartment for the con- 
federate’s leg. The same teaching task was 
assigned to the subjects, and the same ap- 
paratus was employed. Again, significant dif- 
ferences were found for the duration but not 
for the intensity measure, and this time 
shorter shocks were administered to the confederate
when he was severely stigmatized.
As in the preceding study, it seemed possible
that subjects tried to behave as if the
stigma did not exist, although their under- 
lying feelings of sympathy led them to act 
more favorably toward the amputee when 
their behavior could not be easily monitored. 

Consistent results were also obtained in a
quite different type of study in which subjects’
beliefs about their own emotional stability
were manipulated (Ring & Farina,
1969). The intent of the study was to evalu- 
ate the effect of degree of similarity to a 
stigmatized person on the treatment accorded 
to that person, in this case ostensibly an 
ex-mental patient. Increasing perceived simi- 
larity led to shorter shocks being adminis- 
tered, but again no differences were found 
for intensity. 

In an effort to evaluate the tenability of 
this hypothesis, two procedures were carried 
out. The subjects of the Ring and Farina 
(1969) study were asked, after the task had 
been completed, to estimate the mean shock 
intensity and duration that they had used. 
The correlations between estimated and actual 
mean shock intensity and duration were,
respectively, .93 and .38, a difference that is
highly significant (p < .0001). This indicates
that subjects are more aware of the former 
than of the latter type of behavior, as the 
hypothesis demands. In the other procedure, 
a female confederate revealed to female undergraduates
that she was either an ordinary
student or an unrepentant prostitute (Mosher, 
Farina, & Gliha, 1970). The reasoning here 
was that there would not be social pressures 
to treat her like anyone else, and, indeed, less 
favorable treatment might be viewed as 
proper. The same task and apparatus were 
used, and this time, no duration differences 
were found, but, in accord with theoretical 
expectations, the intensity scores were signifi- 
cantly higher in the prostitute condition. 
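
The comparison of the two correlations reported above (.93 for intensity and .38 for duration) can be sketched with Fisher's r-to-z transformation. This is only an illustration under stated assumptions: the passage does not give the sample size, so `n1 = n2 = 40` below is a hypothetical value, and the function treats the two correlations as independent, whereas in the study both came from the same subjects. It should not be read as the test the authors actually used.

```python
import math

def fisher_z(r):
    """Fisher r-to-z transformation (inverse hyperbolic tangent)."""
    return math.atanh(r)

def compare_independent_correlations(r1, r2, n1, n2):
    """Return (z, two-sided p) for the difference between two correlations.

    Assumes the correlations come from independent samples, which is a
    simplification for correlations obtained from the same subjects.
    """
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (fisher_z(r1) - fisher_z(r2)) / standard_error
    # Two-sided tail area of the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p

# .93 and .38 are the reported correlations; the sample sizes are hypothetical.
z, p = compare_independent_correlations(0.93, 0.38, n1=40, n2=40)
print("z =", round(z, 2), "two-sided p =", p)
```

With a different sample size the value of z changes, so the sketch only shows the form of the comparison and makes no claim about the exact statistic behind the reported significance level.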

The present study was concerned with a 
variable seemingly quite different from those 
just considered, that is, political opinion and 
its behavioral significance. Liberals and con- 
servatives encountered a confederate who 
presented himself either as an extreme liberal 
or as an extreme conservative. The subjects 
were then given an opportunity of expressing 
their aggression toward him, both verbally
and behaviorally. In this way, it was hoped
that the procedures and ideas developed to 
understand stigma could be made useful in 
clarifying the interpersonal significance of po- 
litical views. For example, consideration sug- 
gests that in America there is a strongly held 
ideal that people ought to be allowed to dissent
and that, within broad limits, differences
in political opinions are a good thing. If people
act according to this ideal, then there
ought to be no relationship between harshness
of treatment and political views as far
as the intensity of shock is concerned. It
might still be the case, even if there is general
adherence to this ideal, that people will
behave more harshly toward those who differ
from them, without any recognized wish to
do so. The duration of the shock should
reveal this. 

Other considerations, however, suggest that 
contemporary values do not demand even- 
handed treatment of others, regardless of their 
political views. There certainly has been a lot 
of violence on college campuses recently 
centered on what political views ought to 
prevail, especially in regard to Vietnam and
minority groups. Some extreme factions
among liberal students have been advocating 
violence toward those who express opposing 
views. Even relative moderates report increasingly
militant feelings and a desire for action
(Keniston, 1968). Thus, it may be believed 
that those who express markedly different political
views should be treated more punitively,
and the intensity measure would reveal
this. In any case, there are reasons for expect- 
ing the radicals and conservatives to behave 
in different ways. Keniston (1968) reports 
the group of radicals he studied to be ex- 
tremely candid and nonviolent people. This 
suggests that if they act punitively toward 
another because of his views, they will do so
openly by increasing the intensity of the
shocks they administer. On the other hand,
the conservatives are higher on authoritarian- 
ism than the radicals (Janowitz & Marvick, 
1953) and have been shown to display pre- 
cisely those behaviors that they appear to 
find abhorrent (Wrightsman, 1969). This 
suggests that they might feel more hostility 
toward radicals than they openly recognize
and, thus, use the more subtle duration measure
in punishing them.

METHOD

Subjects

All male subjects in three large introductory psychology
classes at the University of Connecticut
were administered questionnaires entitled, “Opinions
About Important World and National Issues.” The
subjects were asked to indicate the extent of their [...]
As an example, Item 4 was as follows: “The only
solution for the rampant crime [...] nation is strict
enforcement of our laws.” A total of 573 usable
questionnaires were obtained, and the 30 [...]

Procedure

Subjects were contacted by telephone and asked
to come to a room where they were to meet the [...]
a naive subject. It was explained that the experi[...]
and it was designed to answer [...]
The subjects were then told [...]
of things desired. [...] we could evaluate the role of
such information in interpersonal communication.
The first question asked him how he spent his time
at the university. For all conditions, the confederate
responded in the same way, appearing to be an
average student. For example, he indicated a liking
for movies and some of his courses. He was then
asked to give his opinion about some issues viewed
as important at the local, national, and international
levels. Here the confederate responded as an extreme
conservative for half of each kind of subjects and
as a radical for the remainder. Thus, with regard
to Vietnam, he said either that we should stop
fooling around and stop them there right now or
else that we ought to get out right away since the
war was immoral. With regard to demonstrations by
radicals against employment interviews, he said
either that they were making fools of themselves or
else he approved, indicating that action had to be
taken by someone.

After this, they were shown the [...] subject [...]
panel, and three lights, labeled A, B, and C, on
the panel were pointed out which, they were told,
corresponded to three buttons on the learner’s panel.
The panel also had a shock delivery button. Another
button allowed the shock to be set at any of [...]
clearly marked intensities that were alleged to deliver
increasingly stronger shocks. They were informed
that the learner would have 30 trials at guessing the
correct pattern of five button presses that only the
teacher knew. However, the teacher could inform [...]
“Shock” was delivered to check the apparatus. The
apparatus also allowed the confederate to record
the intensity and duration of the “shock” delivered
after each guess.

After this task was completed, the naive subject
was given a questionnaire, which he was told would
be held in strict confidence. Meanwhile, the learner
was allegedly completing a questionnaire in the adjoining
room. The questionnaire consisted of 10 items
designed to obtain the subject’s impression of the
task and the confederate. Finally, they were brought
together, and an effort was made to ascertain if any
were suspicious, and none appeared to be. One
subject was dropped from the study
because he misread a pattern as correct, a[...]
delivering shocks.

RESULTS

[...] This asked the
subject to rate the political beliefs of the
learner on a 5[...]tive condition, he was rated as conservative
or very conservative by all 30 subjects. In the
radical condition, he was rat[...]
or very radical by all subjects. Thus, he was
perceived as intended.

The results concerning the shock data are [...]
Table 1. They were analyzed by
analyses of variance as described by Winer
(1962), which are suitable for three-factor
experiments such as the present one. There
are two between-subjects factors (type of subject
and experimental condition) and one
within (trials). [...]


TABLE 1


MEAN SHOCK INTENSITY AND DURATION BY TYPE OF
SUBJECT AND EXPERIMENTAL CONDITION


                               Condition
                  Conservative            Radical
Subjects       Intensity  Duration   Intensity  Duration
Conservative      4.5       64.7        5.0       83.9
Radical           7.0       51.6        4.5       61.5


each dependent measure were significant 
beyond the .01 level except for the condition 
(confederate liberal or conservative) main 
effect for duration, which, however, was sig- 
nificant at the .05 level. Some of these results, 
such as main effects for trials, are not particularly
interesting, but others are noteworthy.
Thus, more intense shocks are given
when the confederate’s political views differ 
from those of the subject, and the radicals 
deliver particularly strong shocks. The con- 
servatives also deliver longer shocks to the 
radical target person, but the radicals actually 
administer shorter shocks to the conservative
than to the radical target. For duration, the
conservatives are exceptional in that they
deliver especially long shocks.

In an effort to better comprehend what was
occurring, duration and intensity of the 
shocks given were correlated. The overall co- 
efficient for all subjects combined was very 
low (.29). However, the pattern obtained 
when the four groups were considered sepa- 
rately was quite interesting. These correla- 
tions are shown in Table 2. As may be seen, 
three of the four correlations are positive, but 
for the radical subjects with the conservative 
target person, the coefficient is negative and 
fairly large. Interestingly, the next largest 
coefficient is for conservatives with the radical 
target (.25). This coefficient differs at a sta- 
tistically significant level from that of the 
radicals with the conservative target (—.51). 
Thus, the radicals differ from the conserva- 
tives not only in delivering shorter shocks to 
those whose views are unlike their own, but 
also by decreasing the duration of the shock 
as they increase intensity. 
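
The text does not say which test was used to compare the two correlations (.25 and -.51). One common choice for independent groups is Fisher's r-to-z transformation, sketched below. The group size of 15 is purely an assumption for illustration, since the cell sizes are not reported in this section, and the function names are hypothetical.

```python
import math

def fisher_z_difference(r1, n1, r2, n2):
    """z statistic for the difference between two independent correlations."""
    z1 = math.atanh(r1)  # Fisher r-to-z transform of the first coefficient
    z2 = math.atanh(r2)  # Fisher r-to-z transform of the second coefficient
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (z1 - z2) / standard_error

def two_tailed_p(z):
    """Two-tailed p value of z under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2.0))

# Correlations come from the text; the group size is assumed, not reported.
ASSUMED_N_PER_GROUP = 15
z = fisher_z_difference(0.25, ASSUMED_N_PER_GROUP, -0.51, ASSUMED_N_PER_GROUP)
p = two_tailed_p(z)
print(f"z = {z:.2f}, two-tailed p = {p:.3f}")
```

With a different group size the z value changes, so the sketch only shows the form of the comparison and is not a reanalysis of the data.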

In a final analysis, each subject’s score on 
the initial measure of political attitudes was
correlated with the mean intensity and duration
of the shocks he administered. The coefficients
obtained are shown in Table 3. In
interpreting these scores, it should be remem- 
bered that increasingly lower scores on the 
questionnaire indicate increasingly more radi- 
cal responses. In spite of the fact that only 
relatively extreme and, therefore, homogene- 
ous subjects were selected, individual differ- 
ences in political views are related to behav- 
ior for each type of subject. The more extreme 
a radical subject’s views are, the more likely 
he is to give intense shocks to a conservative.
Also, the more extreme a conservative subject’s
views, the longer is the duration of the
shocks he administers to a radical, although
the correlation in this case is significant at
only the .10 level. All the other scores shown
in Table 3 are small and do not approach
significance. Thus, it seems that with increasing
extremism of political views, one finds increasingly
more punishment displayed toward
another person, if that person expresses dif- 
ferent political views. Also, for radicals, this 
is manifested in the intensity scores, whereas 
for the conservatives, this is shown by dura- 
tion. This suggests differences between them 
in the intentionality and awareness of their 
behavior. 

The postexperimental questionnaire results
were relatively unambiguous. For 5 of the 10
items, no differences were found either as a
function of type of subject or condition. These
items were, in the main, concerned with objective
aspects of the task performed by the
confederate, which was, of course, the same
for all subjects. Significantly large chi-square
coefficients were obtained for the remaining 5
items. For 4 items, the radicals and conservatives
behaved similarly, expressing poorer
opinions about the confederate when his political
ideas were different from their own.
Under these conditions, the confederate was
regarded as less intelligent, as less liked by
other people, as less liked by themselves, and
they indicated experiencing more discomfort
with him. The final item asked for an estimation
of how well the confederate got along
with other people, and for this item, the radicals
and conservatives were significantly different.
The conservatives rated the confederate
as getting along more poorly.


TABLE 2


CORRELATION BETWEEN SHOCK INTENSITIES AND
DURATIONS FOR EACH OF THE FOUR GROUPS


                      Condition
Subjects       Conservative   Radical
Conservative       .19          .25
Radical           -.51         [...]


TABLE 3


CORRELATIONS BETWEEN SCORES ON INITIAL ATTITUDE
QUESTIONNAIRE AND MEAN INTENSITY AND
DURATION OF THE SHOCKS ADMINISTERED


                               Condition
                  Conservative            Radical
Subjects       Intensity  Duration   Intensity  Duration
Conservative      .02       .13         .24       .49*
Radical          [...]     [...]       [...]     [...]

* p < .10
** p < .05


DISCUSSION 


The results of this study make it clear that
political beliefs can influence interpersonal
relationships in several important ways. If an
individual’s political ideas are extreme, he will
be more likely to perceive someone who disagrees
with him as stupid and dislikable, and
he will be more uncomfortable with him. He
will also administer more intense shocks to
such a person, which suggests that this increased
pain is caused quite intentionally. It
seems that a readily verbalizable belief that
those who disagree with us ought to be hurt
prevails, at least among the radicals who give
strikingly intense shocks. Of course, these
were subjects with rather extreme ideas who
encountered another person with similarly extreme
ideas. It remains to be seen whether
comparable findings would be obtained with
more moderate subjects and/or with the target
person taking a more moderate position.
However, when confrontations centered on
political beliefs do occur, we would expect
precisely those people who hold extreme views
to be involved. The results of this study suggest
violent outcomes in such confrontations.
This we have often seen recently, especially in
Italy and France.

With regard to the two types of subjects, 
they were similar in some ways, as the pre- 
ceding discussion indicates, but perhaps the 
differences between them are more interesting. 
Some data suggest that when radical indi- 
viduals treat another person unkindly, they 
do so after a more rational and conscious de- 
cision than is true for conservatives. The 
clearest support for this possibility was the 
finding that with increasing extremism in the 
case of conservatives, one encounters increased 
shock duration, whereas more extreme radicals 
mete out more intense shocks. Also, as may 
be seen in Table 1, when someone of oppos- 
ing views is encountered, the former admin- 
ister extremely long shocks, whereas the latter 
administer very intense ones. The question- 
naire finding that showed conservatives, in 
comparison to radicals, perceive the confed- 
erate as less able to get along with others may 
be consistent with this. It suggests a disposi- 
tion to view others as inadequate and un- 
worthy, which might lead to a lack of care as 
to how others are treated rather than a con- 
scious decision to hurt them. In any case, the 
finding is quite in harmony with prior re- 
search showing differences between radicals 
and conservatives in the perception of others 
(Scodel & Freedman, 1956; Scodel & Mussen, 
1953). 

Some results, which are among the most 
interesting, were completely unexpected. These 
were the duration findings for the radical 
subjects. These subjects actually gave significantly
shorter shocks to the conservative than
to the radical target person. In trying to un- 
derstand these findings, one should note that 
it was in the same condition that the radicals 
administered uniquely intense shocks. The 
mean intensity was 7.0, with 10 being the 
upper limit and with the next highest group 
having an average intensity of only 5.0. It 
was also in this condition that a fairly high 
correlation (and the only negative one for the 
four groups) was found between intensity 
and duration. These results suggest a fairly 
compelling interpretation about the personality
of the radical subjects. When they encounter
the confederate and hear him take a
position on Vietnam and demonstrations that 
they heartily dislike and that they probably 
believe to be immoral, they decide that he 
should be punished and give him intense 
shocks. They, as well as some of the radicals 
interviewed by Keniston (1968), appear to 
believe that it is time to take action. How- 
ever, also consistent with Keniston’s descrip- 
tion, they are basically nonviolent and kindly 
people and so would be expected to feel bad, 
and perhaps guilty, over hurting another per- 
son. They might well administer shorter 
shocks because of these guilt feelings and yet 
not think about being sorry for the conserva- 
tive, whose views are so abhorrent. If one 
then makes the reasonable assumption that 
the more intense the average shock adminis- 
tered, the greater the guilt, the negative cor- 
relation found would be expected. 

The results suggest that the conservative is 
quite a different sort of person. He does not 
seem consciously to decide to inflict pain on 
those who disagree with him, as implied by 
the small difference in intensity across condi- 
tions. However, when he is not fully aware of 
his behavior, he may cause the dissenter a 
good deal of pain, as the duration results in- 
dicate. 

These interpretations are testable and lead 
to different predictions for these two types of 
subjects in a wide variety of situations. Thus, 
if they were at a party and became involved 
in a political disagreement with their host, the
radical might be expected to be insulting and
perhaps to leave the party. Later he might
apologize. The conservative’s actions would be
less overtly dramatic, but he might “accidentally”
break something or burn the rug with
his cigarette.

SOCIAL EVALUATION PROCESSES

A subjective probability model was used to predict the extent to which persons
were liked, and changes in likableness, from beliefs associated with various attri- 
butes of these persons. Each subject (P) was asked to estimate the probability that 
he and another (O) were liked, and also the probability that he and O possessed 
a series of personality attributes. These estimates were obtained both before and 
after P had received false information about O’s belief that he and O were similar 
with respect to a certain characteristic. Likableness was predicted using a model
which assumes that relations among subjective probabilities obey the laws of 
mathematical probability theory. The predictive accuracy of the model was more 
limited than expected. Beliefs associated with favorable attributes generally 
formed more accurate predictors than beliefs associated with unfavorable attributes 
or attributes pertaining to similarities between P and O. Change in P’s belief that O 
liked him was more accurately predicted than was the change in P’s belief that he 
was generally liked by other persons or change in P’s belief that he liked O. Im- 
plications of results for theoretical formulations proposed by McGuire and Wyer 


are discussed. 


In the present study a subjective probability
model proposed by Wyer and Goldberg (1970)
was used to predict how information about oneself
affects liking for the source of the information
and the belief that one is liked by the
source. The effect of such information on the
belief that one is generally liked by others was
also investigated. Although these effects have
been studied previously (Byrne, 1961; Feather,
1967; Harvey, Kelley, & Shapiro, 1957; Heider,
1958; Manis, 1955), there have been few
attempts to predict the magnitude of these
effects and their dependence on the type of
information presented.

Consider the following law of mathematical
probability theory:

$$P_L = P_X P_{L/X} + (1 - P_X) P_{L/X'} \tag{1}$$

where $P_L$ is the probability that a person is
likable; $P_X$ and $P_{X'} = 1 - P_X$ are the probabilities
that he does and does not possess some
attribute X; and $P_{L/X}$ and $P_{L/X'}$ are the conditional
probabilities that he is liked given that
he possesses or does not possess X, respectively.
If information is presented that changes $P_X$,
the corresponding change in $P_L$ can also be
predicted using this equation. Although Equation 1
is a tautology in mathematical probability
theory, its validity is an empirical question
if the components of the equation are the
subjects’ estimates of the likelihood that the
events in question are true.
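
As a minimal sketch of how Equation 1 yields a predicted likableness estimate, and a predicted change when feedback shifts the belief about attribute X, consider the following. The function name and all numeric values are hypothetical illustrations rather than data from the study.

```python
def predicted_likableness(p_x, p_l_given_x, p_l_given_not_x):
    """Equation 1: P_L = P_X * P_L/X + (1 - P_X) * P_L/X'."""
    for value in (p_x, p_l_given_x, p_l_given_not_x):
        if not 0.0 <= value <= 1.0:
            raise ValueError("subjective probabilities must lie in [0, 1]")
    return p_x * p_l_given_x + (1.0 - p_x) * p_l_given_not_x

# Hypothetical estimates for one subject, already expressed as probabilities.
before = predicted_likableness(p_x=0.50, p_l_given_x=0.80, p_l_given_not_x=0.40)
# Feedback is assumed to raise the belief that the attribute is possessed.
after = predicted_likableness(p_x=0.90, p_l_given_x=0.80, p_l_given_not_x=0.40)
predicted_change = after - before
print(f"before = {before:.2f}, after = {after:.2f}, change = {predicted_change:+.2f}")
```

In this sketch only the belief about the attribute changes; if feedback also alters the conditional probabilities, the updated values would simply be passed to the second call.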

If subjective probabilities obey the laws of 
mathematical probability, Equation 1 should 
hold regardless of the object being evaluated. 
Moreover, this equation should hold regardless
of the attribute on which the predicted value
of $P_L$ is based. For example, if information is
received that produces a change in $P_X$, and this
in turn leads to a change in $P_L$, a change should
also occur in one or more of the beliefs associated
with a second attribute Y (i.e., $P_Y$, $P_{L/Y}$,
and $P_{L/Y'}$) which are theoretically related to
$P_L$ according to the equation

$$P_L = P_Y P_{L/Y} + (1 - P_Y) P_{L/Y'}.$$

The present study investigates this possibility.


METHOD 


Forty-eight introductory psychology students par- 
ticipated in the study in randomly assigned pairs. Each 
subject (P) was asked to make judgments about himself 
and the other (O) under one of eight combinations of
test attribute (a characteristic with respect to which P
described himself to O) and feedback (fictitious informa- 
tion from O to the effect that P and O were either 
similar or dissimilar with respect to the test attribute). 
Of these subjects, 7 omitted various items in the 
questionnaire that they were asked to complete, and 
they were eliminated. Three additional subjects were 
eliminated randomly to produce proportional cell fre- 
quencies at each combination of experimental variables 
(test attribute and type of feedback). The final sample 
of 38 subjects included 4 who received positive feedback 
about one of the four test attributes and 4 who received 
negative feedback about this attribute. The remaining 
30 subjects were evenly distributed over the other six 
combinations of test attribute and type of feedback. 


Preparation of Materials


Selection of attributes. Four test attributes were 
selected, which, on the basis of normative data, were 
moderately likely to be possessed by a person (in each
case, $.4 < P_X < .6$) and were relevant to judgments of
how well persons are liked (in each case, $P_{L/X} - P_{L/X'} > .25$).
To increase the credibility of the fictitious feedback,
each of the four attributes pertained to the similarity
of persons with respect to a certain characteristic.
They were: similar in attitude toward communism, 
similar in social and cultural background, similar in 
recreational interests, and similar in attitude toward 
morality. 

In addition to the above 4 test attributes, 16 others
were selected. Six of these pertained to similarities between
persons with respect to certain attitudes and
beliefs. Of the remaining 10, 6 were favorable ($P_{L/X} > P_{L/X'}$),
and 4 were unfavorable ($P_{L/X} < P_{L/X'}$).

Preparation of questionnaires. The questionnaire
administered to subjects was in four parts. In the first
section, P estimated the likelihood that he would like a
person who possessed each of the 20 attributes selected
for study ($P_{L/X}$) and also the likelihood that he would
like a person who did not possess each attribute ($P_{L/X'}$).
Attributes were arranged in random order. These
estimates, as well as those obtained in other sections of 
the questionnaire, were made by placing a check along 
a 21-point scale from 0 (extremely unlikely) to 100 
(extremely likely). 

In the second section, P estimated how likely it was
that O would like him ($P_{L_{OP}}$). Then, he estimated how
likely it was that O would describe him as possessing
each attribute ($P_{X_{OP}}$).

In the third section, P estimated how likely it was
that he would like O ($P_{L_{PO}}$) and then estimated how
likely it was that O possessed each attribute ($P_{X_{PO}}$).

In the final portion of the questionnaire, P estimated
how likely it was that he was generally liked by other
persons ($P_L$) and also how likely it was that he would
describe himself as possessing each attribute ($P_X$). In
this section, attributes pertaining to similarities referred 
to P’s similarity to other persons generally rather than 
only to O. 

The questionnaire was identical for all subjects and 
was the same before and after the administration of 
feedback.

Procedure


Subjects were told that the purpose of the study was 
to find out how persons form impressions of others on 
the basis of small amounts of information that they 
receive about these persons. It was stated that charac- 
teristics of persons could be inferred both from the 
things they say about themselves and from these 
persons’ reactions to things others have said. 

Subjects were then asked to introduce themselves to 
one another and to indicate their year of school and 
major. Then, each was asked to describe verbally to 
the other the characteristic to which one of the test 
attributes pertained (e.g., his attitude toward com- 
munism). No two subjects in the same pair were asked 
to discuss the same characteristic. To increase the 
quality of their oral presentations, the subjects were 
given 3 minutes to write down the points they felt were 
most important in informing others about the characteristics
that they were asked to discuss and to consider
how best to present them.

After finishing their presentations, which were
typically 1-3 minutes long, the subjects were given the 
questionnaire with appropriate instructions for com- 
pleting it. 

Administration of feedback. Subjects were then asked 
to work individually on a short task which was unrelated
to the major portion of the study. During this period, 
the experimenter prepared fictitious feedback for each 
P concerning O’s estimate of how similar they were to 
one another with respect to the characteristic P had 
described in himself. Under positive feedback conditions, 
P received information that O’s estimate of the likeli- 
hood that they were similar was 95 along the 100-point 
scale used for recording judgments. Under negative feed- 
back conditions, P received information that O's 
estimate of the likelihood that they were similar was 
5 along the scale. Feedback information was recorded 
on a special form on which all 20 attributes were listed, 
and blank spaces were provided for recording “self- 
ratings,” “‘other’s ratings,” and “predictions of other’s 
ratings” pertaining to each attribute. This fairly complicated
form was used to suggest that the feedback
given to P was only a part of all of the questionnaire
data to be compared.

P was administered the feedback described above
with instructions that the new information was being
given him to determine whether it would increase his
accuracy in predicting the other ratings made by O.
Since the study was primarily concerned with quantitative
predictions of the amount of change, this implicit
“demand” to change judgments was not expected to
create severe difficulties for the interpretation of the
results obtained. After receiving feedback, the subjects 
completed the questionnaire a second time. 

After the experiment, each subject was informed that 
the feedback he had received did not necessarily reflect 
the other’s true rating of him. 

To prepare data for analysis, the subjects’ question- 
naire ratings were divided by 100 to transform them to 
units of subjective probability.

RESULTS

Effectiveness of Feedback


Feedback had the expected effects. P’s
ratings that O judged him to have the test
attribute increased from .63 to .88 under
positive feedback and decreased from .49 to .31
under negative feedback. Similar results were
obtained for P’s ratings that he actually
possessed the attribute and for his ratings that
O possessed the attribute. The increases were
from .63 to .80 and from .63 to .88, respectively;
the decreases were from .53 to .34 and from .50
to .38, respectively. All changes were highly
significant (p < .001), but they did not significantly
depend on the particular test attribute
involved (in each case, F < 2.00, df = 3/30,
p > .10).
Effects on conditional probabilities. Analysis
of $P_{L/X}$ produced a significant interaction of
type of feedback and pre-post (F = 6.32,
df = 1/30, p < .01); $P_{L/X}$ increased from .618
to .763 under positive feedback conditions but
decreased from .597 to .526 under negative
feedback conditions. The corresponding interaction
for $P_{L/X'}$ was not significant (F = .07,
df = 1/30).


Prediction of Likableness Estimates 


Three primary dependent variables were
considered: P’s estimate of the probability
that O would like him ($P_{L_{OP}}$), his estimate of
the probability that he would like O ($P_{L_{PO}}$),
and his estimate of the probability that he is
generally liked by other persons ($P_L$). Predicted
values of these subjective probabilities both
before and after feedback were estimated for
each subject separately from equations similar
in form to Equation 1. For example, $P_{L_{OP}}$ was
predicted from the equation

$$P_{L_{OP}} = P_{X_{OP}} P_{L/X} + (1 - P_{X_{OP}}) P_{L/X'}.$$

Predictions of $P_{L_{PO}}$ and $P_L$ were obtained
similarly using estimates of $P_{X_{PO}}$ and $P_X$,
respectively.

The mean predicted and mean obtained
values of $P_{L_{OP}}$, $P_{L_{PO}}$, and $P_L$ under each combination
of type of feedback and test attribute
are compared graphically in Figures 1a-1c.
The standard errors of the differences between
mean predicted and mean obtained values of
these probabilities, based upon the data in
Figure 1, were .122, .132, and .165, respectively;
the correlations between predicted and obtained
estimates of these quantities were .529,
.615, and -.001, respectively. Predicted
values were generally underestimates of obtained
values. P’s estimates of the probability
that he was liked by O, and the probability 
that he liked O, were both more accurately 
predicted than his estimates of the probability 
that he was generally liked by others. 

Change in likableness estimates. Plots of obtained
versus predicted changes in likableness
under each combination of feedback and test
attribute conditions are shown in Figures 1d-1f
for each dependent variable. The standard
errors of estimating changes in $P_{L_{OP}}$, $P_{L_{PO}}$,
and $P_L$ from predicted values were .079, .117,
and .143, respectively; correlations between
mean predicted and mean obtained changes in
these variables over the eight combinations of
feedback and test attribute were .845, .797,
and .463. The model thus predicts changes in
$P_{L_{OP}}$ and $P_{L_{PO}}$ fairly accurately, but is
unsuccessful in predicting changes in $P_L$.
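
The two accuracy indices used above, a correlation between predicted and obtained values and a typical size of their discrepancy, can be sketched as follows. The exact formula behind the reported standard errors is not given in the text, so the root-mean-square difference used here is an assumed stand-in, and the eight cell means are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def rms_difference(predicted, obtained):
    """Root-mean-square discrepancy between predicted and obtained values."""
    squared = [(p - o) ** 2 for p, o in zip(predicted, obtained)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical mean changes for the eight feedback-by-attribute combinations.
predicted_changes = [-0.05, -0.06, -0.04, -0.07, 0.20, 0.18, 0.21, 0.19]
obtained_changes = [-0.04, -0.05, -0.06, -0.03, 0.17, 0.15, 0.19, 0.16]

r = pearson_r(predicted_changes, obtained_changes)
rms = rms_difference(predicted_changes, obtained_changes)
print(f"r = {r:.3f}, RMS difference = {rms:.3f}")
```

A high correlation together with a large discrepancy would indicate that the model tracks the ordering of conditions while misjudging the size of the changes, which is why both indices are reported.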


Supplementary Analyses 


Effects of type of attribute on predictive 
accuracy. To determine whether the predictive 
accuracy of Equation 1 depended on the type 
of attribute on which the predicted value was 
based, the 16 attributes that were not used as 
test attributes were first divided into three 
groups: favorable, unfavorable, and those per- 
taining to similarities. The mean predicted 
value of each dependent variable, based on 
each type of attribute, was calculated for each 
subject both before and after feedback. An 
analysis of variance was then performed on 
data pertaining to each dependent variable 
separately as a function of two between-subject 
variables—test attribute and type of feedback 
—and two within-subject variables—pre-post 
and a “dummy” variable with five levels: the 
obtained value of the likableness variable 
being considered and four predicted values 
based on the test attribute alone, favorable 
attributes, unfavorable attributes, and attri- 
butes pertaining to similarities. If the proposed 
model is a generally accurate description of 
social evaluation processes, relations involving 
this latter variable should be nonsignificant.

TABLE 1

MEAN OBTAINED AND PREDICTED ESTIMATES OF LIKABLENESS BEFORE AND AFTER FEEDBACK
AS A FUNCTION OF TYPE OF FEEDBACK

                                            Negative feedback       Positive feedback
Item                                      Before  After  Change   Before  After  Change     M

$P_{L_{OP}}$
  Obtained value                           .616   .568   -.048     .639   .811   +.172    .659
  Predicted value based on
    Test attribute                         .583   .530   -.053     .547   .742   +.195    .600
    Favorable attributes                   .684   .642   -.042     .661   .713   +.052    .675
    Unfavorable attributes                 .544   .499   -.045     .471   .546   +.075    .515
    Attributes pertaining to similarities  .552   .543   -.009     .517   .559   +.042    .543
$P_{L_{PO}}$
  Obtained value                           .668   .632   -.036     .737   .813   +.076    .712
  Predicted value based on
    Test attribute                         .592   .541   -.051     .552   .742   +.190    .607
    Favorable attributes                   .694   .636   -.058     .697   .728   +.031    .689
    Unfavorable attributes                 .519   .509   -.010     .516   .531   +.015    .519
    Attributes pertaining to similarities  .551   .542   -.009     .516   .555   +.039    .541
$P_L$
  Obtained value                           .684   .674   -.010     .745   .753   +.008    .714
  Predicted value based on
    Test attribute                         .579   .528   -.051     .550   .737   +.187    .598
    Favorable attributes                   .720   .679   -.041     .709   .742   +.033    .712
    Unfavorable attributes                 .520   .541   +.021     .514   .599   +.085    .544
    Attributes pertaining to similarities  .554   .549   -.005     .517   .557   +.040    .544

Mean predicted and obtained values of each
likableness variable both before and after
feedback are shown in Table 1. Analyses of
variance of each set of data yielded a significant
main effect of obtained-predicted (F > 17.45,
df = 4/120, p < .001) and a significant Obtained-Predicted
× Type of Feedback × Pre-Post
interaction (F > 2.76, df = 4/120,
p < .05). Additional analyses in which obtained
values were omitted indicated in each case that
the four predicted values differed from one
another (F > 17.82, df = 3/90, p < .001), and
that these differences were contingent on type
of feedback and pre-post (F > 3.36, df = 3/90,
p < .05). These results are contrary to predictions
based on the proposed model and call
into question its generality. Predictions based 
on favorable attributes were generally more 
accurate than predictions based either on un- 
favorable attributes or attributes, such as the 
test attribute, which pertained to similarities 
between P and O. Predicted changes in likable- 
ness based on test attributes were generally 
greater than those based on attributes to which 
feedback did not pertain, and they were also 
greater than the changes that actually occurred.
Changes in $P_{L_{OP}}$ were predicted more
accurately by estimates based on test attributes
than by estimates based on other
attributes. However, changes in $P_{L_{PO}}$ and $P_L$
were generally less accurately predicted by
estimates based on test attributes, particularly
under positive feedback conditions.

Effects of feedback on beliefs about other 
attributes. It was hypothesized that feedback 
about test attributes would affect not only 
likableness estimates but also beliefs pertaining 
to other attributes of the objects being rated, 
that is, P_XOP, P_XPO, P_X, P_L/X, and P_L/X'. To
explore this possibility, separate analyses of 


variance were performed on each of these 
probabilities as a function of type of feedback, 
pre-post, and type of attribute (test, favorable, 
unfavorable, or one pertaining to a similarity 
between P and O).

Analyses of P_XOP, P_XPO, and P_X each
yielded several significant effects. The Type of 
Feedback X Pre-Post X Type of Attribute in- 
teraction, which was significant in each analysis 
(F > 7.13, df = 3/90, p < .01), indicated that 
beliefs about attributes to which feedback did 
not directly pertain changed less than beliefs 
about the test attribute (see Table 2). To 
determine whether changes in estimates per- 
taining to attributes other than the test 
attribute were reliable, each variable was re- 
analyzed omitting data pertaining to the test 
attribute. Tests of relevant main effects and 
interactions indicated that feedback signifi- 
cantly affected P’s beliefs that O judged him 
to have attributes other than the test attribute 
(P_XOP), but that feedback did not reliably
affect P’s beliefs that he (P) actually possessed
these attributes (P_X) or P’s beliefs that O had
them (P_XPO). In no case did these effects de-
pend on the type of attribute involved.

Analyses of P_L/X yielded a main effect of
attribute type (F = 80.08, df = 3/90, p < .001) 
TABLE 2

MEAN ESTIMATES OF THE LIKELIHOOD OF POSSESSING DIFFERENT TYPES OF ATTRIBUTES, AND CONDITIONAL
PROBABILITIES OF BEING LIKED GIVEN THESE ATTRIBUTES, AS A FUNCTION OF
TYPE OF FEEDBACK

                                            Negative feedback        Positive feedback
Item                                        Before  After  Change    Before  After  Change
P_XOP
  Test attribute                             .492    .308   -.184     .034    .882   +.144
  Favorable attributes                       .721    .689   -.033     .703    .776   +.073
  Unfavorable attributes                     .426    .443   +.017     .487    .503   +.016
  Attributes pertaining to similarities      .495    .463   -.032     .541    .632   +.091
P_XPO
  Test attribute                             .505    .379   -.126     .634    .882   +.144
  Favorable attributes                       .756    .694   -.062     .761    .798   +.037
  Unfavorable attributes                     .427    .422   -.005     .470    .477   +.007
  Attributes pertaining to similarities      .490    .479   -.011     .555    .605   +.050
P_X
  Test attribute                             .532    .345   -.187     .629    .803
  Favorable attributes                       .179    .742   -.057     .153    .806
  Unfavorable attributes                     .301    .301    .000     .466    .426
  Attributes pertaining to similarities      .502    .481   -.021     .547    .565
P_L/X
  Test attribute                             .597    .526   -.071     .618    .763
  Favorable attributes                       .845    .797   -.048     .832    .822
  Unfavorable attributes                     .276    .290   +.014     .266    .365
  Attributes pertaining to similarities      .567    .538   -.029     .549    .582
P_L/X'
  Test attribute                             .479    .505   +.026     .437    .468
  Favorable attributes                       .214    .240   +.026     .240    .259
  Unfavorable attributes                     .656    .621   -.035     .672    .650
  Attributes pertaining to similarities      .483    .499   +.016     .471    .450


and a Type of Feedback X Pre-Post interaction
(F = 5.65, df = 1/30, p < .05). P_L/X increased
following positive feedback and decreased
following negative feedback; however, the
magnitude of these changes did not signifi-
cantly depend on the type of attribute in-
volved. Analysis of P_L/X' yielded only a main
effect of attribute type (F = 68.68, df = 3/90,
p < .001); no main effects or interactions
involving pre-post or type of feedback were
significant.


Discussion 


The accuracy of the model proposed by Wyer 
and Goldberg (1970) in describing social 
evaluation processes was somewhat disap- 
pointing. If the model is at all applicable to 
social evaluation processes, the conditions 
under which it is accurate appear to be much 
more limited than had been expected. 

Predictions of likableness were generally 
more accurate when based on beliefs associated 
with favorable attributes than when based on 
beliefs about unfavorable attributes or beliefs
about similarities between P and O. Why this
should be true is unclear. Data in Table 2 show 
that the value of the first component of 
Equation 1 (P_X P_L/X) is relatively large when
X is favorable, while the value of the second
component (P_X' P_L/X') is relatively large when
X is unfavorable. Perhaps the latter component 
is not as influential as Equation 1 implies. Also, 
note that the objects of evaluation in this study 
were generally well liked (see Table 1); possi- 
bly, predictions based on favorable attributes 
are accurate when the object being evaluated 
is fairly likable, while predictions based on 
unfavorable attributes are more accurate when 
the object is relatively disliked. These issues 
warrant further investigation. 
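
The relative size of the two components can be illustrated with a short numerical sketch. It assumes that Equation 1 has the form P_L = P_X P_L/X + P_X' P_L/X' and that P_X' = 1 - P_X; the full equation is not reproduced in this section, so both are assumptions, as are the function name and the input values, which are hypothetical and merely of the same order as the means in Table 2.

```python
def likableness_components(p_x, p_l_given_x, p_l_given_not_x):
    """Return (first component, second component, predicted P_L).

    Assumed form of Equation 1: P_L = P_X * P_L/X + P_X' * P_L/X',
    with P_X' taken to be 1 - P_X (an assumption of this sketch).
    """
    p_not_x = 1.0 - p_x
    first = p_x * p_l_given_x            # weight carried by "has attribute X"
    second = p_not_x * p_l_given_not_x   # weight carried by "lacks attribute X"
    return first, second, first + second


if __name__ == "__main__":
    # Hypothetical favorable attribute: likely possessed, liked if possessed.
    print(likableness_components(0.80, 0.85, 0.20))
    # Hypothetical unfavorable attribute: unlikely possessed, liked if absent.
    print(likableness_components(0.30, 0.28, 0.66))
```

With inputs of this kind the first component dominates for the favorable attribute and the second dominates for the unfavorable attribute, which is the pattern the paragraph above describes.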

P’s belief that he was generally liked by 
others (P_L) was unaffected by feedback in-
formation even though he significantly changed
his belief that he possessed the test attribute 
to which the feedback pertained. McGuire 
(1968) has suggested that beliefs which are 
remotely associated with those to which new 
information directly pertains may be affected 
only after sufficient time has elapsed for the
information and its implications to be com-
pletely assimilated. This time lag may be most
likely to occur when the beliefs affected are
firmly rooted in a subject’s cognitive system;
newly formed beliefs that are based primarily
on information received in the experiment (e.g.,
P_LOP and P_LPO) may be relatively less stable
and thus more likely to be immediately affected
by new information, as results indicate. This
interpretation is consistent with previous
research that also showed the general model to
be more accurate in predicting changes in
beliefs formed during the experimental situa-
tion (Wyer, 1970) than in predicting changes
in more firmly established beliefs (Wyer &
Goldberg, 1970). On the other hand, the feed-
back administered to subjects may have
created implicit experimenter demands to
change ratings of P_LOP and P_LPO, but it did not
create the impression that the subjects were
expected to change how well they were
generally liked. The validity of these alterna-
tive interpretations cannot be determined
from the present study.

The general formulation under consideration 
implies 
attribute should affect beliefs that the object 
of evaluation possesses attributes other than 
the test attribute. Moreover, the magnitude of 
these effects should generally increase with the 
magnitude of the change in likableness. Data 
in Table 2 indicate that changes in beliefs re- 


relatively more pronounced when changes in
likableness were large. These data are therefore 


The results of both this study and previous 
research (Wyer, 1970) suggest that beliefs in
the first premise of the syllogism “A,” “if A,
then B,” “B” appear to affect beliefs in the
second premise (P_B/A) as well as beliefs in the
conclusion (P_B). This does not necessarily
follow from the assumption that persons
process information according to certain rules
of logical inference, since the two premises in
question are logically independent. However,
if the above equation describes a state of 
consistency among the several beliefs involved, 
and if cognitive inconsistency produced by 
change in one belief creates pressure to reduce 


the inconsistency by changing other beliefs 
related to it, the above phenomenon might be 
expected. This latter assumption is of course 
not novel (cf. Abelson, Aronson, McGuire, 
Newcomb, Rosenberg, & Tannenbaum, 1968), 
and was made by McGuire (1960) in developing 
a similar probabilistic model of cognitive 
organization. 

Despite the present limitations on accuracy 
of the proposed model, its applicability to social 
evaluation processes deserves further investi-
gation, particularly in light of the poor success
of cognitive balance formulations (cf. Wyer 
& Lyon, 1970; Zajonc, 1968). The general 
approach potentially enables finer distinctions 
to be made between the effects of different 
types of information about persons upon evalu- 
ations of these persons than do balance formu- 
lations which consider only the affective quality 
of the information. These potentialities should 
be explored further.
predicted by fellow students. The subjects themselves underestimated the shock
levels, suggesting a possible guilt-reduction mechanism. One implication is that
our society tends to produce individuals who act contrary to their conscience
when faced with even mildly compelling situations.


Milgram (1963, 1964a, 1964b, 1965a, 1965b, 1967), in a series of articles, reported
the results of a comprehensive laboratory

tions will Y carry out the command of X, and under what conditions will he
refuse? The

of a learner in a learning experiment, but who was actually a confederate of
the experimenter.

Milgram (1964b) also investigated the effect of group pressure and shock
levels. The results showed that when the subject participates with two
confederates, the mean shock

Social structure is most potent in producing obedience. He wrote merely that
“The person brings to the laboratory enduring dispositions toward authority
and aggression, and at the same time he becomes enmeshed in a social structure
that is no less an objective fact of the case [p. 274],” and suggested that a
full understanding of the subject’s action must take into account both
perspectives.

one study and found that the F scale effectively predicted obedience to
commands. Mil-

A related technique also using a shock generator was developed to study
aggression in the laboratory (Buss, 1961). Using this “aggression machine,”
Buss found that men were more aggressive than women and that men used higher
shock levels with a male “learner” when compared to a female learner. For

extensively, employing a social-psychological analysis emphasizing both
situational conditions and the development of aggressive personalities.
Bandura, Ross, and Ross (1961) indicated the importance of imitation of models
in the transmission of aggression, and Rosenhan and White (1967) noted that
rehearsal and observation of a model’s behavior

fore readily be employed to study aggression in the laboratories and the
effect of sex differences and modeling on laboratory aggression.

1 This research was made possible through a grant from the Oregon State
University Research Council to the first author.

2 Requests for reprints should be sent to Knud S. Larsen, Department of
Psychology, Oregon State University, Corvallis, Oregon 97331.

The present investigators desired to study 
the willingness of subjects to administer pun- 
ishment as a function of both personality and 
situational variables. In some ways, we have 
tried to replicate Milgram’s work; in other
cases, to innovate other approaches. In all 
situations, however, there are differences in 
apparatus, procedure, and instructions; there- 
fore, any comparisons to Milgram’s work 
must be guarded. One major difference was 
that in no case did the experimenter command 
the subject to administer the punishment; 
rather than obedience to specific commands, 
the experimenters permitted the situations to 
exert their influence, and, consequently, the 
essential variable studied was situational con- 
formity. That is not to say that the experi- 
menter did not have influence, but an effort 
was made to minimize his influence to simply 
explaining the situation, after which he re- 
moved himself from the room until the experi- 
ment was completed. In this respect, our work 
follows Milgram’s (1964a) control procedure 
or the aggression paradigms of Buss (1961). 


METHOD 
Subjects 


The original pool of subjects consisted of all stu- 
dents enrolled in introductory psychology courses 
during the winter term 1970, at Oregon State Uni- 
versity. One hundred and thirty-three males and 214 
females completed the personality measures (de- 
scribed below), and 213 subjects participated in the 
various conditions of the experiment. One hundred 
and twenty-four subjects completed both the person- 
ality measures and the experiment. 


Apparatus 


Two rooms secured for the experiment were sepa- 
rated by about 20 feet. Since no form of communi- 
cation was possible between the subject and the 
learner, the setup approximated Milgram’s remote 
condition. In one room, the shock apparatus was 
placed on a table. It consisted of a gray rectangular 
box, about 8 inches high and 12 inches wide. In
the upper right-hand corner, there was an orange 
light labeled “Right,” and in the upper left-hand 
corner, there was an orange light labeled “Wrong.” 
A meter which indicated with a needle the amount 
of voltage used was located on the left side. Below 
the meter was a switch, which the subjects used to
control the length of shock by holding it down. A 
dial was located on the right side of the machine 
and was labeled from 0 to 390 volts. The 0-120
range was labeled in green as “sii

121-240 range was labeled in yellow ers
and the 240-390 range was labeled in » ri 
severe shock.” It was possible for 

select any voltage level, although eve: 

level was clearly marked, The ot! sora 
equipped with an “electric chair,” consis! i 
electrodes and a machine which comms) 
learner’s response to the subjects. ©
chine (unknown to the subjects) fed, the 
subjects’ choice of voltage levels. 


Procedure 


Control condition (n= 78). Wh: 
rived at the appointed time, he w 
he had heard anything about the 
he were familiar with the experi! 
None of the subjects admitted any ) 
study. The following instructions 
orally: 


ibject ar 


vhether 


Very little is known about the effects of punish-
ment upon learning. What we have med here is
an experiment to investigate the problem. It is a
teacher-learner experiment involving two people:
you as the teacher and a learner who is a volun-
teer and who is being paid for bi The
learner, who I will introduce to you in a moment,
is asked a series of 60 paired-comparison ques-
tions. Whether his answers are right or wrong are
indicated to you by the appropriate light flashing
on this machine [the experimenter points to the
machine]. Your job, then, is to shock the learner
after each wrong answer. There is no preferred
way for you to respond. One major question is to
discover which voltage levels have the most posi-
tive effects on the learning process. You should
therefore feel free to experiment with various
voltage levels. In a nutshell, be the most effective
teacher you can be, using shock to punish the
learner for wrong answers.



The experimenter then led the subject to the second
room and introduced him to the learner (stooge)
who was wired to the electric chair. The subject was 
told that the reason for this introduction to the 
apparatus was to “familiarize you with the equip- 
ment, the learner, and the general procedure.” The 
experimenter then accompanied the subject back to 
the original room where he asked the subject to take 
a seat at the shock equipment. The experimenter 
explained the apparatus to the subject as follows: 


The voltage dial goes from 0 to 390 and pro- 
gresses from slight, to moderate, to severe shock. 
Although the shocks can be extremely painful, the
learner does not suffer any permanent damage.
Are there any questions? If not, I will go down
to the other room with the learner and will flash 
the panel of lights to signal to you that we are 
ready to begin. Remember: watch the panel, and 
administer shock only after the wrong answers, 
deciding as you go the amount of voltage for each 
trial. 


Following the 60 trials, the subjects were asked to
fill out a short survey. This survey form included 
questions which asked the subject the highest volt- 
age with which he shocked the learner and the 
amount of pain that this shock elicited on a 10-point 
scale. They were then thanked for their cooperation 
and told that they were welcome to receive the re- 
sults upon completion of the study. They were asked 
not to discuss the experiment with anyone until all 
experimenting was completed.

After finishing the experiment, the subjects were
debriefed during regular class sessions. Verbal com-
ments made at that time indicated that the subjects
who participated accepted the rationale presented by
the experimenter. Three potential subjects refused to 
participate, and two were skeptical of the rationale. 
These latter two did not complete the experiment. 
All subjects then responded to a nine-item survey 
form listing various reasons for participating in the 
study, and they were asked to check which reasons 
applied to them.


Model condition (n=79). Similar instructions 
were given in the modeling condition, but the pro- 
cedure differed. When the subject arrived at the 
appointed time, he was told that the experiment was 
running slightly behind schedule. In order to save 
time in becoming familiar with the equipment, he 
was asked to enter the experimental room and 
watch the current “teacher” (confederate of the 
experimenter) finish his series of shock treatments. 
The teacher was actually a confederate who, when 
the subjects entered the room, was shocking at 
moderate levels, but after a series of 10 trials, ended 
at the 390 maximum voltage level. It was reasoned 
that the subjects would find it easier to administer 
punishment (find it more legitimate) if someone first 
modeled such behavior. 


Conformity condition (n=20). The conformity 
situation was analogous to Milgram’s (1964a) group 
experiment, where a team of three persons decided in 
common the shock level that should be adminis- 
tered to the learner. Actually, two members of the 
team were confederates of the experimenter. These 
confederates placed themselves on either side of the 
subject in such a manner that the subject always 
ended up controlling the shock equipment. The sub-
jects were told that recent experiments in psychology
have indicated the superiority of group over indi- 
vidual decisions and that we wanted to investigate 
whether the group was also a better teacher in the 
context of the teacher-learner experiment.

During the course of the experiment, one confed- 
erate always asked for a voltage level 10 voltage 
points higher than the last administered shock level,
whereas the other confederate went 20 voltage
points ahead of the previous group decision. On each 
trial, the average of the pooled individual decisions 
was used as the shock level. If the subject remained 
at the same shock level throughout the trials, the 
same voltage (high or low) would be administered 
on each trial. The subject administered all of the 
shocks. 
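
The pooling rule described above can be sketched as a small simulation. This is one reading of the procedure, not the authors' protocol: it assumes that both confederates take the most recently administered level as their reference, that the first trial starts from 0 volts, and that the pooled level is capped at the 390-volt maximum of the dial. All function and variable names are hypothetical.

```python
MAX_VOLTS = 390  # top of the dial described in the Apparatus section


def group_shock_level(previous_level, subject_choice):
    """Average the three requests for one trial (assumed pooling rule)."""
    confederate_a = previous_level + 10  # asks for 10 points above the last level
    confederate_b = previous_level + 20  # asks for 20 points above the last level
    pooled = (subject_choice + confederate_a + confederate_b) / 3
    return min(pooled, MAX_VOLTS)


def run_trials(subject_choices, start_level=0):
    """Return the administered level for each of the subject's choices."""
    levels = []
    previous = start_level
    for choice in subject_choices:
        previous = group_shock_level(previous, choice)
        levels.append(previous)
    return levels


if __name__ == "__main__":
    # A subject who always asks for 50 volts is still pulled upward.
    print(run_trials([50] * 5))
```

Under these assumptions, a subject who simply matches the previous level is carried upward by 10 volts per trial, so the confederates exert steady upward pressure without any explicit command.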
Female learner condition (n= 27). In this condi-
tion, a female confederate played the role of the
learner, and the subjects were asked to administer 
shock to her. The instructions and procedure were 
identical to those given in the control condition. 
Since it is generally considered wrong in our culture 
to inflict pain on a female, it was expected that all 
subjects would shock the female learner less when 
compared to the control condition (in which males 
were shocked). 

High model condition (n= 9). In the high model
condition, the experimenter left the shock equipment
on 350 volts, rather than turning it back to 0 volts
as was done at the beginning of each session in the 
control condition. We wanted to investigate the rela- 
tionship between the “suggestion” that the previous 
subject shocked up to 350 volts and the willingness 
to shock more intensely when compared to the con-
trol condition.

Learner response. The responses that appeared on 
the shocking apparatus were 50% random right and 
50% random wrong, as taken from a table of ran- 
dom numbers. Thus, the subject could not ration- 
alize either improvement or decreased learning by 
the learner as a result of the shock administration. 

Number of trials. In all conditions, 60 trials (30
right, 30 wrong) were administered (unless the sub- 
ject reached 390 volts before the end of the trials, at 
which point the experiment was terminated). 
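
A response schedule of this kind can be sketched as follows. The original schedule was taken from a table of random numbers; the sketch substitutes a seeded pseudo-random shuffle and enforces the 30 right / 30 wrong split stated above. The function name and the seed are hypothetical.

```python
import random


def learner_schedule(n_trials=60, seed=None):
    """Return a shuffled list with equal numbers of 'right' and 'wrong'."""
    if n_trials % 2 != 0:
        raise ValueError("n_trials must be even for a 50/50 split")
    half = n_trials // 2
    outcomes = ["right"] * half + ["wrong"] * half
    # A private Random instance keeps the schedule reproducible per seed.
    random.Random(seed).shuffle(outcomes)
    return outcomes


if __name__ == "__main__":
    schedule = learner_schedule(seed=1970)
    print(schedule[:10], schedule.count("wrong"))
```

Because the order is fixed in advance and unrelated to the shocks, no voltage strategy can appear to improve or impair the learner's performance.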

Personality variables. The following five indexes 
were thought important in measuring personality 
dynamics relevant to aggression. (a) The Aggression 
scale: The Aggression scale was a Thurstone scale 
developed by Larsen (1970); it consists of 20 items 
selected on the basis of usual Thurstone scale cri- 
terion of low Q values and covering the 11-step 
range. The Thurstone scale related inversely to 
worldly-mindedness and severity of parental aggres- 
sion training. (b) Attributed Power scale: In the 
Attributed Power index, the subjects were asked to 
rate five power relationships (e.g., policeman-citizen) 
on a series of dichotomous power-laden adjectives. 
The scale shows high internal consistency and relates 
positively to dogmatism, chauvinistic nationalism, 
low self-esteem, and competitive behavior in non- 
zero-sum games (Larsen & Minton, 1971). On the 
basis of these relationships, it was thought that at- 
tributed power would relate to the willingness to 
administer intense shock. (c) Buss Hostility Scale: 
Buss and Durkee (1957) devised an inventory for 
assessing different kinds of hostility. Three subscales 
(Assault, Indirect Hostility, and Irritability), com- 
prising 30 out of a total inventory of 74 items, were 
used in this study. (d) Parental Aggression Training
(PAT) Scale: The PAT scale consists of a group of 
26 questions measuring perception of parental ag- 
gression training and was developed by Larsen and 
Schwendiman (1970). The first 13 items reflected 
mother aggression training, and the other 13 items 
reflected father aggression training. The PAT scale 
has five response alternatives: No Action, Mild 
Reprimand, Harsh Words, Mild Spanking, and Severe 
Punishment. The test-retest reliability is .89. The
PAT scale was considered relevant because severity
of aggression training has been related in several
studies to aggressive behavior (Larsen & Schwendi-
man, 1970). (e) Machiavellianism: The content
of the Mach IV scale used in this study dealt with
the cynical manipulation of people and consisted of
20 items (Christie & Geis, 1969). The Machiavel-
lianism scale was considered relevant because shock-
ing a person intensely implies a cynical disregard for
his welfare. 

Statistical analysis. Pearson product-moment cor-
relations were calculated between the various per-
sonality measures to determine any interrelationships
and between the personality measures and (a) total
voltage administered, (b) total time shocked, (c)
maximum voltage, and (d) the subjective estimate
of the pain administered to the subject. The numbers
of subjects who both filled out the personality mea-
sures and participated in the experiment were, re-
spectively, control (n = 58), conformity (n = 13), and
model (n = 53). In the female learner and high model
conditions, the number of subjects who overlapped
in both the personality administration and experi-
ment was too small to warrant the calculation of
correlation coefficients. In addition, a set of t tests
was carried out between the four experimental con-
ditions and the control condition, for males and
females separately, analyzing for differences in total
voltage administered, total time shocked, maximum
voltage (actual), and maximum voltage (guess).
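
The two statistics named above can be sketched in a few lines. The text does not say whether the t tests pooled the variances or which group was subtracted from which; the sketch assumes a pooled-variance independent-samples t and computes control minus experimental, so that a higher experimental mean yields a negative t. The data and all names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def pooled_t(control, experimental):
    """Independent-samples t (pooled variance), control minus experimental."""
    n1, n2 = len(control), len(experimental)
    m1, m2 = sum(control) / n1, sum(experimental) / n2
    ss1 = sum((v - m1) ** 2 for v in control)
    ss2 = sum((v - m2) ** 2 for v in experimental)
    pooled_var = (ss1 + ss2) / (n1 + n2 - 2)
    return (m1 - m2) / math.sqrt(pooled_var * (1 / n1 + 1 / n2))


if __name__ == "__main__":
    # Hypothetical scale scores and maximum voltages for five subjects.
    scores = [3.1, 4.0, 5.2, 6.3, 7.0]
    volts = [120, 150, 140, 210, 230]
    print(round(pearson_r(scores, volts), 2))
    # Hypothetical maximum voltages in a control and an experimental group.
    print(round(pooled_t([120, 150, 140, 160], [210, 230, 190, 250]), 2))
```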


RESULTS AND DISCUSSION


Table 1 shows the relationships between the 
personality assessments, total voltage admin- 
istered, total time shocked, maximum voltage 
(actual), and maximum voltage (guess) for
the control, conformity, and model conditions. 


3 The intercorrelation between the several person-
ality assessments employed in the study indicated
that with exception of the relationship between per- 
ceived mother and father aggression training, none 
of the correlations were large. Reliable relationships 
were found between aggression and Machiavellian-
ism, that is, those subjects who scored low on the
aggression scale (and therefore tend to be more
aggressive) showed a tendency toward high
Machiavellianism. High attributed power also re-
lated to severe punitiveness by mothers and to high
Machiavellianism. High hostility related to low per- 
ceived mother and father aggression training and 
low Machiavellianism. In short, the expected rela- 
tionships were found between the aggression and 
Machiavellianism scales, between attributed power, 
and mother aggression training and Machiavellian- 
ism, respectively. The correlation between the Buss 
Hostility scale, mother aggression training, father 
aggression training, and Machiavellianism were, how- 
ever, in the opposite direction of that expected. It is 
clear that the measures are relatively independent 
and reflect evidently different constructs and di- 
versified measurement procedures. 
The overall conclusion is that there is little
relationship between these assessments and 
the behavior of shocking a subject. One reason 
may be that these indexes were not relevant 
to behavior in the experiment. In other words,
the personality assessments may overlap only
minimally with the somewhat contrived labo-
ratory behavior. It could be argued that these
situations would induce conformity behavior,
and that shocking a subject is more an act
of conformity than an act of aggression. If
this is correct, then a set of measures assess- 
ing conformity might have produced larger 
and more reliable correlations between assess- 
ments and behavior. 

The smallness of correlations between 
paper-and-pencil tests and behavior is not
something new (Kagan, 1961; Lipman, 
1959). Part of the reason may be found in 
the specificity of the situation that the assess- 
ments were not designed to measure. An addi- 
tional reason may be a lack of understanding 
of the type of behavior that is being mea-
sured in the paper-and-pencil session and in
the field or laboratory. A third reason for the
relatively small size of correlations has been
suggested by Rokeach.4 He suggested that the
discrepancy between measured attitudes and 
actual behavior may be attributed to the 
fact that in most experiments only attitudes 
toward the attitudinal object are measured. 
By also measuring attitudes toward the vari- 
ous elements in the situation, he has demon- 
strated considerable increases in the amount 
of total variance accounted for. In the current 
experiment it would perhaps have been wise 
to measure attitudes toward various situa-
tional elements (e.g., the experimenter, the
equipment, type of instructions, etc.) in
addition to the broader attitudinal objects 
measured by the paper-and-pencil tests. 


Differences between Control and Experimental 
Settings 


Table 2 shows the means and t values for the
control and experimental settings analyzing 
for differences in the dependent measures. All 
of the mean values were higher in the model 
when compared to the control condition for 
4 M. Rokeach. Invited lecture, Oregon State Uni-
versity, August 1970.
TABLE 1


PEARSON PRODUCT-MOMENT CORRELATIONS BETWEEN PERSONALITY VARIABLES AND TOTAL
VOLTAGE ADMINISTERED, TOTAL TIME SHOCKED, MAXIMUM VOLTAGE, AND
SUBJECTIVE ESTIMATION OF PAIN


Variable   Total voltage administered   Total time shocked   Maximum voltage   Subjective estimation of pain
Control 
Aggression .268 —.03 22 .18 
Attributed Power 14 =15 Seth 04 
Hostility 21 10 .03 08 
Mother Aggression Training .20 2A .21 18 
Father Aggression Training 03 rol E] —.09 
Machiavellianism —.03 —.01 04 05 
Conformity 
Aggression 09 —.04 15 18 
Attributed Power — 44 09 AL 38 
Hostility 36 —.10 Sy —.06 
Mother Aggression Training 07 —.09 61» -60 
Father Aggression Training 43 SOL —25 —.19 
Machiavellianism 33 25 .06 07 
Model 
Aggression —.02 -10 —.08 —.03 
Attributed Power 05 —.26 —.06 —.08 
Hostility 03 —.09 .06 03 
Mother Aggression Training —.04 —.04 —.09 —.06 
Father Aggression Training .05 .08 02 .00 
Machiavellianism .00 16 .23 «28° 


three of the dependent measures (total volt-
age administered, total time shocked, and
maximum voltage [actual]), and the direction
of the difference was highly reliable. The
result is quite consistent with the literature on
modeling, imitation, and aggression (Mussen,


TABLE 2

VALUES BETWEEN CONTROL AND OTHER FOUR EXPERIMENTAL CONDITIONS ANALYZING FOR
DIFFERENCES IN AVERAGE VOLTAGE ADMINISTERED PER CONDITION, TOTAL TIME
SHOCKED, MAXIMUM VOLTAGE (ACTUAL), AND MAXIMUM VOLTAGE (GUESS)

Variable                          Control     Model      Conformity   Female     High model
                                  (n = 78)    (n = 79)   (n = 20)     (n = 27)   (n = 9)
Average voltage administered
per condition
  X                               132         178        236          130        173
  t                                           -4.94*     -1.71**      .21        = LO0**
Total time shocked
  X                               30.6        54.6       35.7         25.5       24.9
  t                                           -6.40*     -1.03        1.74**     .89
Maximum voltage (actual)
  X                               199         244        308          203        239
  t                                           -3.64*     -4.28*       -.20       -1.37
Maximum voltage (guess)
  X                               157         172        293          205        237
  t                                           -.79       -5.39*       -2.33**    -2.74*

* p < .005 or better.
** p < .05 or better.
PERIMENTAL CONDITIONS WHEN SEPARATED INTO MALE

AVERAGE VOLTAGE ADMINISTERED PER CONDITION, TOTAL TIME SHOCKED, MAXIMUM
VOLTAGE (ACTUAL), AND MAXIMUM VOLTAGE (GUESS)
Variable Control Model Conformity Female High model 
(n = 31) (n = 28) (n = 12) (n = (n = 9) 
Males 
Average voltage administered 
_ per condition 
124 179 276 135 173 
t ose = 1,52 — AS —2.00** 
Total time shocked 
xX 28.7 63.5 32.7 23.7 24.9 
t ANTT: —.56 1.20 4 
Maximum voltage (actual) 
187 267 295 222 239 
t sos LSe —1.27 -1. 
Maximum voltage (guess) 
156 200 278 219 237 
t —1.42 dolar —2,24"" =2.51** 
Females 
Variable Conte CASEY Conform y EST H igh BA el 
WE 
ETA 
Average voltage administered PEA T a 
_ per condition x Us 
135 8 171 127 ac 173 
1 Mee | —1.53 se —1.73"* 
Total time shocked = 
x 31.9 49.6 40.1 264° 24.9 
t —4,39"* —1.27 1.41 1.06 
Maximum voltage (actual) 
208 231 328 194 239 
t ARoS E 57 —1.06 
Maximum voltage (guess) 
x 157 156 317 199 238 
t 05 —3.34""" —1.46 SESS 
* p < .07 or better.
** p < .05 or better.
*** p < .005 or better.
Conger, & Kagan, 1969). The experimental
situation fulfilled at least two conditions under
which modeling can occur (Mussen et al.,
1969). These two conditions are lack of pun-
ishment for aggression and the possibility of
displacing aggression under socially justifiable
conditions.

In the conformity condition, the mean
values were also in the predicted direction,
with three of the relationships being reliable
(total voltage administered, maximum voltage
[actual], and maximum voltage [guess]). In
other words, the subjects conformed to the
pressure of the two confederates and followed
them up the shock-intensity scale. Conform-
ity is involved in much institutionalized vio-
lence and accounts, according to Pettigrew
(1961), for the largest proportion of hostile
prejudicial behavior toward black people. Our
findings lend support to the importance of 
conformity in the willingness to administer 
pain in the laboratory setting. 

We had expected that subjects would score lower on the various dependent
measures in the female learner condition. On total voltage administered and
maximum voltage (actual), no reliable differences occurred, whereas the
total time shocked was less in the female learner condition, and guessed
maximum voltage was greater. The subjects shocked the female learner for
shorter times than they shocked the male learner in the control condition,
which may, in part, be attributed to the assumed vulnerability of the
target. It is perhaps an assumed commonsense notion that females have a
lower threshold for pain, and, therefore, the subjects adjusted the shocking
time accordingly. At the same time, shocking a vulnerable target might
produce guilt and therefore produce a significant overestimation of the
maximum voltage administered when compared to the control group, which
underestimated when asked to recall its maximum voltage. Both of these
notions are supported by the data.

In the high model condition, the subjects 
scored higher on two of the predicted assess- 
ments (total voltage administered and maxi- 
mum voltage [guess] ), whereas the other two 
showed no reliable differences. The reason for 
this finding may partially be found in the 
“suggestion” that the previous “teacher” had 
shocked at 350 volts or beyond. In other
words, the subject perhaps felt that it was 
acceptable and expected to shock at these 
intense levels and felt socially justified in 
so doing. 


Sex Differences and Shock Administration 


Means and t values for the various conditions
and dependent measures when analyzed
separately for males and females are shown 
in Table 3. Of greatest interest was the fact 
that the pattern of behavior in the various 
conditions was very similar for the two sexes. 
Both male and female subjects tended to score 
higher on total voltage administered, total 
time shocked, and maximum voltage (actual), 
while showing no reliable differences on maximum
voltage (guess). Similar results were
found for the conformity condition where 
both male and female subjects scored higher 
on maximum voltage (actual) and maximum 
voltage (guess) when compared to controls, 
but they showed no differences on total volt- 
age administered or total time shocked. The 
female learner and the high model conditions 
also showed similar patterns in the direction 
of mean values and the reliability of signifi- 
cance tests. In short, the results suggest that 
few if any differences occur between the sexes
TABLE 4
t VALUES BETWEEN MALE AND FEMALE SAMPLES FOR CONTROL AND EXPERIMENTAL
SETTINGS ANALYZING FOR DIFFERENCES IN TOTAL VOLTAGE ADMINISTERED PER
CONDITION, TOTAL TIME SHOCKED, MAXIMUM VOLTAGE (ACTUAL), AND MAXIMUM
VOLTAGE (GUESS)

Variable                   | Control | Model | Conformity | Female
Total voltage administered | -.69    | .02   | 1.06       | [illegible]
Total time shocked         | -.75    | 1.94* | -.86       | -.74
Maximum voltage (actual)   | -1.13   | 1.95* | -.62       | .89
Maximum voltage (guess)    | -.02    | 1.46  | -.79       | .63

* p < .05 or better.


in administering shock, contrary to the find- 
ings of Buss (1961). This is clearly brought 
out in Table 4. 

As may be observed, only two values were 
significant. Males tended to score higher on 
total time shocked and maximum voltage 
(actual) administered in the model condition. 


Guilt in Administering Shock? 


In the experimental conditions, there was 
nothing to indicate to the subjects that any 
type of record was kept of their behavior.
Nevertheless, some may have felt guilt about 
administering severe shock. One way to handle 
such guilt would be to underestimate the 
maximum voltage administered. Looking at 
Tables 3 and 4, it may be noted that the 
maximum-voltage guessed values are less than 
the maximum-voltage actual values across all 
conditions except for females, who estimated 
a slightly greater value for the female learner 
condition. One difference is reliable (excluding 
control). For both males (t = 2.31, p < .025,
n = 57) and females (t = 3.75, p < .005,
n = 96), the subjects tended to underestimate 
the amount of voltage administered in the 
model condition. Since, with the exception 
noted, all of the mean values are underesti- 
mates, one could postulate a guilt-reduction 
mechanism. Note that this underestimation 
occurs even with clear reference to the shock 
equipment dials. 

Some verbal comments and behavioral mani- 
festations during the course of the experiment: 
Comments were picked up before, during, and
after the experiment. These comments were
generally written down verbatim immediately 
after the subject left. Some tenseness was 
shown by nearly all of the subjects, but no 
obvious overt disruptive behavior was mani- 
fested. That the subjects felt that the situation 
was real was, however, readily indicated by 
their comments and statements on the debrief- 
ing form. Most of the comments indicated the 
conflict between “obedience” and the sub- 
jects’ moral convictions which Milgram de- 
scribed in his studies. It would seem from 
the observations made of the subjects that 
most of them felt uncomfortable when faced 
with the task, and most of the subjects felt 
a conflict between proceeding with a distaste- 
ful task and fulfilling a responsibility to the 
experiment. This conflict was resolved in vari- 
ous ways not unlike the resolution of other 
types of conflict and included verbalization 
of the distaste, rationalization, and verbalized 
indifference. It is significant to note that 
despite all protestations, all of the subjects 
completed the experiment and often at the 
maximum voltage possible. 

Expectations of fellow university students. 
Following Milgram, we carried out an addi- 
tional experiment in which we described the 
control condition to 80 undergraduate stu- 
dents (40 males and 40 females) selected 
haphazardly from liberal arts (n = 40) and 
science (n = 40) majors. After describing the
experiment, we asked: “What is the maximum 
voltage you would administer to the learner?” 
For the entire group, the mean was 113; 
for females, 80; for males, 147; for liberal
arts, 114; and for sciences, 116. In observing
the mean values for maximum actual voltage
in Table 4, the aforementioned mean values 
were considerable underestimates of the actual 
maximum voltage across all conditions. This 
suggested that the situation created pressure 
which induced the subjects to shock more 
severely than they would have outside the 
situation. All persons like to maintain “good” 
self-images, and shocking another person even 
in the framework of a teacher-learner experiment
may not appear socially desirable. When
an individual is actually in the situation, with 
no ready escape, the individual is under pres- 
sure from situational factors (e.g., even in 
the control situation, certain expectations of 
permissible maximum-voltage administration
might be elicited by the simple fact that the 
machine is labeled up to 390 volts). Again, 
what people say they might do and what they 
actually do might be highly discrepant be- 
cause of the situational factors that change 
the individuals’ behavior. 


Debriefing Comments 


After the debriefing, the subjects were asked 
to report their reasons for participating in the 
experiment, by checking as many as appropri- 
ate among eight statements, plus an open- 
ended question. (a) Sixty responses were 
made to the statement “Because the experi- 
menter asked me to.” (b) Two subjects re- 
sponded “It seemed like the thing to do.” 
(c) Nineteen subjects said “Because I thought 
it might be interesting.” (d) Three subjects 
responded “I was afraid to say no.” (e) Fifty- 
six subjects responded “As a contribution to 
scientific investigation.” (f) Ten subjects said 
“I enjoyed being a teacher.” (g) Forty sub- 
jects responded “Because the learner was 
being paid.” (h) Twenty-[illegible] subjects said
“I felt that the experimenter knew what he
was doing.”

The open-ended question generally pro- 
duced what appeared to be rationalization of 
various types. Both conformity (Items a and
h) and rationalization of conflict (Items e 
and g) were the types of items frequently 
checked. These responses would suggest, as 
did the verbal comments during the experi- 
ment, that the subjects participated in a 
distasteful act, but they rationalized their
guilt and conformed anyway. 


Some Concluding Remarks 


This brings us to the problem of the relative
importance of personality traits and
situational structures in affecting behavior.
Assuming the relevance of our measures, and 
considering the verbal comments of our sub- 
jects, it seems clear that the situational struc- 
ture is the all-important variable in predict- 
ing behavior, and that persons in fact often 
act opposite to their predisposition to act 
when faced with situational pressures. The 
sobering conclusion is that our homes and 
society evidently produce young people who 
are reluctant but nevertheless willing to act
against their own conscience. The all-important
question is: How can we aid in strengthening
an individual’s social conscience so he


vent social situations
produce brutal, antisocial behavior?
FIRING A WEAPON AND AGGRESSION 1


ARNOLD BUSS,2 ANN BOOKER, AND EDITH BUSS


University of Texas 


Five experiments explored the relationship of guns to aggression in the
laboratory. The first four concerned the effect of target shooting on
subsequent aggression (administration of electric shock). There was no
consistent effect. The fifth experiment attempted to replicate a previously
found effect of the presence of weapons on aggression. This time, there was
no effect. Thus, there is no evidence that the presence, firing, or
long-term use of guns enhances subsequent aggression.


A series of violent events during the past 
few years has sensitized this country to the 
use and misuse of weapons. The assassina- 
tions of the Kennedys and of Martin Luther 
King, the occurrence of mass murders by dis- 
turbed individuals, the flurry of ghetto rioting, 
and the shooting of college students at Kent 
State and Jackson State have left searing 
memories behind. The effect of possessing and 
firing weapons, as in hunting and target shoot- 
ing, has been debated in newspapers and 
magazines, and on radio and television. In 
spite of the bright glare of publicity and con- 
gressional action on the licensing of guns, 
there has been little laboratory research on 
the effect of weapons and aggression. 

There are two issues here. First, it is obvi- 
ous that a nearby weapon is more likely to be 
used than one not immediately at hand, and 
we need no research to document this point. 
Thus, the availability of weapons, although of 
considerable social importance, is trivial psy- 
chologically: availability of weapons clearly 
enhances the probability that aggression will 
occur. 

The second issue is important psychologi- 
cally: Does the presence or use of weapons 
enhance the general tendency to aggress? The 
first part of this question, presence of wea- 
pons, has received a tentatively affirmative 
answer. The mere presence of a rifle and re- 
volver is sufficient to intensify aggression 
(Berkowitz & LePage, 1967). The second 
part of the question has not been answered. 


1 This research was partly supported by a grant
from the Graduate School, University of Texas. 

2 Requests for reprints should be sent to Arnold 
H. Buss, Department of Psychology, 211 Mezes 
Hall, University of Texas, Austin, Texas 78712. 


There have been opinions from experts, sur- 
mises from knowledgeable persons, and state- 
ments made in books, magazines, newspapers, 
radio, and television—but there are no ex- 
perimental data. The present research was 
directed to this point: Does firing a weapon 
enhance nonweapon aggression? 


EXPERIMENT I 
Method 


Measurement of aggression. The apparatus, an
“aggression machine” described in previous
publications (Buss, 1961, 1963, 1966a, 1966b), will be
described very briefly. The subject is told, with an
appropriate rationale, that he is to be the experimenter
in a concept learning task. The learner in this bogus
task is an experimental accomplice, who is shocked
every time he makes an error. He makes 35 errors in
80 trials, that is, gives the subject 35 opportunities to
deliver shock. There are 10 shock buttons, ranging
from nonpainful (1) through painful (2 and 3) and 
very painful (4 and 5) to extremely painful (above 
5). A shock from Button 1 would be sufficient to 
signal an error. Shock from higher buttons adds 
aversiveness and is therefore aggressive in the same 
sense as is the pain administered by a sadistic dentist 
or a vengeful parent. 

Firing a weapon. The weapon was a small air- 
powered rifle which fired small, harmless pellets. 
Subjects fired this “BB gun” five times at a card- 
board target placed 20 feet away. 

Procedure. The subjects were told that the re- 
search involved several kinds of learning, both indi- 
vidual and group. To bolster this cover story, they 
were given an additional 1-minute task. On a nar- 
row board were three pegs, one with nine disks on 
it; these varied in diameter, with the smallest on 
top. The goal was to move the disks to one of the 
other two pegs. Only the top disk could be moved 
each time, and it could be placed only on a larger 
disk or on an empty peg. The task, which quickly 
becomes complex, has been found to have consid- 
erable interest for students. 
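
The peg task described above follows the rules of the puzzle commonly known as the Tower of Hanoi (the article itself does not name it; that identification is an assumption). As a rough illustration of why the task "quickly becomes complex," the following sketch generates a shortest legal solution for nine disks and checks each move against the two stated rules. The peg labels and function names are hypothetical.

```python
def hanoi_moves(n_disks, source="A", target="C", spare="B"):
    """Yield (disk, from_peg, to_peg) moves for a shortest solution.

    Disks are numbered 1 (smallest) to n_disks (largest).
    """
    if n_disks == 0:
        return
    # Move the n-1 smaller disks out of the way, move the largest, then restack.
    yield from hanoi_moves(n_disks - 1, source, spare, target)
    yield (n_disks, source, target)
    yield from hanoi_moves(n_disks - 1, spare, target, source)


def is_legal_solution(moves, n_disks, source="A", target="C", spare="B"):
    """Replay the moves and check the two rules stated in the procedure."""
    pegs = {source: list(range(n_disks, 0, -1)), target: [], spare: []}
    for disk, src, dst in moves:
        # Rule 1: only the top disk of a peg can be moved.
        if not pegs[src] or pegs[src][-1] != disk:
            return False
        # Rule 2: a disk may go only on a larger disk or on an empty peg.
        if pegs[dst] and pegs[dst][-1] < disk:
            return False
        pegs[dst].append(pegs[src].pop())
    return pegs[target] == list(range(n_disks, 0, -1))


moves = list(hanoi_moves(9))
print(len(moves))                   # number of moves in the shortest solution
print(is_legal_solution(moves, 9))  # whether every move obeyed the rules
```

For n disks the shortest solution takes 2^n - 1 moves, so nine disks require 511 moves.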

There were two groups of subjects. The sequence 
for the control subjects was (a) peg task and (b) 
aggression machine; they did not fire the gun. For 
half of the experimental subjects, the sequence was
shooting, (c) aggression
machine; for the other half, it was (a) target shoot-


left.

Subjects. The subjects were 22 men, recruited
from an introductory psychology class at Rutgers
University. Two subjects, one experimental and one
control, were dropped from the experiment for failure
to follow instructions, leaving an n of 10 per
group. The accomplice was always a man, and four
research assistants were rotated in this role.


Results 


The dependent variable was the mean in- 
tensity of shock delivered over 35 shock trials. 
The two 


aggres- 
sion machine, (b) peg task, (c) aggression machine. 


In both groups, there was a slight tendency 
to give higher shock to the second victim 
(accomplice), a trend that replicated a previ- 
ous study (Buss, 1966b). Beyond this, there 
were no discernible trends in the data. Analysis
of difference scores (shock to the second
victim minus shock to the first victim) revealed
a statistically nonsignificant difference
between the experimental and the control
group (F = .04).




These findings replicated those of the first
experiment. Clearly, the unrestricted hypothe-


weapon might affect only those with a history
of weapons use or only those who had never


any effect on subsequent aggression. The
third experiment examined this possibility.


EXPERIMENT III
Method 


This time the male subjects were drawn from an 
introductory psychology 
Texas. The class was given a questionnaire which 


gun-use subjects fired the pistol (experimental), and
half did not (control). No-gun-use subjects were
divided in the same way. This was a 2 × 2 design
(gun use versus no gun use and experimental versus
control) with an n of 10 subjects per cell.


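TABLE 1
ANALYSIS OF VARIANCE: EXPERIMENT III

Source                          | df          | MS          | F
Gun users versus nonusers (A)   | 1           | 41.20       | 1.8
Experimental versus control (B) | 1           | 6.54        | [illegible]
A × B                           | 1           | 39.91       | 1.7
Error                           | 36          | 22.91       |
First versus second victim (C)  | 1           | 12.45       | 18.6**
A × C                           | 1           | .08         | [illegible]
B × C                           | 1           | 4.16        | 6.2*
A × B × C                       | [illegible] | [illegible] |
Error                           | 36          | .67         |
Trials (D)                      | 6           | 23.67       |
A × D                           | 6           | 1.07        |
B × D                           | [illegible] | .53         |
A × B × D                       | [illegible] | [illegible] |
Error                           | 216         | [illegible] |
C × D                           | 6           | .78         |
A × C × D                       | 6           | .23         |
B × C × D                       | 6           | 33.58       |
A × B × C × D                   | 6           | .18         |
Error                           | 216         | [illegible] |

* p < .05.
** p < .01.
*** p < .001.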
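
As a reading aid for Table 1, each F value in an analysis of variance is the mean square of an effect divided by the mean square of its error term. The sketch below recomputes the F values that remain legible in the table. One assumption is made and should be treated with caution: the value .67 printed with 36 df is taken to be the error term for the effects involving first versus second victim. The variable names are illustrative only.

```python
def f_ratio(ms_effect, ms_error):
    """F statistic: mean square of the effect over mean square of its error term."""
    return ms_effect / ms_error


# Error mean squares as read from Table 1.
BETWEEN_ERROR_MS = 22.91  # "Error" row with 36 df
# ASSUMPTION: the .67 printed with 36 df is the error term for the
# first-versus-second-victim effects (the scan is garbled at this row).
VICTIM_ERROR_MS = 0.67

# (label, MS of effect, MS of error, F as printed in the table)
rows = [
    ("Gun users versus nonusers (A)", 41.20, BETWEEN_ERROR_MS, 1.8),
    ("A x B", 39.91, BETWEEN_ERROR_MS, 1.7),
    ("First versus second victim (C)", 12.45, VICTIM_ERROR_MS, 18.6),
    ("B x C", 4.16, VICTIM_ERROR_MS, 6.2),
]

results = [
    (label, round(f_ratio(ms_effect, ms_error), 1), printed_f)
    for label, ms_effect, ms_error, printed_f in rows
]

for label, computed_f, printed_f in results:
    print(f"{label}: computed F = {computed_f}, printed F = {printed_f}")
```

Under that assumption the recomputed values agree with the printed ones to one decimal place, which supports reading the table this way but does not prove it.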
Fig. 1. Effect of gun users firing a pistol.

Fig. 2. Effect of nonusers firing a pistol.

Fig. 3. Replication of Experiment III, experimental condition only.


Results


The shock-intensity data were grouped into seven blocks of five trials each
and plotted in Figures 1 (gun users) and 2 (nonusers). For the users, the
subjects who fired the pistol gave more intense shock to the second victim;
control subjects did not. The same pattern held for nonusers: firing the
weapon led to more aggression.

The significance of these trends was evaluated by an analysis of variance,
which is presented in Table 1. The trials effect was highly significant; the
upward drift of shock intensity has occurred in virtually every study with
the aggression machine. Thus, it appears to be a constant of research with
this instrument, and it need not be discussed further. The five other
significant Fs are discussed in sequence: (a) There was a main effect of
first versus second victim (F = 18.6, p < .01). The second victim was
shocked higher than the first, an effect due mainly to the performance of
the experimental subjects. (b) There was a First versus Second Victim ×
Experimental versus Control interaction (F = 6.2, p < .05). This confirmed
the point just made: experimental subjects shocked the second victim higher
than the first, but control subjects did not. (c) There was a User versus
Nonuser × Trials interaction (F = 3.2, p < .01). The users tended to
increase shock over trials for both victims more than did the nonusers. (d)
There was a User versus Nonuser × Experimental versus Control × Trials
interaction (F = 6.5, p < .001). For gun users, the increase in shock over
trials was greater for the second victim than for the first, but for
nonusers, the increase over trials was greater for the first victim than for
the second. (e) There was an Experimental versus Control × First versus
Second Victim × Trials interaction (F = 70.0, p < .001). With respect to the
difference in shock over trials from the first to the second victim, the
experimental subjects showed a pattern different from that of the controls.
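
The blocking used for the figures (35 shock trials grouped into seven blocks of five) can be sketched as follows. The shock intensities here are invented purely for illustration and are not data from the experiment; the function name is likewise hypothetical.

```python
def block_means(intensities, block_size=5):
    """Average consecutive trials in fixed-size blocks."""
    if len(intensities) % block_size != 0:
        raise ValueError("number of trials must be a multiple of block_size")
    return [
        sum(intensities[start:start + block_size]) / block_size
        for start in range(0, len(intensities), block_size)
    ]


# HYPOTHETICAL data: one subject's button choices (1-10) over 35 shock
# trials, constructed to drift upward by one button every five trials.
hypothetical_trials = [1 + trial // 5 for trial in range(35)]

blocks = block_means(hypothetical_trials)
print(len(blocks))  # seven blocks
print(blocks)       # mean intensity per block
```

Plotting such block means against block number, separately for the first and second victim, yields curves of the kind summarized in the figure captions.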

Three of these five significant effects in- 
volved an interaction between experimental 
versus control and other variables, which 
called for separate analysis of the data of 
these two groups. The analysis of variance of 
the control data yielded no significant effects 
(other than for trials). The analysis of vari- 
ance of the experimental data yielded three 
significant effects. First, the users shocked 
higher than the nonusers (F = 6.3, p < .03).
Second, the second victim was shocked higher 
than the first (F = 8.6, p < .01). Third, the 
nonusers increased shock over trials more 
than the users (F = 3.0, p < .01). 

Among these complex findings, two facts 
stand out. First, firing the pistol did enhance 
aggression: the second victim received higher
shock than the first only when the pistol was 


fired. Second, familiarity with guns was an 
important variable, with nonusers surprisingly 
increasing shock over trials more than did 
users.

A puzzling feature was the performance of 
the nonuser experimental group (see Figure 
2). These subjects sharply increased the in- 
tensity of shock to the first victim, in con- 
trast to the mild increase of the nonuser con- 
trol group. This difference is difficult to under- 
stand because the experimental and control 
groups were treated identically until after the 
first victim was run; subsequently, the ex- 
perimental subjects fired the pistol. In the 
absence of any explanation for the divergence 
between two groups treated alike, the possibil- 
ity of sampling error was considered. The is- 
sue is important in that the performance of 
the nonuser experimental subjects contributed 
strongly to most of the significant effects. The 
only way to resolve the question was to repli- 
cate the experimental conditions. 


EXPERIMENT IV 
Method 


A new sample of 20 male students was drawn, 10 
users and 10 nonusers. They were run in the ex- 
perimental condition, firing the pistol after running 
the first of two victims on the aggression machine. 


Results 


The data are presented in Figure 3. This 
time, shock intensity given to the first victim 
was more in line with that of previous experi- 
TABLE 2
MEAN NUMBER OF SHOCKS GIVEN IN EACH CONDITION

                | Berkowitz & LePage (1967) | Present study
                | No. shocks received       | No. shocks received
Condition       | 1      | 7                | 1      | 7
No weapons      | 3.07   | 4.67             | 2.06   | 5.67
Weapons present | 2.60   | 6.07             | 1.53   | 4.80

ments. The effect of firing the pistol was negli- 
gible: there was a slight decrease from the 
first to the second victim by the users. An 
analysis of variance confirmed these trends: 
there were no significant Fs. 

In Experiment III, firing a pistol enhanced 
aggression. In the present experiment, there 
was a slight and nonsignificant trend in the 
opposite direction. The most straightforward 
interpretation was that the findings of Experiment III
were the result of sampling error and
that firing a weapon did not enhance aggres- 
sion. 

This conclusion suggested a reexamination 
of the finding that the mere presence of a 
weapon increases aggression. The Berkowitz 
and LePage (1967) experiment needed to be 
replicated. 


EXPERIMENT V 
Method 


The replication was exact, with one exception. In 
the experimental group, a shotgun and pistol lying 
on the table were associated with the confederate. 
Berkowitz and LePage told their subjects that the 
weapons belonged to the confederate, a student who 
had left them there while conducting another ex- 
periment. This instruction resulted in a large propor- 
tion of subjects being eliminated from the experi- 
ment as being too suspicious. As Berkowitz and 
LePage (1967) noted, 


This information evidently was the major source 
of suspicion; some of the subjects doubted that a 
student running an experiment would be used as 
a subject in another study, even if he were only an 
undergraduate [p. 204]. 


Accordingly, the cover story was changed. Subjects 
were told that the weapons were lying there be- 
cause the confederate was going to lend them to a 
friend who was in another experiment. This story 
served to associate the weapons with the confederate
without arousing excessive suspicion (4 subjects out
of 34 were suspicious).

The experiment was the same in all other respects. 
Subjects were told that their ideas would first be 
criticized by the confederate, using electric shock. 
One shock signified a very good rating; 10 shocks, a 
very bad one. Half of the subjects received 1 shock, 
and half received 7 shocks. Then it was the subject’s 
turn to evaluate the confederate’s ideas, and the 
number of shocks he administered comprised the 
dependent variable. 

Thus, there were two independent variables: as- 
sociated weapons versus no weapons and one shock 
received versus seven shocks received. Fifteen men 
were run in each of these four groups, and another 9 
were eliminated from the experiment; suspicion, 4; 
reporting an incorrect number of shocks, 3; and 
equipment malfunction, 2. 


Results 


The results of our experiment and those of 
Berkowitz and LePage (1967) are presented 
in Table 2. They found a significant weapons 
effect for subjects who received seven shocks,
the presence of guns increasing the number
of shocks. In the present experiment the
presence of weapons significantly decreased
the number of shocks (F = 5.7, p < .05).

In the face of these opposed findings, another replication was attempted.
The shock level had been set at approximately that used by Berkowitz and
LePage (1967), but it may have been too high. Therefore, shock intensity
was reduced to the level of barely painful. Only the condition in which the
subject received seven shocks was repeated, again with a weapons and a
no-weapons group. This time the presence of weapons had no effect at all,
the two means being identical (5.50). Thus, in three experiments (Berkowitz
& LePage, 1967; and the two present replications), the presence of weapons
increased or decreased shock frequency or had no effect. Such data do not
generate confidence in any effect of weapons on aggression.
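
The comparison across the three experiments can be made concrete by taking, for subjects who received seven shocks, the mean number of shocks given with weapons present minus the mean with no weapons. The means below are those reported in Table 2 and in the text; the dictionary keys and function name are illustrative only.

```python
# Mean number of shocks given by subjects who had received seven shocks.
means_seven_shocks = {
    "Berkowitz & LePage (1967)": {"no_weapons": 4.67, "weapons": 6.07},
    "Present study, first replication": {"no_weapons": 5.67, "weapons": 4.80},
    "Present study, second replication": {"no_weapons": 5.50, "weapons": 5.50},
}


def weapons_effect(cell_means):
    """Weapons-present mean minus no-weapons mean, rounded to two decimals."""
    return round(cell_means["weapons"] - cell_means["no_weapons"], 2)


effects = {study: weapons_effect(cells) for study, cells in means_seven_shocks.items()}

for study, effect in effects.items():
    print(f"{study}: {effect:+.2f} shocks")
```

The three differences are positive, negative, and zero, which is the pattern the text describes as increased, decreased, or no effect.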


Discussion 


It may be helpful to review the findings. 
Experiment I showed that firing a weapon had 
no effect on subsequent aggression, and Ex- 
periment II confirmed this, using an own- 
control design and a different weapon. Experi- 
ment III compared gun users with nonusers 
and found (a) differences in initial aggression
and that (b) firing a weapon does enhance
subsequent aggression. Experiment IV
repeated the experimental condition only


sults and no results, respectively. The discus- 
sion will focus on two issues: 


Methodological Issues 


It might be argued that a human-shaped target might be more appropriate
than the traditional bull’s-eye. However, it is presumably the weapon
itself (or firing it) that is important, not the particular target. When
gun users practice shooting, they use a bull’s-eye target, which suggests
that this is the most appropriate target if one wishes to generalize
findings.

It might be contended that shooting at a target might not enhance
aggression, whereas hunting and killing animals does. There is no direct
answer to this speculation, and a laboratory test of it seems unlikely.
Nevertheless, there is indirect evidence. If hunting makes
a long history of
chronically higher


difference. These facts suggest that hunting,
target shooting, and a fondness for guns do
not enhance aggression.

A devil’s advocate might also protest that 
the measures were not “valid,” but the aggres- 
sion machine and its variants have been 
widely used. A sampling includes aggression 
in relation to frustration (Buss, 1963, 1966a;
Geen & Berkowitz, 1967); repression (Buss &
alcohol (Bennett, Buss, &
Carpenter, 1969); authoritarianism (Epstein,
1965); and dissonance (Brock & Buss, 1962).
Berkowitz’s procedure has also been used effectively
in many different contexts (see
Berkowitz, 1964, 1969). Thus, “validity” is
not an issue here.
The last issue of 


population do tend to vary, and such variability
may yield unreliable data. One way of
dealing with the problem is to use an own-control
design,
ability, but the best solution is to repeat
experiments.


and LePage (1967)
fact, the three successive experiments, theirs
and the two present ones, yielded just the
kind of data found in statistics texts to
illustrate a population mean of zero: weapons
increase aggression,
gression, and no difference between weapons
and no weapons.


tional aggression, and 
toward inhibition of aggression with weapons 
present.® 


Broader Implications 


The present negative findings leave other 
facts about aggression untouched. That vio- 


1968; 
and the arousing effects of film violence
are well known (Berkowitz, 1964). But
Berkowitz (1968) has 


when the researchers tried to 


with the confederate the way 


cover story, which confirms our own speculations 
(see Experiment V).


that: “Guns not only permit violence, they 
can stimulate it as well. The finger pulls the 
trigger, but the trigger may also be pulling 
the finger [p. 22].” This conclusion appears 
to be based on only a single laboratory study 
(Berkowitz & LePage, 1967), which we and
another group of researchers have failed to 
replicate. 

In light of the apparent violence that is 
so much discussed today, researchers are 
likely to overemphasize the potency of mass 
media and weapons as causes of violence. 
Fashions change, and what is popular today 
as the cause of violence may tomorrow fade 
into oblivion. Such fickleness urges caution in 
extrapolating from isolated experiments and 
controlled laboratory research to the complex 
and confounded world of everyday life. 

A final word of caution: none of this dis- 
cussion touches on the availability issue. If 
guns are handy, they are likely to be used—a 
fact that suggests restricting the availability 
of weapons. But the effect of weapons on 
subsequent aggression is a separate issue. The 
present data suggest that neither the long- 
term use nor the transient firing of a weapon 
enhances subsequent aggression. 
EFFECTS OF ATTITUDE AND EGO INVOLVEMENT
ON THE LEARNING AND RETENTION OF 
CONTROVERSIAL MATERIAL 


SALI S. SMITH AND BRUCE D. JAMIESON 1
University of Canterbury, Christchurch, New Zealand 


An experiment was designed to investigate the effects of attitude and ego 
involvement on selective 


Until recently, the evidence supporting the 
occurrence of selective learning and retention 
of controversial material was quite consistent.
Seven studies (Alper & Korchin, 1952;
Edwards, 1941; Jones & Aneshansel, 1956;
Jones & Kohler, 1958; Levine & Murphy, 
1943; Taft, 1954; Zimmerman & Bauer, 
1956) demonstrated the operation of selective 
learning to some degree, and in the process 
the original attitudinal hypothesis suggested 
by Levine and Murphy was extended. Jones 
and Aneshansel showed that the expected 
superior learning and retention of attitudinally 
congenial material could be reversed when 
subjects anticipated that they would subse- 
quently make use of opposing statements in a 
debate. Jones and Kohler found that subjects 
learned plausible statements that were consistent
with their attitudes significantly better
than they learned implausible statements sup- 
porting their attitudes and plausible state- 
ments opposing them. Only Williams (1950), 
using a different experimental paradigm, was
unable to find evidence of selective learning. 

The results of more recent studies (Brigham 
& Cook, 1969; Fitzgerald & Ausubel, 1963; 
Greenwald & Sakumura, 1967; Waly & Cook,
1966), however, cast serious doubts on the 


1 Copies of the experimental stimuli, the procedure, 
the instructions to the subjects, the scales used for 
sample determination, the methods of construction, 
and requests for reprints should be sent to Bruce 
D. Jamieson, Department of Psychology, University 
of Canterbury, Christchurch 1, New Zealand. 
existence of attitudinal mediation in the learning
and retention of controversial material.
Although the Levine-Murphy paradigm has 
generally been followed, investigators have 
made use of a number of issues and have 
varied stimulus materials, experimental procedures,
and scoring systems. This increases
the difficulty of identifying possible factors 
that can explain why earlier studies suc- 
ceeded in demonstrating selective learning, 
while later studies have failed to do so. 

It is possible that the existence of selective 
learning phenomena is related to the issue 
underlying the controversial material used and 
that this is a relationship subject to temporal 
variation. The tendency to use particularly 
topical social or political issues (Jones & 
Kohler, 1958; Levine & Murphy, 1943; Waly 
& Cook, 1966; Zimmerman & Bauer, 1956) 
decreases the likelihood of replication due to 
changes in modal attitudinal patterns between 
subject groups measured in succeeding studies. 
Such an interpretation appears to have been 
favored by Waly and Cook in their failure to 
replicate the results of Jones and Kohler. It 
would therefore seem desirable to use an issue 
less subject to topical influences, but one 
that results in wide attitudinal variance among 
different subjects. It is obviously not possible 
to eliminate temporal variability in issues, and 
so the distinction is a relative one. 

Earlier investigators (Alper & Korchin, 
1952; Dutta & Kanungo, 1967; Kanungo & 
Dutta, 1966; Taft, 1954) have suggested the
possible importance of an attitude-related
variable, that is, ego involvement, in this 
context. However, this variable has never 
been experimentally manipulated nor used as 
a basis for subject selection. Possibly, pre- 
vious studies, while varying attitude between 
experimental groups, have not controlled for 
the effects of the degree of ego involvement 
in the issue. It is suggested that attitude 
intensity and ego involvement are distinct 
constructs, and although they may be cor- 
related (Riland, 1959; Weksel & Hennes, 
1965), the relationship will not be strong. 
Certainly the effects on learning and retention 
of ego involvement in the particular issue 
should be investigated. 

One major inadequacy of the traditional 
approach is the difficulty of separating the 
effects of selective learning and selective re-
tention. While most investigators have been
able to obtain an unambiguous measure of 
learning, some have presented measures of 
retention that are contaminated by the learn- 
ing levels. It seems that selective retention, 
per se, can only be said to have occurred if 
the measure of retention is independent of 
the levels of learning. If selective learning 
has occurred earlier and those parts of the 
stimulus material that were learned first have 
been practiced, while learning of the other 
parts of the stimulus material continues, then 
the most parsimonious explanation of any 
selective retention is in terms of the varying 
degrees of learning of the stimulus material.
Williams (1950) also recognized this prob- 
lem, but his failure to incorporate any method 
for determining the success of experimental 
induction of needs makes his results ambigu- 
ous. A more controlled procedure would be 
to have all of the subjects learn the stimulus 
material to a specific criterion. Use of the 
adjusted learning method (Underwood, 1954) 
would also assist in maintaining an approxi- 
mately equal degree of learning among items 
in the stimulus material. Using this method, 
the experimenter presents a set of stimuli to 
the subject. As each stimulus is learned to a 
given criterion, it is dropped from the set, 
and only those stimuli that have not been 
learned to the criterion are presented on the 
succeeding learning trial.

Most studies of selective learning have used
either free recall (Brigham & Cook, 1969;
Greenwald & Sakumura, 1967; Jones & 
Aneshansel, 1956; Jones & Kohler, 1958; 
Levine & Murphy, 1943; Taft, 1954; Waly 
& Cook, 1966; Watson & Hartmann, 1939; 
Zimmerman & Bauer, 1956) or a recognition 
test (Brigham & Cook, 1969; Clark, 1940; 
Edwards, 1941; Fitzgerald & Ausubel, 1963) 
to measure retention. With the exception of 
Williams (1950), researchers have neglected 
the relearning method of measuring retention. 
Relearning can yield a savings score, which is 
claimed to be a more sensitive measure of 
retention than recall (Mednick, 1964), and 
therefore seems potentially suited to a situa- 
tion in which forgetting is known or expected 
to be great. The use of a control learning 
period in which subjects can become accus- 
tomed to the experimental method but in 
which different experimental material is used 
should reduce the problems of “learning how
to learn” sometimes associated with the use
of relearning as a retention measure.

The present research was designed to meet
some of the above criticisms and to test the
following hypotheses:

1. Subjects will learn statements that sup-
port their own attitudes more easily than
statements that conflict with these attitudes.

2. High ego-involved subjects will learn the
statements more easily than low ego-involved
subjects.

3. Subjects will retain more content from
statements that support their attitudes than
from statements that conflict with these atti-
tudes.

4. High ego-involved subjects will retain
more content from statements than low ego-
involved subjects.


METHOD 


A 3 X 2 X 2 X 2 factorial design (Attitude X In-
volvement X Sessions X Statement Types) with re-
peated observation on sessions and statements was
used to test the hypotheses. Three experimental ses-
sions were carried out, but the data from the first,
the control learning session, were analyzed sepa-
rately.
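
The structure of this design can be checked with a short sketch. The Python fragment below is only an illustration: the function name split_plot_df and its dictionary keys are hypothetical, and the cell size of nine subjects per Attitude X Involvement group is taken from the Subjects section. It computes the degrees of freedom implied by two between-subjects factors (Attitude, Involvement) and two within-subjects factors (Sessions, Statement Types).

```python
def split_plot_df(a, b, c, d, n):
    """Degrees of freedom for a design with two between-subjects factors
    (a and b levels), two within-subjects factors (c and d levels), and
    n subjects in each between-subjects cell."""
    groups = a * b                     # between-subjects cells
    subjects = groups * n              # total number of subjects
    error_between = groups * (n - 1)   # subjects within groups
    return {
        "between_subjects": subjects - 1,
        "A": a - 1,
        "B": b - 1,
        "AxB": (a - 1) * (b - 1),
        "subjects_within_groups": error_between,
        "C": c - 1,
        "C_x_subjects_within_groups": (c - 1) * error_between,
        "D": d - 1,
        "D_x_subjects_within_groups": (d - 1) * error_between,
        "CxD": (c - 1) * (d - 1),
        "CxD_x_subjects_within_groups": (c - 1) * (d - 1) * error_between,
        "total": subjects * c * d - 1,
    }


# 3 attitudes x 2 involvement levels x 2 sessions x 2 statement types,
# nine subjects per Attitude X Involvement group (54 subjects in all).
df = split_plot_df(a=3, b=2, c=2, d=2, n=9)
print(df["between_subjects"], df["subjects_within_groups"], df["total"])
# prints: 53 48 215
```

The values 53 and 48 agree with the between-subjects and error degrees of freedom reported with the F ratios in the Results.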


Issue 


The controversial issue chosen was belief in the 
existence of God. This issue was chosen because it
seemed less sensitive to short-term changes as a func-
tion of topicality and because it appears, intuitively, 
to possess fewer dimensions than some of the issues 
used in previous studies. 


Stimulus Material 


The experimental material used in the study con- 
sisted of six 15-18 word statements, three of which 
were favorable to belief in the existence of God and 
three of which were opposed to this belief. Examples
of each are:

Prostatement

Belief in God is necessary for man to understand
the purpose and meaning of his existence.

Antistatement 

Only those people who are too dependent to face 
life unaided need to believe in God. 


Subjects


The sample used in the experiment was 54 male 
and female subjects recruited from 265 first-year and 
58 second-year psychology students enrolled at the 
University of Canterbury. The sample was deter-
mined by responses to a selection questionnaire ad-
ministered several weeks prior to the experiment.

The selection questionnaire used in the determina-
tion of sample groups comprised three scales and a
self-report of religious behavior.

1. Form A: Chave and Thurstone Attitude toward 
God: Reality of God scale (Shaw & Wright, 1967). 
The responses to this scale were scored using the 
values reported by Ayad and Farnsworth (1953). 
Scores of 6.6 or higher were classified pro-God; 
scores of 4.5-6.6, neutral; and a score of 4.4 or less 
was taken to indicate an anti-God attitude.

2. Involvement scale. This scale consisted of six 
ordered statements ranging from “My whole life is 
dedicated to teaching others about the truth of my 
convictions” to “This issue does not concern me.” 
The statements were selected from 17 items indicating 
differing degrees of involvement, which were ranked 
from highest to lowest involvement by 20 post-
graduate psychology and chemistry students. The
subset of six items retained in the scale had mean
rankings ranging from 1.0 to 16.15, and all had
standard deviations of less than 2.60. Each subject
was asked to check which of the six items was closest 
to his own behavior with respect to the issue. 
Respondents to the first three statements were clas- 
sified high involvement, and the remainder were 
classified low involvement. 

3. Attitude Involvement scale. This scale of 14
items was developed by the authors in an attempt to
place subjects in Attitude X Involvement cells and to
check on responses to the two former scales. The
14 items in the scale were those categorized with
high levels of agreement by 50 undergraduate and
postgraduate psychology and chemistry students and
20 members of the public visiting the university
from an original scale of 27 items. Each of the six
Attitude X Involvement combinations and an ag-
nostic viewpoint were represented by 2 items, the
14 items being presented in random order. The sub-
jects in the experimental population were asked to
rank order the three statements closest to their
position on the issue.

4. Self-report data. Questions were addressed to 
the subjects concerning age, sex, years at the uni- 
versity, religious upbringing, and current religious 
behavior. 

On the basis of these responses, the subjects were 
categorized into eight groups: proattitudes, anti- 
attitudes, and neutral attitudes toward the existence 
of God, each at high and low levels of ego involve- 
ment, and two additional groups—namely, agnostics 
and inconsistent respondents. Inconsistent respond- 
ents were made up of 48 subjects whose self-report 
data and whose scores on the three scales did not 
indicate a consistent Attitude X Involvement cate-
gorization. This group, together with the 41 sub-
jects who classified themselves as agnostic, did not
take part in the experiment. It was considered that
inclusion of agnostics would introduce a further di-
mension to the issue, while inconsistent respondents
were omitted because of the difficulties of making an
unambiguous categorization.
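
The two cut-off rules that define the Attitude X Involvement cells can be summarized in a brief sketch. The Python fragment below is illustrative only: the function names are hypothetical, the six Involvement items are assumed to be numbered 1 (highest involvement) to 6 (lowest), and a score of exactly 6.6 is assigned to the pro-God category, following the wording "6.6 or higher." The consistency checks against the third scale and the self-report data are not modeled.

```python
def classify_attitude(score):
    """Classify a Reality of God scale score using the reported cut-offs."""
    if score >= 6.6:          # "6.6 or higher" -> pro-God
        return "pro"
    if score <= 4.4:          # "4.4 or less" -> anti-God
        return "anti"
    return "neutral"          # remaining scores (reported as 4.5-6.6)


def classify_involvement(item_number):
    """Classify the checked Involvement item (assumed numbered 1 to 6,
    ordered from highest to lowest involvement)."""
    if not 1 <= item_number <= 6:
        raise ValueError("item_number must be between 1 and 6")
    return "high" if item_number <= 3 else "low"


def assign_cell(score, item_number):
    """Return the (attitude, involvement) cell for one respondent."""
    return classify_attitude(score), classify_involvement(item_number)


print(assign_cell(7.2, 2))   # ('pro', 'high')
print(assign_cell(5.0, 5))   # ('neutral', 'low')
```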

Six experimental groups of nine subjects each were 
chosen randomly from within each Attitude X In- 
volvement pool of respondents who met the above 
criteria. The 27 male and 27 female subjects each 
received credit toward a course requirement of at- 
tendance as experimental subjects. In addition, they 
received $1.50 for the third session, which was held 
close to end-of-course examinations. Ages ranged 
from 17 to 30 years, with a mean of 20.02 years. 


Procedure 


Control learning session. Each subject was required 
to rate six statements on “television and the young 
child” in regard to intensity of affect, familiarity, 
agreement, novelty, and plausibility. The subject was 
then told that he would participate in two related 
experiments. He was told that he would first read 
through six statements, each on a separate card, and 
that he would then move on to the second experi- 
ment, a digit-symbol test that was being used in a 
departmental study of warm-up and practice effects 
on intelligence test tasks. The subject was then told 
that after 15 seconds on the digit-symbol task, he 
would be asked to write down as many of the six 
statements as he could recall and then return to the 
digit-symbol task for a further 90 seconds. The digit- 
symbol task was in fact introduced as an interpo- 
lated task to separate learning trials and to prevent 
rehearsal during the scoring of the previous trial 
responses. 

The six statements were presented to the subject in 
a predetermined random order that varied between 
trials. Any statement that was recalled correctly on 
two consecutive trials was removed from the set 
presented to the subject on the next trial. When all 
statements had been learned to this criterion, a test 
trial was run in which all six statements were pre-
sented again and were recalled by the subject. This
trial was to control for recency effects and to permit
a measure of short-term retention and decay.
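
The drop-out procedure described above can be sketched as a small simulation. The Python fragment below is an illustration under stated assumptions, not the procedure as actually administered: the function name run_dropout_learning, the recall_fn callback that stands in for a subject's recall on a given trial, and the max_trials safety cap are all hypothetical. Only the criterion (two consecutive correct recalls) and the removal of learned statements from the next trial follow the text. The interpolated digit-symbol task and the final test trial are not modeled.

```python
import random


def run_dropout_learning(statements, recall_fn, criterion=2,
                         max_trials=50, seed=0):
    """Present statements trial by trial, dropping each statement once it
    has been recalled correctly on `criterion` consecutive trials.

    recall_fn(statement, trial) -> bool is a hypothetical stand-in for
    whether the subject recalls the statement correctly on that trial.
    Returns the trial number on which each statement reached criterion.
    """
    rng = random.Random(seed)
    streak = {s: 0 for s in statements}   # consecutive correct recalls
    trials_to_criterion = {}
    active = list(statements)             # statements still being presented
    trial = 0
    while active and trial < max_trials:
        trial += 1
        rng.shuffle(active)               # presentation order varies by trial
        for s in active:
            streak[s] = streak[s] + 1 if recall_fn(s, trial) else 0
        for s in active:
            if streak[s] >= criterion:
                trials_to_criterion[s] = trial
        # learned statements are not presented on the next trial
        active = [s for s in active if streak[s] < criterion]
    return trials_to_criterion


# Hypothetical subject: each statement is recalled from a fixed trial onward.
first_correct = {"S1": 1, "S2": 3}
result = run_dropout_learning(["S1", "S2"],
                              lambda s, t: t >= first_correct[s])
print(result == {"S1": 2, "S2": 4})   # True
```

The number of trials recorded for each statement corresponds to the rate-of-learning measure used in the Results.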

Experimental learning session. This session was 
conducted at another time convenient for each sub-
ject, with the interval after the first session averag-
ing 22 days. Basically the procedure was the same
as for the control learning session, but the topic this
time was “belief in the existence of God.” In addi-
tion, in this session, an immediate recall test was
taken after the initial rating of statements prior to
the first learning trial, in order to take some measure
of incidental learning.

Relearning session. This session was run 3 weeks 
after each subject’s experimental learning session. The 
subject was instructed to write as many of the six 
statements as he could recall from the previous ses-
sion. As the learning material in the first two sessions
had been unrelated, subjects should not have antici-
pated being asked to recall the statements in the
third session. Retrospective comments from the
subjects showed that this assumption was justified.

Following this free recall, the subjects then re-
learned the six statements, using the same pro-
cedure followed in the previous sessions. When the
test trial was completed, the subject was asked to 
rate the six statements again on the five attributes— 
intensity of affect, familiarity, agreement, novelty, 
and plausibility. 

The subject was then paid and thanked for his 
cooperation. The experimental procedure and pur- 
poses of the study were explained in as much de- 
tail as the subject requested, and he or she was re-
quested not to discuss the investigation with other
subjects.


RESULTS 

Measures 

In common with previous studies, the rate
of learning was measured by the number of
trials to learn the statements to criterion.
Content of learning was measured using a
scoring system similar to Taft’s (1954) and
Clark’s (1940). Each statement was divided
into six idea units. A response was scored
2 if the meaning of an idea unit was correctly
recalled, 1 if the meaning of the unit was
altered or partially recalled, and 0 if the unit
was omitted. In spite of the limited time
available for scoring following responses on
any one trial, an interjudge reliability measure
on recalls of a sample of six subjects yielded
a correlation coefficient of .80, with the two
judges agreeing in 90% of the cases.
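
The content-scoring rule can be illustrated with a minimal sketch. In the Python fragment below, the function name score_statement and the labels "correct", "partial", and "omitted" are hypothetical conveniences for a judge's decision on each idea unit; only the point values (2, 1, 0) and the division of each statement into six idea units come from the text.

```python
# Points per idea unit, as described in the text.
POINTS = {"correct": 2, "partial": 1, "omitted": 0}
UNITS_PER_STATEMENT = 6


def score_statement(unit_judgments):
    """Sum the points for the six idea units of one recalled statement.

    unit_judgments is a sequence of six labels ("correct", "partial",
    "omitted"); the labels themselves are illustrative, not the authors'.
    """
    if len(unit_judgments) != UNITS_PER_STATEMENT:
        raise ValueError("each statement has exactly six idea units")
    return sum(POINTS[label] for label in unit_judgments)


recall = ["correct", "correct", "partial", "omitted", "partial", "correct"]
print(score_statement(recall))            # 8
print(score_statement(["correct"] * 6))   # 12, the maximum per statement
```

Under this rule a statement can earn between 0 and 12 points.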


Experimental Controls

The relationship between the two major
independent variables, attitude and ego in-
volvement, was determined by correlating
scores on the Involvement scale with [...]
the Chave and Thurstone Attitude toward
God: Reality of God scale, for a random sam-
ple of 50 subjects drawn from the original [...]
subjects who completed the selection ques-
tionnaire. The obtained correlation [...],
although reaching statistical significance ([...] =
2.40, df = 48, p < .05), indicates that the
relationship is not very strong and suggests
that the two variables are distinct constructs.
This result compares with a correlation of .28
reported by Riland (1959).

As a check on the attitudinal categoriza- 
tion of subjects and statements, mean agree- 
ment scores for the three groups on a rating 
scale ranging from 1 (I disagree completely) 
to 9 (I agree completely) were compared. 
Mean agreement ratings for the prostatements 
were 6.3 for prosubjects, 4.2 for neutral sub- 
jects, and 2.0 for antisubjects, while for the 
antistatements the mean ratings were 2.5, 4.5, 
and 7.0, respectively. A 3 X 2 analysis of vari-
ance (Attitudes X Statements) on these data
yielded a highly significant interaction effect
(F = 74.15, df = 2.81, p < .001).

Further 3 X 2 analyses of variance (Atti- 
tudes X Involvement) revealed that there 
were no differences in the learning abilities of 
the experimental groups as revealed by rate of 
learning and test-trial recall data from the 
control learning session. 

Finally, a check was made to determine
whether there were any differences in the con-
tent of learning between groups on the test
trial at the end of the experimental learning
session. A 2 X 3 X 2 analysis of variance
(Statements X Attitude X Involvement) was
used to analyze these data, but no significant
main effects or interactions occurred. Thus, it
could be assumed that any subsequent selec-
tive retention was reasonably uncontaminated
by selective learning.


Learning 

Three measures of learning are relevant to
Hypotheses 1 and 2: rate of learning and two
measures of content of learning. Incidental
learning was measured after the ratings in the
experimental learning session, and the con-
ventional fifth-trial content measure was ob-
tained in the same session.

TABLE 1

MEAN NUMBER OF TRIALS TO LEARN EACH STATEMENT IN EXPERIMENTAL
LEARNING AND RELEARNING SESSIONS

                               Proattitude            Neutral attitude           Antiattitude
Involvement of subjects      High        Low          High        Low          High        Low
Statement                  Pro  Anti   Pro  Anti    Pro  Anti   Pro  Anti    Pro  Anti   Pro  Anti
Session
Experimental learning     6.22  4.78  6.96  7.07   7.63  4.67  7.26  4.85   6.15  5.48  7.48  6.96
Relearning                3.52  3.04  4.56  5.04   5.22  3.41  3.81  3.81   5.59  3.93  5.26  4.93


Table 1 presents the mean number of trials
to learn each statement in the experimental
learning and relearning sessions. A four-way
analysis of variance (Attitudes X Involve-
ment X Sessions X Statements), with repeated
measures on sessions and statements, was con-
ducted on these data and is summarized in
Table 2.

TABLE 2

SUMMARY TABLE OF RATE OF LEARNING
ANALYSIS OF VARIANCE

Source                              SS            df    MS            F
Between subjects                    5176.038      53
  Attitudes (A)                     [illegible]    2    [illegible]   [illegible]
  Involvement (B)                   236.463        1    236.463       2.53
  A X B                             274.037        2    137.018       1.46
  Subjects within groups            [illegible]   48    93.850
Sessions (C)                        1849.186       1    1849.186      65.70
A X C                               [illegible]
B X C                               [illegible]
A X B X C                           23.815         2    11.907        [illegible]
C X Subjects within groups          1350.945      48    28.144
Statements (D)                      462.297        1    462.297       24.56*
A X D                               181.147        2    90.573        4.81*
B X D                               136.963        1    136.963       7.27*
A X B X D                           6.371          2    3.185         .17
D X Subjects within groups          903.722       48    18.827
C X D                               56.017         1    56.017        4.07*
A X C X D                           90.700         2    [illegible]   [illegible]
B X C X D                           11.575         1    11.575        [illegible]
A X B X C X D                       22.258         2    [illegible]   [illegible]
C X D X Subjects within groups      [illegible]   48    13.760

* p < .05.

The F ratios for two main effects were sig-
nificant. The subjects took fewer trials to
learn each of the three antistatements (X =
4.83) than they did to learn each of the three
prostatements (X = 5.81). As anticipated,
the sessions main effect was highly significant,
with a mean of 6.29 trials per statement in
the experimental learning session and a mean
of 4.34 trials per statement in the relearning
session. Four interactions yielded significant F
ratios: Statements X Attitudes, Statements X
Involvement, Statements X Sessions, and
Statements X Sessions X Attitudes.

The Statements X Attitude interaction that
tests Hypothesis 1 and that has relevance
to the Levine-Murphy (1943) hypothesis
yielded a significant F ratio (F = 4.81, df =
2/48, p < .05). However, some of the differ-
ences among pooled means were not in the
directions predicted by Hypothesis 1. Hy-
pothesis 1 would predict that prosubjects
should learn prostatements more quickly than
antistatements, and that antisubjects would
learn antistatements more quickly than pro-
statements. The data support this comparison
only for antisubjects, but at a nonsignificant
level (t = 1.38, df = 17, p > .10). An analy-
sis of variance of the incidental learning data
that appear in Table 3 also yielded a signifi-
cant Statements X Attitude interaction (F =
9.18, df = 2/48, p < .01). While the group
means in Table 3 are in the expected direc-
tion, the important group differences are not
significant. Prosubjects learned more content
from prostatements than they did from anti-
statements (t = 1.69, df = 17, ns), and anti-
subjects learned more content from antistate-
ments than prostatements (t = .40, df = 17,
ns). A considerable amount of the variance in
the significant interaction from the analysis of
variance is contributed by the neutral group.
Finally, when data from the content of learn-
ing at the end of the fifth trial were analyzed,
the Statements X Attitude interaction was not
significant (F = 2.71, df = 2/48), although
the differences in means were again in the ex-
pected direction. As a result of these three
measures, Hypothesis 1 is rejected.

TABLE 3

MEAN CONTENT SCORES PER STATEMENT FOR
INCIDENTAL LEARNING IN THE EXPERI-
MENTAL LEARNING SESSION

                            Proattitude      Neutral attitude     Antiattitude
Involvement of subjects     High    Low      High    Low          High    Low
Statement
Pro                         6.67    4.44     3.18    4.30         5.81    4.04
Anti                        4.74    3.81     7.15    6.67         6.52    4.56
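
The pooled means quoted for the rate-of-learning data can be recovered from the cell means in Table 1. The Python sketch below is an illustration only: the dictionary layout and the helper names session_mean and statement_mean are hypothetical, and the calculation assumes equal cell sizes (nine subjects per group), so that an unweighted mean of cell means equals the pooled mean.

```python
# Mean trials per statement from Table 1, keyed by (attitude, involvement).
# Each value is (prostatement mean, antistatement mean).
TABLE_1 = {
    "experimental": {
        ("pro", "high"): (6.22, 4.78), ("pro", "low"): (6.96, 7.07),
        ("neutral", "high"): (7.63, 4.67), ("neutral", "low"): (7.26, 4.85),
        ("anti", "high"): (6.15, 5.48), ("anti", "low"): (7.48, 6.96),
    },
    "relearning": {
        ("pro", "high"): (3.52, 3.04), ("pro", "low"): (4.56, 5.04),
        ("neutral", "high"): (5.22, 3.41), ("neutral", "low"): (3.81, 3.81),
        ("anti", "high"): (5.59, 3.93), ("anti", "low"): (5.26, 4.93),
    },
}


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def session_mean(session):
    """Mean trials per statement in one session, pooled over all cells."""
    return mean(v for pair in TABLE_1[session].values() for v in pair)


def statement_mean(index):
    """Mean trials for prostatements (index 0) or antistatements (index 1),
    pooled over both sessions and all groups."""
    return mean(cells[key][index]
                for cells in TABLE_1.values() for key in cells)


# These come out close to the values reported in the text
# (6.29 and 4.34 for sessions; 5.81 and 4.83 for statements).
print(session_mean("experimental"), session_mean("relearning"))
print(statement_mean(0), statement_mean(1))
```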

Although Table 1 shows that high ego-in-
volved subjects took fewer trials to learn
statements than did low ego-involved subjects
(the two means being 5.01 and 5.70, respec-
tively), the analysis of variance in Table 2
showed no significant main effect for involve-
ment (F = 2.52, df = 1/48). However, the
incidental learning data in Table 3, when ana-
lyzed, did yield a significant main effect for
involvement, with high ego-involved subjects
exhibiting more incidental learning than low
ego-involved subjects (F = 4.36, df = 1/48, p
< .05). Similarly, an analysis of the data
from the content of learning following the
fifth trial indicated that high ego-involved
subjects did learn more than low ego-involved
subjects (F = 5.38, df = 1/48, p < .05).
Taken together, these three measures provide
partial support for Hypothesis 2.

TABLE 4

MEAN RECALL SCORES PER STATEMENT FOR DELAYED
RECALL IN THE RELEARNING SESSION

                            Proattitude      Neutral attitude     Antiattitude
Involvement of subjects     High    Low      High    Low          High    Low
Statement
Pro                         8.[?]7  3.89     5.19    4.07         5.59    2.52
Anti                        6.56    5.07     6.04    6.78         6.59    3.89


Retention 


Mean recall scores for the delayed recall
measure at the beginning of the relearning
session are presented in Table 4. When these
data were analyzed, the Statements X Atti-
tude interaction was not significant (F =
1.13, df = 2/48) in this case, nor was it when
the relearning data expressed as savings scores
in Table 5 were analyzed. Thus, there was no
support for a Levine and Murphy (1943) re-
tention effect, and Hypothesis 3 was rejected.

When the effects of ego involvement on
retention were examined, it was found that
high ego-involved subjects recalled more than
low ego-involved subjects (F = 7.96, df =
1/48, p < .01) but that there were no differ-
ences between these involvement groups on
the relearning measure. These results, then,
only give partial support for Hypothesis 4.


Rating Data 


Correlation coefficients were used to investi-
gate whether or not ease of learning was re-
lated to some property of the stimulus ma-
terial. The number of trials to learn each
statement and the rank order of learning were
correlated with ratings of familiarity, novelty,
intensity of affect, and agreement. The corre-
lations between these ratings and the two
measures of learning were small (r = .05 to
-.08).


TABLE 5

MEAN PERCENTAGE SAVINGS SCORES
FOR EXPERIMENTAL GROUPS

                            Proattitude      Neutral attitude     Antiattitude
Involvement of subjects     High    Low      High    Low          High    Low
Statement
Pro                         39.92   25.11    29.11   41.26        10.07   25.24
Anti                        34.27   28.30    12.40   16.89        23.21   22.09

Note. Score = [(No. of trials experimental learning session
- No. of trials relearning session) /
No. of trials experimental learning session] X 100.
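
The savings score defined in the note to Table 5 is straightforward to compute. The Python sketch below is illustrative: the function name percentage_savings is hypothetical, and the trial counts in the example are invented for demonstration rather than taken from any subject in the study.

```python
def percentage_savings(trials_learning, trials_relearning):
    """Percentage savings: the reduction in trials needed at relearning,
    expressed as a percentage of the trials needed at original learning."""
    if trials_learning <= 0:
        raise ValueError("trials_learning must be positive")
    return (trials_learning - trials_relearning) / trials_learning * 100


# Hypothetical example: 6 trials to learn originally, 3 trials to relearn.
print(percentage_savings(6, 3))   # 50.0
# No saving at all: relearning takes as many trials as original learning.
print(percentage_savings(5, 5))   # 0.0
```

A higher score indicates that less relearning was needed, which is taken as evidence of greater retention.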


DISCUSSION


The results of this study offer no support
for attitudinal selective learning or retention.
None of the predictions based on the Levine-
Murphy hypothesis was supported by data
from measures of rate of learning, content of
learning, or retention. While the Statements
X Attitude interaction from the rate of learn-
ing data was significant, the differences in
pooled means were not in the direction pre-
dicted by Hypothesis 1. Indeed, the signifi-
cant interaction appears to arise from the
neutral subjects showing larger pro- and anti-
statement differences than those for either of
the other two groups. Why this difference has
occurred is not clear, and the authors can
offer no convincing explanation of the result.

The major outcome of the study is the
emergence of ego involvement in an issue and
its influence on selective learning, even when
attitude, per se, failed to have any effect. The
ego-involvement variable may offer some ex-
planation to account for varying results among
previous studies. It appears likely that in
categorizing subjects for or against a given
issue, investigators may well have failed to
control for the involvement variable, particu-
larly if it were unequally represented in the
pro- and antigroups. Furthermore, it seems
reasonable to suggest that temporal variables
will influence the mean levels of involvement
for two attitudinal groups on an issue. Anti-
segregation groups used by Brigham and Cook
(1969) may well have been more involved
than those used by Jones and Kohler (1958),
as this issue undoubtedly became increasingly
significant over the intervening years. The
results suggest that the conceptualization of
ego involvement adopted by the writers has
some heuristic merit and that it could well be
worth investigating in other related areas of
attitude research.

The failure to demonstrate any involvement
effect on savings scores as a measure of reten-
tion is interesting. It does illustrate the prob-
lem of deciding on which alternative retention
measures should be used in a study. It is
possible that savings scores picked up resid-
ual learning in the low involvement group,
which would have remained undetected by the
more demanding recall measure. Further, the
reported differential sensitivity of the two
measures (Mednick, 1964) appears to be
based on research using nonsense syllables
(Luh, 1922), and this makes generalization to
complex verbal stimuli necessarily tentative.
As Osgood (1953) suggests, one must con-
clude that a complex pattern of behavior is
involved, only part of which is tapped by any
single method of measuring retention.

Correlational analyses of rating data with 
two measures of learning lent no support to 
previously suggested explanations of selec- 
tive learning effects—novelty (Greenwald & 
Sakumura, 1967), familiarity (Waly & Cook, 
1966), and intensity of affect (Kanungo & 
Dutta, 1966). All correlations were low and 
indeed in no case were they even consistently 
positive or negative across statement cate- 
gories. 

The results of this study suggest that the 
original purposes of the investigation were 
largely met. The procedure enabled learning 
and retention effects to be separated. The 
problem of contradictory results emerging 
from the use of different measures of reten- 
tion was demonstrated. In addition, the results 
demonstrated that attitudinally related varia- 
bles, such as ego involvement, are of signifi- 
cance in this context. 

Once again the Levine-Murphy hypothesis 
has remained unconfirmed, but selective learn- 
ing and retention as a function of ego involve- 
ment have been shown to occur, in this case, 
using what amounted to a restricted-groups 
design. Subsequent researchers may wish to 
employ a design that permits cell size to vary 
with true differences in these variables within 
a population. Such studies could well examine 
under what conditions, if any, attitudinal 
factors lead to selective learning phenomena, 
when ego involvement is the critical factor, 
and under what conditions the variables inter-
act.


SOME EFFECTS OF ANXIETY LEVEL AND FOOD CUES
ON THE EATING BEHAVIOR OF OBESE AND NORMAL
SUBJECTS:

A COMPARISON OF THE SCHACHTERIAN AND PSYCHOSOMATIC
CONCEPTIONS 1

RALPH J. McKENNA 2
University of Connecticut


Two theoretical conceptions of obesity were examined experimentally. Food
preloaded subjects, believing themselves participants in a market research study 


revealed a significant interaction between weight status and anxiety level; 
Overweight subjects ate more under high than low anxiety, whereas normal 
weight subjects ate less under high- than low-anxiety conditions. This finding 


Several recent studies by Schachter and his 
associates (Goldman, Jaffa, & Schachter, 
1968; Nisbett, 1968; Nisbett & Kanouse, 
1969; Schachter, 1967, 1971; Schachter, Gold- 
man, & Gordon, 1968; Schachter & Gross, 
1968) have attempted to assess hypothesized 
differences in the relative strength of external 
and internal cues as determinants of eating be- 
havior in obese and normal weight individuals. 
Specifically, Schachter suggests that over-
weight persons pay particular attention to
external eating cues (appearance of food,
taste, habit, social stimuli) and tend to ignore
internal eating cues (gastric contractions,
blood sugar level). The eating behavior of
normals is thought to reflect a more balanced
consideration of these two sets of stimuli.

Similarly, Cabanac and Duclaux (1970) 
have shown that the taste sensation of a
sucrose solution changes from pleasant to
unpleasant for fasting normal weight subjects 
following a stomach loading of aqueous glu- 
cose, but remains pleasant for overweight 
individuals subject to the same manipulations. 
The authors interpret this result in terms of 
decreased sensitivity to internal food intake 
signals among the obese. 

A distinctly different formulation, the psy- 
chosomatic hypothesis, views nonnutritive eat- 
ing as a learned coping response associated 
with anxiety reduction, usually established in 
childhood (Kaplan & Kaplan, 1957). In later 
life, when personal and social difficulties asso- 
ciated with excess weight gain lead to in-
creasing anxiety, the overweight individual is
thought to resort to the behavioral coping 
response that has been established in the 
past—eating. Unfortunately, this mode of 
anxiety reduction leads into a cycle of greater 
obesity, more anxiety, and continued over- 
eating. 

Following the theoretical lines of both posi- 
tions, one arrives at instances of mutually 
exclusive implications. For example, given an 
obese person in a state of high anxiety, the 
psychosomatic hypothesis would predict eating 
if food—any food—were made available; 
Schachter’s (1967) formulation would pre-
dict eating only if external food cues were
positive—that is, food that was appetizing in 
appearance, smell, or taste. Similarly, given 
low anxiety and positive food cues, the obese 
should be likely to eat according to Schachter 
and likely not to eat (if not hungry) accord- 
ing to the psychosomatic hypothesis. 

Schachter et al. (1968) attempted to assess
the effects of fear and food satiation on the
eating behavior of obese and normal weight 
individuals. Both food preloading and high 
fear greatly decreased the number of snack 
crackers eaten by normals; however, the same 
manipulations had little effect on the eating 
behavior of obese subjects. Since the obese 
did not eat less when preloaded nor when 
presumably experiencing the ordinarily appe- 
tite-inhibiting physiological consequences of 
fear (i.e., reduced gastric motility, hyper- 
glycemia; Cannon, 1929), the authors’ inter- 
pretation was that the obese tend to ignore 
relevant internal cues that control eating in 
normals. Nonetheless, the absence of a sta- 
tistically significant effect, per se, is not usu- 
ally considered strong support for an experi-
mental hypothesis.

Anxiety measures in the Schachter et al.
(1968) study showed a slight decrease for
both normal and obese subjects following food 
consumption, which the authors attributed to 
eating. However, the fact that there were no 
differences of consequence between obese and 
normal subjects does not refute the logic of the 
psychosomatic approach. What may be indi- 
cated is that (a) some aspect of eating does 
have fear-reducing qualities for all individuals 
(although food consumption may ordinarily 
be a chosen mode of fear reduction only for 
the obese), or (b) anxiety levels tend to drop 
with the passage of time following a fear ma- 
nipulation (no noneating controls were in- 
cluded in this study). 

The essential factor that distinguishes the 
two formulations is their relative emphasis on 
internal instigation (psychosomatic) and ex- 
ternal instigation (Schachterian) to eating. 
Schachter et al. (1968) manipulated two op- 
erationally distinguishable states of internal 
instigation (hunger and anxiety). What is 
proposed is a comparison of the effects of an 
external instigator with such an internal insti-
gator as they employed. In the present inves-
tigation, the external instigation was
manipulated by varying the [...]
taste of the test food (cookies) [...]. In-
ternal instigation was varied [...]
either high or low anxiety in [...]
obese and normal weight subjects. [...] sub-
jects were systematically food preloaded prior
to testing in order to minimize [...]
variance in subsequent eating behavior.


Experimental Hypotheses 


Normal weight subjects should
eat very little in all conditions, since
stomach feedback should suggest they are not
hungry. No differential theoretical predictions
are made regarding the eating behavior of pre-
loaded normals, their primary function being
to provide a base line for assessing the effects
of anxiety and food-cue valence on eating 
patterns of the obese. 

The difference between the amount of
food eaten by overweight and normal weight
subjects should depend on whether external
cues (Schachterian) or internal cues (psycho-
somatic) exert primary control over eating
among the obese. The Schachterian formula-
tion would predict differences in food con-
sumption to be greater in the attractive than
the unattractive test food condition. This
Weight × Food-Cue Valence interaction pre-
diction is based on the idea that the eating
behavior of the obese is controlled [...]
by the apparent quality of the [...].
The psychosomatic formulation would predict
a greater consumption difference in the high-
than in the low-anxiety condition. This
Weight × Anxiety interaction expectation is
predicated on the idea that eating among the
obese is controlled by certain anxiety-related
internal states in addition to actual hunger.

Finally, the inclusion of noneating control 
subjects permits a test of the psychosomatic-
related hypothesis that eating is effective as a
means of reducing anxiety. The data relevant 
to this hypothesis are intended to link the 
function of eating among the obese more di- 
rectly to anxiety reduction than has been 
characteristic of earlier research by permitting 
the separation of anxiety reduction due to the 
passage of time from anxiety reduction di-
rectly attributable to eating.

METHOD
Overview 


Subjects were led to believe that they were par-
ticipating in marketing research being conducted in
conjunction with a national research firm. During
the initial stage of the experiment, the subjects were
preloaded with a variety of foods and then sub-
jected to an anxiety manipulation. Experimental
subjects next tasted and rated a preweighed sample
of the test food, either attractive (homemade choco- 
late chip) or unattractive (greenish-gray shortbread) 
cookies. Control subjects rated the cookies on ap- 
pearance, but they were not permitted to taste them. 
The major dependent variables were (a) amount 
eaten during a 15-minute rating session and (b) 
subject self-reports of anxiety level. 


Experimental Design and Subjects


Three independent variables were manipulated in
a 2 × 2 × 2 + 2 incomplete factorial design (weight,
food-cue valence, and anxiety level). The subjects
(male introductory psychology students) were classi- 
fied as obese or normal on the basis of statistical 
norms published by the Metropolitan Life Insurance 
Company (1959), utilizing the customary obesity 
criterion—a deviation of at least 15% above the 
normal weight for a given height. Athletes partici- 
pating in body-contact sports were excluded, as these 
individuals tend to be overweight due to heavy bone 
structure and muscle development, rather than the 
accumulation of excess adipose tissue that typifies 
obesity. Two levels of external food cues were pro- 
vided, a high-valence condition, in which the test 
food was both tasty and appetizing in appearance, 
and a low-valence condition, in which the food was 
bland and visually unappealing. Two levels of anxi- 
ety were created: one in which an attempt was made 
to deliberately maximize anxiety and a second level 
in which a minimum of anxiety was sought. Half 
of the experimental subjects rated their anxiety 
level prior to eating the test food, and the other half 
rated their anxiety level following test food con- 
sumption. Finally, two high-anxiety groups, one 
obese and one normal, provided ratings of the food 
products but were not permitted to eat, thus allow-
ing assessment of the effect of mere passage of time
on changes in expressed anxiety.
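
The obesity criterion described above can be sketched in a few lines. This is only an illustration: the function names are hypothetical, and the normative weight for a given height would have to be looked up in the Metropolitan Life Insurance Company (1959) norms, which are not reproduced here, so the example weights are invented.

```python
def percent_weight_deviation(actual_lb, norm_lb):
    """Percentage by which actual weight deviates from the normative weight."""
    # Multiply before dividing to keep round-number cases exact in floating point.
    return (actual_lb - norm_lb) * 100.0 / norm_lb


def classify(actual_lb, norm_lb, cutoff_pct=15.0):
    """Label a subject 'obese' at or above the cutoff, otherwise 'normal'."""
    deviation = percent_weight_deviation(actual_lb, norm_lb)
    return "obese" if deviation >= cutoff_pct else "normal"


# Hypothetical subjects, both with an assumed normative weight of 160 lb.
heavy = classify(200.0, 160.0)   # 25% above the norm
light = classify(165.0, 160.0)   # about 3% above the norm
```

Under this rule any subject below the 15% cutoff, including an underweight one, falls in the normal group, which matches the two-way classification used in the design.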


Procedure 


Subjects were selected for participation on the 
basis of relative obesity, as determined from self- 
reports of height and weight obtained earlier in the 
semester. Subjects were scheduled in groups of four, 
two obese and two normals, to distribute experi- 
menter-bias effects (Rosenthal, 1963) and demand- 
characteristic effects (Orne, 1962) that might occur 
across both weight conditions. Upon arrival, the
subjects were seated behind a long table, partitioned 
in such a manner as to prohibit communication. 

Introductory remarks. The experimenter explained 
that the experiment would deal with psychological as- [...]
task would be to rate a food which was to be test
marketed in about 6 months. 

Compensating for individual taste sensitivity is 
accomplished, the experimenter explained, by having 
subjects taste and rate four standardized foods 
for sweetness, saltiness, sourness, and bitterness. 
Comparison of subject ratings with “ideal ratings,” 
he continued, provides an adjustment factor, allow- 
ing for direct comparison between subjects with 
different taste sensitivities. 

Preloading. In accord with this rationale, the ex- 
perimenter commenced with “taste sensitization” 
trials. After rating initial hunger, each subject con- 
sumed and rated four foods: two caramels (sweet- 
ness), a cup of hot bouillon soup (saltiness), two 
dried apricots (sourness), and approximately an 
ounce of quinine water (bitterness). The “caramels”
were Ayds weight-reducing candies, which, when
consumed with a hot liquid (e.g., bouillon soup),
produce an increase in blood sugar level and sup-
posedly reduce feelings of hunger markedly in ap-
proximately 10-15 minutes (Campana Corporation,
undated).

The experimenter next explained the necessity of 
simulating real-world phenomena in laboratory ex-
perimentation. Noting that the product to be rated
was a cookie, typically consumed following a meal, he
announced that the subjects would therefore be given 
the laboratory equivalent of a meal. Producing fresh 
roast beef sandwiches, the experimenter gave two to 
each subject, stating “Don’t be bashful, we've got 
plenty of them. Eat as much as you want; eat ’til 
you’re full.” He then distributed a biographical ques-
tionnaire, which the subjects completed as they ate.
The experimenter removed himself to a desk chair sit-
uated in such a position that he was not visible to
any subject whenever the subjects were consuming
food, to minimize any social pressure that he might
place on eating behavior. On two occasions during
the subsequent 12-15 minutes, the experimenter 
strongly encouraged the subjects to have another 
sandwich or half of a sandwich.

Anxiety manipulation. In the low-anxiety condi-
tion, the subjects were made to feel as comfortable
and relaxed as possible. The experimenter noted that 
the experiment had only recently begun, was still in 
the “pilot phase,” 
would be helpful “in a very general sort of way,” 




picking them up, he tossed them into a wastebasket 
and continued with his preparations. He next opened 
four paper packets containing hypodermic needles and 
removed the protective cap from each needle. After 
holding each hypodermic at eye level and testing 
the plunger, he carefully placed them in a phalanx 
arrangement on a sterile white cloth.

The experimenter then explained that it is also im- 
portant to know how subjects’ bodies react physio- 
logically to the food that they routinely consume. 
To this end, he continued, Dr. Link from the in- 
firmary had agreed to assist in the experiment, and 
would be over later to take “some direct physio- 
logical measurements”—(a) a rather extensive blood 
sample, (b) a urine specimen, and (c) a rectal stool 
sample. 

The experimenter then noted that he was required 
to ask three questions before the experiment could 
continue. The questions were: (a) Any history of 
fainting? (b) Any heart trouble? (c) Any history 
of convulsions? If a subject replied positively, the 
experimenter said, “Well, we'll go on. If things
get too rough for you, just let me know—okay?” 
Finally, he took a 15-second pulse reading for each 
subject, commenting that “a couple of the readings 
are pretty high.”

Pre-only anxiety measure. For half of the subjects 
(all subjects in any given session), the experimenter 
noted that many factors can subjectively affect 
ratings of taste, and that the project reaction inven- 
tory is designed to control for some of these factors. 
Embedded in this 10-item checklist were several 
critical scales that functioned both as dependent 
measures and checks on the anxiety and preloading 
manipulations. The subjects in the post-only anxiety 
measure sessions received a filler form at this point 
requesting information on previous participation in 
experiments and favorite foods. 

Food-cue valence manipulation. The experimenter 
noted the importance of controlling for the effect of 
irrelevant cues on the subjects’ taste ratings. The 
subjects were told that this involved elimination of 
the final marketing steps of stamping and design for 
cookies, and packaging the product in wrapperless 
boxes (a rationale for the appearance of homemade 
cookies). 

The high-valence product was a homemade choco- 
late chip cookie, judged extremely appetizing and 
tasty in pilot testing. The low-valence product was 
a greenish-gray Scotch shortbread cookie, in which 
the recipe’s sugar content had been halved. The 
resulting product was bland, practically tasteless, dry, 
and quite crumbly. Twenty cookies in plastic divider 
trays were packed in each box and sealed in 
plastic food-wrap bags, the final appearance closely 
resembling commercial packaging. 

In order to make statistically meaningful compari- 
sons between different products, the experimenter 
explained, it was necessary to standardize the length 
of time allotted for rating. Thus, all subjects would 
be given a full 15 minutes, even though they might 
find it possible to complete their taste ratings in 
less time.

At this point, the experimenter distributed a closed
box of cookies and a form entitled “product rating 
scale” to each subject, refilled the subjects’ water 
cups, and read the following “tasting instructions”: 


You may taste as many or as few of the product 
as you desire in making your ratings. As only 
unopened samples are used in the experiment, the 
sample is contaminated once you have broken the 
seal, and the remaining contents will simply be 
disposed of following the experiment. 

This is the concluding taste trial for today, 
although there are other phases of the experiment 
still to be covered.


The experimenter then set the timer, told the sub- 
jects to begin tasting and rating, and retired to 
the desk chair. About 8 minutes later, he arose, 
offered water refills, and returned to his seat. 

Control subjects were told that their function was 
to aid in assessing the possible effect of visual cues 
on taste ratings. Therefore, they could see and 
handle the cookies (they were given boxes containing 
samples of both kinds), but they were not allowed 
to taste the products. They were allotted the same 
rating time, but they completed a modified rating 
scale. 

Post-only anxiety measure. Following the test food 
period, the subjects completed the anxiety measure 
or the filler form, whichever the given group had
not previously received. 

Physiological measures, suspicion probe, and de-
briefing. The experimenter noted, in the low-anxiety
condition, that the next phase of the experiment 
would involve some simple physiological measure- 
ments. In the high-anxiety condition, the experi- 
menter observed that the fictitious Dr. Link had not 
arrived, but that he (the experimenter) would pro- 
ceed with the physiological measures that he was 
capable of administering. Subjects in all conditions 
were then weighed and measured. 

Following an extensive written and oral suspicion 
probe, the experimenter gave a thorough and essen- 
tially complete explanation of the experiment to 
all subjects. Only direct references to obesity were 
omitted in the debriefing to avoid potential embar-
rassment to overweight subjects.


RESULTS 
Subject Demographics 


Subjects ranged from 92.56% overweight to 
11.93% underweight; however, the mean 
weight for normal subjects was quite close to 
zero (+.8%), and the mean weight for obese 
subjects (27.4%) was considerably above the 
15% cutoff point. Cell ns were equalized for 
analysis by retaining the most obese and most 
normal subjects in percentage of weight de- 
viation. Subject demographic data are sum- 
marized more fully in Table 1. 
TABLE 1

SUBJECT DEMOGRAPHIC INFORMATION

                              Mean height      Mean weight      Mean weight deviation   Mean age
                              in inches        in pounds        in percentage           in years
Condition                     Obese   Normal   Obese   Normal   Obese    Normal         Obese   Normal
Low anxiety-high valence      70.97   70.48    203.6   166.3    26.35     2.06          20.9    19.7
Low anxiety-low valence       71.60   71.32    200.0   167.6    29.80     3.04          21.6    19.6
High anxiety-high valence     70.22   70.93    208.3   161.5    31.91      .40          19.7    20.9
High anxiety-low valence      70.21   71.70    205.6   164.6    29.80      .19          20.0    19.9
High anxiety-control          70.33   70.98    189.1   159.6    19.21    -1.61          19.6    19.8
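
As a consistency check, the group means quoted in the text (27.4% for obese and +.8% for normal subjects) can be recovered from the weight-deviation column of Table 1. The sketch below assumes equal cell sizes, which the text says were imposed for analysis, and takes the two smallest normal-group deviations as .40 and .19 (leading decimal points assumed); the variable names are illustrative only.

```python
# Mean percentage weight deviation per condition, read from Table 1.
obese_cells = [26.35, 29.80, 31.91, 29.80, 19.21]
normal_cells = [2.06, 3.04, 0.40, 0.19, -1.61]


def grand_mean(cell_means):
    """Unweighted mean of cell means; equals the subject mean when cells are equal in size."""
    return sum(cell_means) / len(cell_means)


obese_mean = grand_mean(obese_cells)
normal_mean = grand_mean(normal_cells)

print(f"obese: {obese_mean:.1f}%  normal: {normal_mean:+.1f}%")
```

Both values round to the figures reported under Subject Demographics, which supports the decimal placement assumed for the two small entries.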


Independent Variable Manipulations 


A correlated t test of mean hunger before
preloading (3.44 on a 7-point scale) versus
mean hunger following preloading (2.04) was
significant at p < .001. Similarly, analyses
performed on (a) food-cue valence ratings
(appetizing appearance, taste excellence, in-
tent to purchase product) and (b) anxiety
self-reports (mean combined rating: calm-
anxious, relaxed-nervous, comfortable-uncom-
fortable) were also significant well beyond
p < .001.


Amount Eaten 


Cookie consumption was determined by 
subtracting the weight of the uneaten cookies 
plus box following the experiment from the 
weight of the filled box prior to the session. 
Cell means are presented in Table 2, and the 
analysis of variance of amount eaten is pre- 
sented in Table 3. 

The most important theoretical comparison 
involves the relative strength of the Weight ×
Anxiety versus the Weight × Food-Cue Va-
lence interaction, as it is here that the
Schachterian and psychosomatic predictions 
of eating behavior are in clearest contrast. 


TABLE 2

EFFECTS OF ANXIETY AND CUE VALENCE
ON FOOD CONSUMPTION

Food-cue       Low anxiety        High anxiety
valence        Obese   Normal     Obese   Normal
High           72.2    74.8       96.3    45.9
Low            39.7    35.3       45.4    34.8


Note.—Analyses of amount eaten are reported in grams. 


The nature of the obtained interactions is 
presented graphically in Figure 1. The 
Weight × Anxiety interaction is significant
(F = 5.034, p < .05). Cell means indicate
that the obese ate considerably more under 
high anxiety than low anxiety, as predicted 
by the psychosomatic hypothesis, and that 
this pattern is more pronounced for the high-
valence food. The Weight × Anxiety interac-
tion also reflects the marked tendency of
normals to eat less high-valence food under 
high anxiety. 

The Weight × Food-Cue Valence interac-
tion, representing Schachterian predictions,
failed to attain significance. Normals actually
out-ate obese subjects in the high-valence,
low-anxiety condition, contrary to expecta-
tions based on the external cue conception. 
Furthermore, the obese ate more of the low- 
valence food than did normals, and they 
increased their consumption under high anxi- 
ety, both trends contrary to Schachterian 
theorizing.


TABLE 3

ANALYSIS OF VARIANCE

Source                    SS      df    MS       F
Weight (A)                4929     1    4929     5.663**
Anxiety (B)                 .2     1      .2     <1
Food-cue valence (C)     22445     1   22445    25.787***
A × B                     4382     1    4382     5.034**
A × C                     1346     1    1346     1.546
B × C                      125     1     125     <1
A × B × C                 2737     1    2737     3.145*
Within error             62668    72   870.4
Total                    98632    79

* p < .10.
** p < .05.
*** p < .001.
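
The sums of squares in Table 3 can be approximately reproduced from the cell means in Table 2, since every effect in a 2 × 2 × 2 design is a single-degree-of-freedom contrast. The sketch below is a hedged reconstruction rather than the authors' computation: it assumes 10 subjects per cell (inferred from the total of 79 degrees of freedom) and takes the obese low-anxiety, high-valence mean as 72.2 g and the obese high-anxiety, low-valence mean as 45.4 g, readings chosen because they agree with the reported analysis. Small discrepancies are expected because the cell means are rounded.

```python
# Cell means in grams (Table 2), keyed by (weight, anxiety, valence).
CELL_MEANS = {
    ("obese", "low", "high"): 72.2,
    ("normal", "low", "high"): 74.8,
    ("obese", "high", "high"): 96.3,
    ("normal", "high", "high"): 45.9,
    ("obese", "low", "low"): 39.7,
    ("normal", "low", "low"): 35.3,
    ("obese", "high", "low"): 45.4,
    ("normal", "high", "low"): 34.8,
}
N_PER_CELL = 10     # assumption: 80 subjects spread evenly over 8 cells
MS_WITHIN = 870.4   # error mean square reported in Table 3

# Effect coding: +1 and -1 for the two levels of each factor.
CODES = {"obese": 1, "normal": -1, "high": 1, "low": -1}

EFFECTS = {
    "A": (True, False, False),
    "B": (False, True, False),
    "C": (False, False, True),
    "AxB": (True, True, False),
    "AxC": (True, False, True),
    "BxC": (False, True, True),
    "AxBxC": (True, True, True),
}


def effect_ss(use_weight, use_anxiety, use_valence):
    """Sum of squares for a 1-df effect, computed as a contrast on the cell means."""
    contrast = 0.0
    for (weight, anxiety, valence), mean in CELL_MEANS.items():
        coef = 1
        if use_weight:
            coef *= CODES[weight]
        if use_anxiety:
            coef *= CODES[anxiety]
        if use_valence:
            coef *= CODES[valence]
        contrast += coef * mean
    # Each coefficient is +1 or -1, so the sum of squared coefficients is 8.
    return N_PER_CELL * contrast ** 2 / len(CELL_MEANS)


def simple_effect_f(mean_1, mean_2):
    """F ratio for a comparison of two cell means against the pooled error term."""
    return (N_PER_CELL / 2) * (mean_1 - mean_2) ** 2 / MS_WITHIN


ss = {name: effect_ss(*flags) for name, flags in EFFECTS.items()}
f_obese_high_valence = simple_effect_f(96.3, 72.2)

for name, value in ss.items():
    print(f"{name:6s} SS = {value:9.1f}  F = {value / MS_WITHIN:7.3f}")
```

The same error term also gives an F close to the 3.336 reported for the obese high-anxiety versus low-anxiety comparison on the high-valence food.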
Fig. 1. Obtained interaction curves showing (a)
the Weight × Anxiety interaction (collapsing val-
ence) and (b) the Weight × Food-Cue Valence in-
teraction (collapsing anxiety).


Individual comparisons within the Weight
× Anxiety interaction indicated significant
contributions by both normal and obese sub-
jects. The comparison most similar to real-life
situations, amount of high-valence food con-
sumed by obese subjects under high versus
low anxiety, yielded an F of 3.336
(.05 < p < .10). A similar comparison for
normals is significant at p < .05. Several
internal analyses (e.g., raising the obesity cri- 
terion, eliminating subjects not responsive 
to the independent variable manipulations) 
served only to reaffirm and strengthen the 
basic finding. 


TABLE 4
ANXIETY REDUCTION AS A FUNCTION OF FOOD CONSUMPTION

Anxiety Reduction


The anxiety-reduction data, presented in 
Table 4, appear to offer little support for the 
supposition that eating reduces anxiety. No 
trend is apparent in the means for normals, 
and there is only a slight, nonsignificant 
tendency for anxiety to be lower among obese 
subjects following food consumption. Ironi-
cally, the largest mean anxiety reduction
occurred among the noneating obese controls
(t = 1.591, ns). Partial correlation coefficients
indicated that this failure to obtain a more
substantial anxiety decrease is not a function
of the combination of individual scales into
a single anxiety index. In retrospect, the
possibility that this pattern of results may
reflect a less than optimum timing of the
post-only anxiety measure must be considered.


DISCUSSION 


While the major findings of the present 
investigation appear to provide more support
for the psychosomatic view than recent experi-
mental research by Schachter and his col-
leagues would suggest, certain interpretive dif-
ficulties are created by the simplistic nature
of both conceptions. While every effort was 
made to minimize any internal hunger cues 
by use of an elaborate preloading procedure, 
neither theory specifies the effect of non- 
hunger arousal cues as determinants of eating 
among normal weight individuals, other than 
to suggest that physiological correlates of 
anxiety may also serve to inhibit hunger 
(Schachter et al., 1968). If in fact there are
systematic effects of arousal cues on the eating 
behavior of preloaded normals (conceivably 
different from any comparable effects among
the obese), differences between amounts eaten
by overweight and normal weight subjects
would be less easily attributed to systematic
differences in anxiety or food-cue responsive-
ness among the obese. The unexpected pattern
of eating by normals, and its contribution
to the Weight × Anxiety interaction, is the
primary interpretive case in point.

                              Obese                        Normal
Condition                     n   Before   n   After       n   Before   n   After
Low anxiety-high valence      5   2.27     8   2.83        7   2.91     4   2.04
Low anxiety-low valence       7   2.57     5   2.13        6   1.89     8   3.27
High anxiety-high valence     4   4.33     6   3.78        8   4.21     6   3.94
High anxiety-low valence      5   3.87     7   3.76        8   4.13     6   4.42
High anxiety-control          4   4.25     5   3.33        8   4.00     8   3.42

a Represents mean combined anxiety index, based on a 7-point scale, of self-ratings on three relevant dimensions: calm-anxious,
relaxed-nervous, and comfortable-uncomfortable.

Given the eating behavior of the base- 
establishing normals, the consumption data of 
the obese subjects do not support a differen- 
tial sensitivity to external eating signals. Both 
weight groups demonstrated a strong, essen- 
tially similar, tendency to eat as a function of 
food-cue valence. Schachterian predictions of 
(a) an obese-normal difference in consump-
tion of high-valence food under low anxiety,
(b) no difference in obese consumption as a
function of anxiety level, and (c) that, under
low anxiety, the obese should eat less of the 
low-valence food than normals, were not 
substantiated. 


Anxiety Reduction


Although anxiety reduction for the obese 
subjects was in the direction predicted by the
psychosomatic hypothesis in both high-anxiety 
cells, no significant reduction was demon- 
strated in the before versus after ratings 
(t = .739, ns). However, there are at least
two ways in which eating might serve to reduce 
anxiety: (a) by temporarily removing atten- 
tion from the anxiety stimulus by focusing 
attention on the problems of eating, that is, 
observing, biting, chewing the food, or (b) by 
providing a more lasting sense of relaxation
following the consumption of food.

In the present experiment, there was a
frequent time lapse of 12-17 minutes between 
cessation of eating and the postmeasure of 
anxiety (many subjects utilized only the first 
few minutes of the rating session for food 
consumption). Of the two ways outlined in 
which eating might reduce anxiety, the pro- 
cedure in this experiment would be sensitive 
only to the second. Hence, despite the failure 
to observe a measured decrease in anxiety, 
there may actually have been a temporary
reduction during the time that the subjects
were eating, followed by a revivification of
anxiety when food consumption ceased and
thoughts turned again to the impending 
physiological measurements. A test of this 
supposition would necessitate measuring anxi- 
ety levels while subjects were in the process 
of eating, but the data for such a test are 
not available in this experiment.

It thus seems conceivable that earlier no- 
tions regarding food consumption and anxiety 
reduction may have been oversimplistic in not 
specifying (a) when during the time pre- 
ceding, during, and following the act of eat- 
ing anxiety level changes and (b) what 
specific correlates of food consumption, for 
example, attention distraction via the me- 
chanics of eating or the more lasting satisfac- 
tion of a full stomach, are effective in reducing 
anxiety.


Eating Patterns and Attitudes 


The average obese subject tended to have 
gone significantly longer without a meal than 
his normal weight counterpart (p < .02).
Since obese and normal subjects were run in
equal numbers in each experimental session,
this result cannot be an artifact of sched-
uling. An obvious hypothesis, that the obese
eat more often between meals, was discounted 
by the questionnaire results which indicated 
that these obese subjects ate nonsignificantly 
less often between meals. 

Overweight subjects reported skipping 
breakfast (p < .05) and lunch (p < .05)
more often than normals, the former relation- 
ship replicating a Schachter and Gross (1968) 
finding. Both weight groups indicated that 
most of their between-meal eating occurs in 
the evening. Thus, if the obese skip breakfast 
and lunch more often than normals, and if 
they fail to eat more frequently between 
meals, one must logically conclude that (a) 
they tend to consume much more at the 
evening meal, and/or (b) they tend to make 
their evening snacks more substantial and fre- 
quent. This pieced-together pattern of obese 
eating behavior is strongly reminiscent of the 
“night eating syndrome,” characterized by 
morning anorexia and evening hyperphagia,
reported among obese patients by Stunkard
(1959).

If eating is a potential means of anxiety
reduction, one would expect this mode of
behavior to be employed more frequently by 
firstborns, since firstborns have been shown 
to be more susceptible to feelings of anxiety 
than later borns (Schachter, 1959). One 
would therefore logically predict a greater 
percentage of firstborns among the obese than 
among normal weight subjects. A marginally 
significant relationship (χ² = 2.748, p < .10)
was found which seems best interpreted as 
tentative additional support for the anxiety- 
reduction hypothesis. 


Theoretical Implications 


The obese increased their food consumption
under high anxiety primarily in response to 
the high-valence food, indicating a sensitivity 
to both anxiety level and external cues (the 
latter effect obscured by the behavior of the 
normal weight subjects). This tendency may 
offer grounds for a consolidation of theoretical 
positions; that is, the obese eat in response to 
anxiety, but they are attentive to external 
cues and will not merely consume anything
that can be labeled “food.”

If one were to view anxiety as the sole 
determinant of overeating, it follows that one
must also postulate periods of anxiety fre-
quent enough to maintain obesity. This semi-
chronic anxiety seems more typical of clinical
cases than most subjects in this study. The 
husky, good-natured, barrel-chested, bearlike 
individual who frequently qualified as obese 
statistically may not be on the same behav-
ioral continuum as the prototypical 5 foot
8 inch, 350-pound institutionalized patient de-
scribed in psychosomatic case studies.

This raises another theoretical problem for 
the psychosomatic hypothesis. Given the 
assumption that the obese reduce anxiety 
through eating (for argumentative purposes), 
is this tendency to eat in response to anxiety
uniform for all levels of obesity? The psycho- 
somatic formulation makes no provisions for 
differences in eating responsivity as a function 
of degree of obesity. 

An extension of Schachterian theorizing, 
however, does make such a provision within 
the framework of the external cue model. 
Nisbett (1968) has proposed that external
cue sensitivity is a direct function of degree
of obesity. 

Thus, in attempting to differentiate the 
relationship between degree of obesity and 
eating behavior, two hypotheses appear plau-
sible: (a) tendency to eat in response to
anxiety increases as a direct function of obe-
sity level (psychosomatic extension), and (b)
tendency to eat in response to external food-
relevant cues increases as a direct function of
obesity level (Schachterian extension). 

If eating in response to anxiety is a direct 
function of obesity level (psychosomatic ex- 
tension), the amount eaten by obese subjects 
under high anxiety should correlate with the 
deviation from ideal weight. The
relevant correlations are ρ = .59, p < .10, for
the high-valence food, and ρ = .29, ns, for the
low-valence food, supporting the hypothesis
that as the degree of obesity increases, the
tendency to eat in response to anxiety also 
increases. 
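
As a rough check on the significance levels quoted for these two coefficients, a correlation can be converted to a t statistic with n − 2 degrees of freedom. The sketch below rests on assumptions not stated in this passage: it takes n = 10 obese subjects per cell (inferred from the 72 error degrees of freedom in Table 3) and applies the usual Pearson-type approximation, which is only approximate for a rank correlation in so small a sample. The critical values are the standard two-tailed t values for 8 degrees of freedom.

```python
import math

N_ASSUMED = 10        # assumption: obese subjects in one high-anxiety cell
T_CRIT_10 = 1.860     # two-tailed critical t, df = 8, alpha = .10
T_CRIT_05 = 2.306     # two-tailed critical t, df = 8, alpha = .05


def t_from_correlation(r, n):
    """t statistic for testing a correlation against zero, with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


t_high_valence = t_from_correlation(0.59, N_ASSUMED)
t_low_valence = t_from_correlation(0.29, N_ASSUMED)

print(f"high valence: t = {t_high_valence:.2f}")
print(f"low valence:  t = {t_low_valence:.2f}")
```

Under these assumptions the first coefficient falls between the .10 and .05 critical values and the second falls below both, in line with the levels reported in the text.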

The extension of the Schachterian formula-
tion may be examined by correlating weight
deviation with food consumption under low-
anxiety conditions (eliminating the effect of
anxiety as an eating determinant) for obese
subjects. These correlations are [...] for
the high-valence food, and [...] for the
low-valence food. While both correlations are
nonsignificant, they are directionally suppor-
tive in that sensitivity to food-cue valence
seems to increase as a function of degree
of obesity.

These correlations are, of course, only sug-
gestive. Nevertheless, a more [...] model
combining extensions of the theoretical
approaches is suggested. A functional state-
ment of this model might be:


Overeating = f [obesity level (felt anxiety + food-cue valence)].


This model, applicable to all degrees of obe- 
sity, suggests that the greater the deviation 
from normal weight, the greater an individual's
sensitivity to both food-cue valence and anxi-
ety as determinants of overeating. This
formulation does not, of course, suggest that
overeating is the result of obesity (quite
the opposite is true); rather, it attempts to
specify the extent to which obese individuals,
within the wide range from “husky” to
institutionalized, differ in susceptibility to anxi-
ety and external eating cues as codeterminants
of overeating. Although this theoretical merger
is quite tentative, it seems more parsimonious,
in view of the data available, to consider the
plausibility of a rapprochement based on
differences in behavior as a function of the 
degree of obesity.

TASTE SENSITIVITY AND SUSCEPTIBILITY TO EXTERNAL
INFLUENCE IN OBESE AND NORMAL WEIGHT SUBJECTS 1


JOEL GRINKER,2 JULES HIRSCH, AND DAVID V. SMITH 3
Rockefeller University 


Using a criterion-free method (two-alternative forced choice), no differences
were found in detectability for sucrose between obese and normal weight sub-
jects or among obese subjects differing in age at onset of obesity. The decision
criterion for sucrose (measured by a yes-no procedure) of both obese and
normal weight subjects was uninfluenced by the addition of a tasteless, red
vegetable dye. However, in another experiment, using a magnitude estimation
procedure, all subjects rated red suprathreshold solutions of sucrose more sweet
than clear solutions. There were no differences in the psychophysical magnitude
functions between the groups. Obese subjects were not more susceptible to the
biasing influence of color than were normal weight subjects. These findings are
discussed in relation to current theories of obesity.


Several lines of experimental investigation 
point to differences in taste preference be- 
tween overweight and normal weight animals 
(Fuller & Cooper, 1967; Fuller & Jacoby, 
1955). The increased taste responsiveness of 
ventromedially lesioned rats has been con- 
sistently reported (Corbit & Stellar, 1964; 
Kennedy, 1950; Teitelbaum, 1955). Hyper- 
phagic animals generally eat proportionately 
more sweet-tasting substances and eat consid- 
erably less when these substances are adul- 
terated with bitter-tasting quinine. Similarly, 
obese humans have been reported to be more 
responsive in eating behavior to taste cues 
than normal weight subjects (Decke, 1971; 
Schachter, 1968, 1971). For example, over- 
weight subjects eat more of an ice cream that 
they rate as good tasting than do normal or 
underweight subjects (Nisbett, 1968). Recent 
studies have restricted the range of external 
cues producing greater responsiveness in obese 
individuals to specific food-related perceptual 
cues (e.g., taste or visibility; Nisbett &
Storms, in press). It is possible that the dif- 


1 This research was supported in part by National
Institutes of Health Research Grant HD-AMO03719,
Grant (RR-102) from the General Clinical Research
Centers Program of the Division of Research Re-
sources, and National Institutes of Health and Nu-
trition Foundation Grant 423. The authors gratefully
acknowledge the experimental assistance of Gail
Viseltear.

2 Requests for reprints should be sent to Joel
Grinker, Rockefeller University, 66th Street and York
Avenue, New York, New York 10021.

3 Now at the University of Wyoming.

ferent eating behavior of obese animals may be
a function of changed taste parameters, such 
as preference or sensitivity, or it may be that 
taste parameters are not different in obese
animals and that measures of food consump-
tion reflect the influence of other factors.
A series of experiments using appropriate
psychophysical procedures were designed to
allow a comparison of both taste sensitivity
and susceptibility to influence by external 
factors in obese and normal weight subjects. 
Linker, Moore, and Galanter (1964) and 
Moore, Linker, and Purcell (1965) have dem- 
onstrated the effectiveness of the application 
of a signal-detection paradigm to the study of 
taste thresholds. Using this paradigm, it is 


possible to obtain independent estimates of 


sensitivity and decision criterion (see Green £4 
ae 


& Swets, 1966). The response of a subject in 
a classical psychophysical task is determined 
by both his sensory ability and his willingness 
to respond when he is uncertain of the presence
of a stimulus. The decision criterion can
have a large effect on his measured sensory
threshold. If the subject is very conservative
in his criterion, the measured threshold will
appear higher, while if the subject is very 
liberal in his response criterion, he will appear 
to be more sensitive. By measuring both the 
probability of the subject’s reporting “yes” to 
the presence of a signal (hit) and his proba- 
bility of saying “yes” to no signal (false 
alarm), it is possible to obtain a measure of
sensitivity that is free of the influences of
possible response biases. 
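
The separation of sensitivity from response bias described above can be illustrated with a minimal sketch. It assumes the equal-variance Gaussian model commonly used with yes-no data; the function name, the criterion formula, and the two example observers are illustrative assumptions and are not taken from the present experiments.

```python
from statistics import NormalDist


def sdt_indices(hit_rate, false_alarm_rate):
    """Return (d_prime, criterion) for a yes-no task.

    Assumes the equal-variance Gaussian model. Both rates must lie
    strictly between 0 and 1, otherwise inv_cdf raises an error.
    """
    z = NormalDist().inv_cdf            # probability -> normal curve deviate
    z_hit = z(hit_rate)
    z_fa = z(false_alarm_rate)
    d_prime = z_hit - z_fa              # sensitivity
    criterion = -0.5 * (z_hit + z_fa)   # 0 = no bias, negative = liberal
    return d_prime, criterion


# Two hypothetical observers with identical sensitivity but different
# willingness to say "yes" (rates are constructed, not observed data).
norm = NormalDist()
neutral_observer = (norm.cdf(0.5), norm.cdf(-0.5))   # (hit, false alarm)
liberal_observer = (norm.cdf(1.5), norm.cdf(0.5))

neutral = sdt_indices(*neutral_observer)
liberal = sdt_indices(*liberal_observer)
print("neutral observer: d' = %.2f, criterion = %.2f" % neutral)
print("liberal observer: d' = %.2f, criterion = %.2f" % liberal)
```

In this sketch the liberal observer has a much higher hit rate, and so would appear more sensitive on a classical threshold measure, yet both observers have the same d'; only the criterion differs.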

The first two experiments studied the two 
components of the subject’s response to the 
psychophysical procedure, one contingent on 
sensory input and neurophysiological response
and the other contingent on attitudinal and 
motivational factors. The first experiment was 
designed to estimate the taste sensitivity for 
sucrose in obese and normal weight subjects 
using a criterion-free psychophysical pro- 
cedure (two-alternative forced choice). The 
second experiment was designed to measure 
shifts in the decision criteria for sweetness 
between obese and normal weight subjects. 
Experiment II attempted to change the sub- 
ject’s criterion by giving additional informa- 
tion designed to influence the subject’s percep- 
tion of the stimulus. The third experiment also 
assessed susceptibility to differences in stimu- 
lus appearance by comparing the magnitude 
estimates of obese and normal weight subjects 
for the sweetness of several suprathreshold 
concentrations of sucrose. 


GENERAL METHOD 
Subjects 


Thirty-nine obese subjects applying to the Rocke- 
feller University Hospital for weight reduction were 
selected. Subjects were rejected if they had other 
medical ailments, psychiatric symptoms, or were over 
the age of 55. Their weights ranged from 70.11 kilo- 
grams to 201.96 kilograms with a mean of 121.26 kilo- 
grams. Their percentage of overweight ranged from 
36% to 217%, with a mean of 105%, based on norms 
from the Metropolitan Life Insurance Company ta- 
bles (1959). The subjects were divided into four 
groups according to age at onset of obesity. Group 1 
consisted of those who became obese between the 
ages of 1 and 8 years (n = 15). For Group 2, the age
of onset fell between 9 and 11 years (n = 9); for
Group 3, between 12 and 18 years (n = 9). Group 4
consisted of those who became obese after the age of
19 (n = 6). Table 1 outlines the salient variables of
age, sex, and percentage of overweight of all groups.
A control group consisting of 13 normal weight vol- 
unteers was recruited from the college population in 
the New York metropolitan area. 


Apparatus 


All solutions were made with distilled water and 
maintained at 34 degrees Centigrade in a circulating 
water bath (Braun constant temperature regulator). 
All test solutions were prepared daily from reagent 
grade sucrose by diluting a refrigerated stock solution
that was prepared weekly. Three milliliters of
solution were delivered by an automatic syringe 
pipette into 1-ounce medicine cups. New cups were 
used for each trial. 


Procedure 


All obese subjects were hospitalized for 1 week for 
experimental procedures. During this hospitalization,
they received a normal diet designed to maintain 
their admission weights. Normal weight subjects 
were tested on eight separate occasions. They were 
paid $20 for their participation at the completion of 
all experiments. All subjects, both obese and normal 
weight, participated in every experiment. 

Subjects were required to sip each test solution, 
hold it in their mouths for 15 seconds, and ex- 
pectorate in the funnel provided. Subjects followed 
each stimulus with a distilled water rinse using this 
same procedure. Subjects were tested in groups of 
three or less in individual cubicles. All stimuli were 
brought to the subjects who remained seated through- 
out the experiment. Subjects were required to re- 
frain from eating or smoking for 1 hour prior to 
testing. 


EXPERIMENT I: SUCROSE SENSITIVITY 
Method 


Solutions. The test solution and the comparison 
stimulus were .175% sucrose (by weight) and dis- 
tilled water, respectively. This sucrose concentration 
was within the range of uncertainty (Linker et al.,
1964).

Procedure. Subjects received both the sucrose solution
and the distilled water in pairs on each trial,
and they were required to identify the sweet solution.
The order of presentation of the sugar solution
in each pair was randomly determined. Subjects
served in five 1-hour daily sessions of 35 paired
trials for a total of 175 trials. The results of all
trials were averaged to obtain a stable estimate of
detectability for each subject.


TABLE 1
SEX, AGE, AND WEIGHT (IN KILOGRAMS) PARAMETERS OF ALL GROUPS

                              Sex              Age              Weight            % overweight
Group                    Males  Females    Range   Mean     Range        Mean     Range      Mean

Obese
  1 (n = 15)               4      11       21-351   33   84.71-201.96  124.84    57-217      117
  2 (n = 9)                5       4       26-48    36   70.11-163.90  120.48    36-149       92
  3 (n = 9)                3       6       24-50    38   94.01-148.69  122.12    73-167      114
  4 (n = 6)                2       4       29-49    40   83.00-187.30  117.60    53-127       82
Normal weight (N = 13)     9       4       17-27    23    44.9-90.5     70.14   -11 to 20      8


TABLE 2
EXPERIMENT I: SUCROSE SENSITIVITY OF OBESE
AND NORMAL WEIGHT GROUPS

Group                               d'      SE

Obese
  1 (n = 15)                       1.86    .223
  2 (n = 9)                        1.18    .268
  3 (n = 9)                        1.30    .234
  4 (n = 6)                        1.43    .309
  Mean of obese subjects (N = 39)  1.51    .130
Normal weight (N = 13)             1.71    .204

Note. Nonpaired t test for all comparisons, p > .05. The
values represent the mean (d') ± the standard error of the
mean.

Subjects were given the following instructions: 


We are testing your taste sensitivity. Your job is 
to taste a variety of substances and report whether 
you taste sugar or water. You will receive a tray 
with two test solutions and a cup of rinse water. 
Decide which one of the two solutions is the 
sugar solution. Taste Number 1 first, do not 
swallow it—think about it—rinse, and then taste 
Number 2. Again, do not swallow. Spit out the 
solutions and the rinse water into the funnel provided.
Then tell me by raising one finger for
Number 1 or two fingers for Number 2, which one
you think is the sugar solution. After making a
decision, rinse again with the remaining rinse
water. Sometimes Number 1 will be the sugar solution,
and sometimes Number 2 will be the sugar
solution. It will always be one or the other, so you
must make a decision. You may have trouble de- 
tecting the sugar because of its low concentration. 
However, you must guess one or the other. I will 
give you a sample of sugar and a sample of water 
so that you can become familiar with the taste. 
On all trials I shall tell you whether you are right 
or wrong. Don’t worry about your performance. 
Just do the best you can. There will be no com- 
municating with each other during the experiment. 


Results 


Individual measures of detectability were
derived from the total percentage of hits and
false alarms.⁴ No differences in detectability
were obtained among the four obese groups
(nonpaired t test) or between the obese and
the normal weight subjects (nonpaired t test).
A minor aim of these experiments was to examine
the contribution of age at onset of obesity
to various taste parameters. In none of the
experiments were there significant differences
between these groups. Consequently, all
analyses represent the combined groups. Table 2
shows the values of detectability (d’)
obtained for each group.

4 In a forced-choice task, a hit or a false alarm is
deciding correctly or incorrectly that the stimulus
is the first member of a pair. The measure of detectability
(d’) was obtained by converting the probabilities
of hits and false alarms into normal curve
deviates (Green & Swets, 1966, p. 61).


EXPERIMENT II: RESPONSE BIAS
Method


Solutions. The concentration of the test solution
was the same as that used in Experiment I (.175%
sucrose). In Part I of the experiment, both sucrose
and distilled water were colorless, while in Part II,
all solutions were adulterated with an odorless, tasteless,
red dye (2.25 milliliters per liter of United States
Department of Agriculture vegetable dye).

Procedure. In both parts, a yes-no procedure with
a symmetrical payoff matrix was used. Subjects were
instructed to identify correctly successive samples of
either sugar or distilled water. The presentation probability
of sugar was 51%, and the sequence was
randomly determined. Subjects received 1¢ for correct
answers (responding “S” when sugar was present
or “W” when water was present) and had 1¢ subtracted
from their earnings for incorrect responses.
Subjects did not receive feedback until the end of
the experiment. Each part consisted of 90 trials and
lasted approximately 1 hour.

Subjects received the following instructions:


You will be tasting a variety of substances. Again
try to decide whether they are sugar or water. You
will receive one sample at a time. Taste it, do not
swallow it, spit it out, and record your answer on
the appropriately numbered card. Write an S for
sugar and a W for water. Then rinse thoroughly
with the rinse water. You will then be ready for
the next trial. Sometimes the solution will be
sugar, and sometimes it will not. Try to decide as
quickly as you can and write your answer down.
Try to do your best. Remember the solutions will
vary. Part of the time they will be sugar, and part
of the time, water. You may win money by being
right. For every right answer, you will get 1¢; for
every wrong answer, you will have 1¢ subtracted
from your earnings.


Results


The proportions of hits and false alarms in
the clear and red conditions appear in Table 3.
The amount of criterion shift represented


the degree to which each subject was influenced
by the addition of the red color. An
estimate was obtained by a graphic procedure
(Stunkard & Koch, 1964). A perpendicular
line was dropped from each point (the coordinates
were the percentage of hits and the
percentage of false alarms) to the positive
diagonal as shown in Figure 1. (The positive
diagonal represents degrees of bias with .5, .5
representing no bias or chance.) The distance
between each subject’s clear and red score
along this diagonal was measured in twentieths
of an inch. As can be seen from Figure 1,
there were no significant differences in the
amounts of criterion shift between the obese
and normal weight subjects (mean ± SE: obese
= -.034 ± .114, normal weight = +.330 ± .264,
p > .10).


TABLE 3
EXPERIMENT II: SENSITIVITY AND CRITERION IN
CLEAR AND RED CONDITIONS FOR OBESE AND
NORMAL WEIGHT SUBJECTS

Proportions of hits (S+) and false alarms (S-)

                      Clear           Red
Group               S+     S-      S+     S-

Obese              .70    .33     .68    sh
Normal weight      .63    .23     .68    B3


[Figure 1: ordinate P(S|s); plotted points for obese and normal
weight subjects in the clear and red conditions.]
Fig. 1. Estimates of response bias for obese and
normal weight subjects in the red and clear color
conditions.
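
The graphic procedure for estimating criterion shift can also be expressed arithmetically. The sketch below is only an illustration under stated assumptions: it works in probability units rather than twentieths of an inch on a plotted graph, the function names are hypothetical, and the example coordinates are invented rather than taken from Table 3.

```python
import math


def position_on_diagonal(false_alarm_rate, hit_rate):
    """Signed distance along the positive diagonal (the line y = x),
    measured from the no-bias point (.5, .5), of the foot of the
    perpendicular dropped from the point (false_alarm_rate, hit_rate)."""
    return (false_alarm_rate + hit_rate - 1.0) / math.sqrt(2.0)


def criterion_shift(clear_point, red_point):
    """Distance along the diagonal between a subject's clear and red
    scores. Positive values mean more "sugar" responses in the red
    condition."""
    return position_on_diagonal(*red_point) - position_on_diagonal(*clear_point)


# Hypothetical subject: (false-alarm rate, hit rate) in each condition.
clear = (0.25, 0.65)
red = (0.35, 0.75)
shift = criterion_shift(clear, red)
print("criterion shift along the diagonal: %.4f" % shift)
```

Because both the hit rate and the false-alarm rate rise by the same amount in this example, the point moves parallel to the diagonal, which is the pattern expected from a change in criterion without a change in sensitivity.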


EXPERIMENT III: SUSCEPTIBILITY TO
EXTERNAL INFLUENCE


Method 


Solutions. Subjects received five suprathreshold 
concentrations of sucrose in 1/4-log steps: 1.951%,
3.423%, 6.162%, 10.954%, and 19.512% (weight by 
volume). Each concentration was presented in both 
a clear and a red form. The coloring procedure was 
identical to that of Experiment II. 

Procedure. The intensities of the solutions were
judged by a direct magnitude estimation procedure
(Stevens, 1961). Each sucrose concentration was randomly
presented three times in clear form and three
times in red form for a total of 30 trials. Subjects
always received the clear 6.162% solution first, and
they were told to give it a sweetness rating of 10.
Subjects were then instructed to estimate the intensity
of the sweetness of each stimulus in proportion
to the sweetness of the immediately preceding one. 
Subjects received the following instructions: 


You will be given a series of sweet solutions. Your 
job will be to judge the intensity of their sweet- 
ness. Assign to the first solution the number 10 to 
represent how sweet it tastes to you. Then give 
numbers to the succeeding stimuli in proportion 
to how strong they taste relative to the immedi- 
ately preceding one. For example, if the second 
stimulus tastes twice as sweet as the first, call it 
20. If it is one-half as sweet, call it 5; if it is 5 
times as sweet call it 50, etc. You may use any 
number including fractions or decimals as large or 
as small as you wish; use whatever number 
seems appropriate to describe the sweetness of the 
solution in proportion to the last one you tasted. 
Do not use negative numbers, and reserve zero 
for the stimuli that have no taste at all. Do not 
worry about being consistent but use whatever 
number seems appropriate. There are no right or 
wrong answers. Just do the best you can. Re- 
member, assign numbers to the stimuli in propor- 
tion to their sweetness relative to the previous one 
tasted. 


Results 


The means of the logarithms of the magnitude
estimates are presented in Figure 2. Zero
responses were given a value equal to one log
step below that subject’s lowest magnitude
estimate. It can be seen from Figure 2 that
sweetness estimates increased as a power function
of sucrose concentration. A straight line
was fitted to each function by calculation of
the regression line. All functions in Figure 2
were significantly linear (p < .01), and both
functions for the red condition had a significant
quadratic component (p < .05). Estimates
of intermediate concentrations appeared
to be more influenced by color than those at
either extreme. A direct test of the difference
between the slopes of the functions was conducted
by an analysis of the difference between
regression coefficients. No differences
were found between obese and normal weight
subjects in the clear condition (F = .9545, p
> .25) or the red condition (F = .0896, p
> .25).


[Figure 2: ordinate, mean log magnitude estimate; abscissa,
sucrose concentration (% weight/volume).]
Fig. 2. Mean logarithmic magnitude estimates of
sweetness as a function of sucrose concentration and
stimulus appearance. (Functions for the normal
weight subjects are displaced along the abscissa.)


TABLE 4
EXPERIMENT III: SUMMARIES OF ANALYSES OF
VARIANCE OF MAGNITUDE ESTIMATES IN
CLEAR AND RED CONDITIONS

Source                     df       MS        F

Normal weight
  Concentration (A)         4     4.408    30.259**
  Clear vs. red (B)         1      .795     5.456*
  A × B                     4      .206     1.415
  Error                   120      .146
  Red:   Linear = 57.524**    Quadratic = 4.744*
  Clear: Linear = 59.760**    Quadratic = .246

Obese
  Concentration (A)         4    16.799    55.647**
  Clear vs. red (B)         1     4.299    14.241**
  A × B                     4      .183      .606
  Error                   380      .302
  Red:   Linear = 118.379**   Quadratic = 9.286**
  Clear: Linear = 98.105**    Quadratic = 1.149

Note. For the clear condition, obese versus normal weight
t = -.264, df = 50, p > .70. For the red condition, obese
versus normal weight t = .302, df = 50, p > .70. For the red
minus clear difference score, obese versus normal weight
t = .876, p > .30.
* p < .05.
** p < .01.

The magnitude estimates of sweetness were 
significantly greater in the red condition than 
in the clear condition for both obese and 
normal weight subjects. A summary of the 
analyses of variance is presented in Table 4. 
Overweight subjects were not more suscepti- 
ble to the biasing effects of the red condition 
(clear minus red difference scores t = .8762);
that is, they did not rate the red solutions sig- 
nificantly sweeter than normal weight sub- 
jects. 
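
The power function relation reported in these results corresponds to a straight line when log magnitude estimates are regressed on log concentration. The following sketch shows that fit under stated assumptions: the five concentrations are those of Experiment III, but the estimates are generated from a hypothetical power function (exponent 1.3), and the size of "one log step" used for zero responses is left as a parameter because it is not specified here.

```python
import math


def replace_zeros(estimates, log_step=1.0):
    """Replace zero responses with a value log_step log10 units below
    the lowest nonzero estimate (log_step is an assumed parameter)."""
    lowest = min(e for e in estimates if e > 0)
    floor = lowest / (10 ** log_step)
    return [e if e > 0 else floor for e in estimates]


def fit_power_function(concentrations, estimates):
    """Least-squares line through (log10 concentration, log10 estimate).
    Returns (slope, intercept); the slope is the power-function exponent."""
    xs = [math.log10(c) for c in concentrations]
    ys = [math.log10(e) for e in estimates]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Sucrose concentrations (% weight by volume) used in Experiment III.
concentrations = [1.951, 3.423, 6.162, 10.954, 19.512]
# Hypothetical estimates following estimate = 2 * concentration ** 1.3.
estimates = [2.0 * c ** 1.3 for c in concentrations]

slope, intercept = fit_power_function(concentrations, replace_zeros(estimates))
print("exponent = %.3f, log10 constant = %.3f" % (slope, intercept))
```

Comparing groups or color conditions then amounts to comparing the fitted slopes, which is the logic of the test of regression coefficients described above.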


Discussion 


The results of the first experiment demon- 
strated no difference between overweight and 
normal weight subjects in sucrose sensitivity. 
While age at onset of obesity is correlated 
with the strength of affective responses to 
weight reduction, there appeared to be no 
differences in taste sensitivity among the various
obese groups (Grinker, Hirsch, & Levin,
in press). Thus, it appears that obesity neither 
produces nor is the result of specific differences 
in the peripheral taste mechanism. 

The results of Experiments II and III demonstrated
that obese subjects were not differentially
responsive to the visual cues provided
by the appearance of the stimuli. The addition
of a red color influenced the judgments of
the sweetness of suprathreshold solutions, but 
not differentially for obese and normal weight 
subjects. Other investigators have reported 
similar effects of color in normal weight sub- 
jects. Dunker (1939) observed that white 
chocolate tasted less like chocolate than did
the more familiar dark chocolate. Pangborn 
(1960) reported that untrained subjects at- 
tributed greater sweetness to red sucrose solu- 
tions of suprathreshold intensity than to clear 
solutions. Sweetness judgments of water were 
not affected by the addition of a red color. 
These previous results are similar to those 
found in the present study which indicate that 
color had no effect on the decision criterion 
for sweetness when the stimulus concentration
was at threshold (Experiment II).




Within the limits of a controlled psycho- 
physical experiment, the taste parameters of 
obese subjects were found to be equivalently 
influenced by visual cues when compared with 
those of normal weight subjects. No direct 
measures of food consumption were attempted, 
and, therefore, generalizations to eating be- 
havior are not possible. While the obese sub- 
jects are not physiologically more sensitive or 
responsive to taste cues, they may be more in- 
fluenced by such cues or attach more impor- 
tance to them in the context of an eating situ- 
ation. It is also possible that other factors 
(e.g., demand characteristics) are present in 
the eating situation (Orne, 1962).


DETERMINANTS OF THE SELF-ATTRIBUTION OF
HAVING INFLUENCED ANOTHER PERSON ¹


JOHN SCHOPLER ² AND BRUCE LAYTON
University of North Carolina


Our analysis of the influence process suggests that Person A’s attribution of
having influenced Person B will be maximum if B’s state after A’s influence
attempt is (a) not predictable from knowledge of B’s previous state, (b)
evaluatively positive, and (c) predictable from A’s influence attempt. Expectations
of a target person’s skill, the target person’s actual success-failure, and
the subject’s own ability were manipulated in an investigation of two of the
determinants of perceived influence. Results confirmed both hypotheses. Perceived
influence was greater when the target person performed in an unexpected
manner than when he performed as expected and was greater when the
target person succeeded than when he failed. Responses for the target person’s
amount of effort and liking for the target person are also examined and discussed.


Interpersonal influence, as Schopler (1965)
has noted, typically is defined as the induction
of change in another person. The attribution
of interpersonal influence, perceiving that Person
A has influenced Person B, involves judging
how much of the change observed in B
was caused by A. The perception of interpersonal
influence is thus an instance of the
general attributional problem of specifying
how a perceiver allocates the origins, or causes,
of a particular effect (Heider, 1944, 1958;
Kelley, 1967). The three components that
seem to us to be important determinants of
attributions of influence are identified below,
but only the two components manipulated in
the present experiment are elaborated.

The sequence of events defining interpersonal
influence is roughly as follows: Given
Person B’s state at Time 1, Person A is seen
to have influence over B if A directs a response
to B, who subsequently does something
at Time 2 which does not follow from his state
at Time 1. From A’s point of view, the things
that follow from B’s state at Time 1 are
simply the totality of expectations A holds
from a knowledge of B and B’s circumstances
at Time 1. If Person A makes an influence
attempt, he will see himself as having influenced
Person B if B’s performance at Time 2
was not expected from what was known about
him at Time 1. The basic ingredient in seeing
oneself as a successful witch doctor or teacher
is to convince oneself that the patient’s or
student’s excellence at Time 2 would not be
expected from what was known about their
states at Time 1. On the other hand, if B’s
performance at Time 2 were expected from
what was known at Time 1, no influence would
be attributed. One of the central hypotheses
to be tested in this study is that A will be
more likely to see himself as having influenced
B if B’s performance subsequent to A’s
influence attempt is not expected, compared to
when B’s performance is expected.

1 This research was supported by National Science
Foundation Grant GS-2563 to the first author.

2 Requests for reprints should be sent to John
Schopler, Department of Psychology, Davie Hall,
University of North Carolina, Chapel Hill, North
Carolina 27514.

The second component, which is contained
in the sequence defining influence, is
the strength of the response directed at Person
B. This component was not manipulated
in the present experiment, and it will suffice
to note that perception of influence will also
be a function of the expected probability between
A’s response and B’s state at Time 2.
If that probability is high, perceived influence
should be high. If that probability is low, perceived
influence should be low.³

These two components should be
applicable to predicting the attributions made
by an outsider observing A and B, as well as
to the self-attributions of A. Furthermore, if
the observer’s and the self’s prior expectations
are identical, their attributions should
be interchangeable. There is, however, a third
component that is likely to differentially affect
self-attributions and observer-attributions
about another’s influence. Balance principles
suggest that a person will be more likely to
take credit for causing a benefit to another
than to take the blame of causing a negative
outcome. Thus, it seems likely that if the state
B attains at Time 2 is evaluatively positive,
there is more likelihood of making a self-attribution
of influence than if B’s state is
evaluatively negative. This is the second major
hypothesis tested in this experiment. Because
the observer’s self-concept is not at
stake, the nature of B’s state at Time 2 may
be less relevant for him. Therefore, the hypothesis
will probably not pertain to observers.

3 A complete statement of this process would include
the effect on perceived influence of B doing
something opposite to what is expected.

While the framework discussed above could 
be applied to a number of studies, it is most 
relevant to an experiment reported by John- 
son, Feigenbaum, and Weiby (1964). Student 
teachers were placed in a simulated teaching 
situation with children as the target persons. 
It was found that when a target person con- 
sistently performed poorly over two sets of 
trials, the teachers attributed the performance 
to qualities inherent in the target person; 
that is, the teachers saw themselves as having 
no influence when the performance was repeti- 
tively poor. If, however, the target person im- 
proved from the first set of trials to the second 
set, the teachers then attributed responsibil- 
ity for the improvement to themselves. Influ- 
ence was present only when the target person 
improved. Johnson et al. see their results as 
fitting with another aspect of Heider’s (1958) 
balance theory, different from the one dis- 
cussed above. For the student who does not 
improve, the second poor performance is con- 
gruent with his characteristics or abilities, 
and the second performance is perceived as 
being internal to him. When the improvement 
is not congruent with the known character- 
istics of the actor, then the action is attrib- 
uted externally—in this instance, to the 
teacher. In our framework, this explanation is 
comparable to whether B’s state at Time 2 is,
or is not, predictable from what is known
about his state at Time 1. 

Because the only change manipulation used 
by Johnson et al. was from poor to good per- 
formance, it is not possible to assess which of 
the processes discussed above was prepotent. 
For example, did the subject-teachers believe 
that they had more influence only when their 
students improved or whenever a student per- 
formed well? (In the experiment, each sub- 
ject-teacher actually had experience with two 
students at a time. The second student always 
performed well, but no influence ratings were 
reported for the consistently good student.) 
In addition, the experiment contained features 
likely to accentuate any tendency for self- 
enhancement through causal attributions. The 
subjects were actual student teachers who 
believed that they were to be graded on the 
basis of the performance of their students. By 
considering those students who did not im- 
prove as responsible for their own poor per- 
formance, the subjects avoided blaming them- 
selves in a situation where their teaching 
ability was being evaluated. Further, by re- 
peatedly emphasizing the superior—subordi- 
nate relationship between teacher and student 
in the instructions, the likelihood of perceiv- 
ing improvement as influence was enhanced. 

The present study is an attempt to pro- 
vide a more complete picture of the determi- 
nants of attribution of influence by assessing 
the effect of two of the three components dis- 
cussed above. It will be recalled that perceived 
influence is thought to be maximum when B’s 
state after the influence attempt (a) is not 
expected, or predictable, from his previous 
state; (b) is predictable from the influence 
attempt; and (c) is evaluatively positive. The 
first and third components were embedded in 
the experiment, which required the subjects 
to complete a task and then to take an ad- 
visory role to a partner who was solving a 
similar task. The sign of the evaluation of the 
partner's state at Time 2 was manipulated by 
whether the partner succeeded or failed on the 
second task. The strength of the influence
attempt, the second component, was not ma- 
nipulated, but was held constant at roughly 
a moderate level of strength by having the 
subjects send their answers, which the partner
was free to accept or reject. (The social status
of the subject and his partner were constantly
equal.) In addition, because of suggestive pilot
data, the subject’s perception of his own ability
on the task was manipulated. This also
permitted assessing whether the Johnson et al.
results would be restricted to influence agents
with high ability. The design was thus a
2 × 2 × 2 factorial. The major dependent
variable was self-attribution of having influ- 
enced a partner. Although the theoretical 
focus of the study was on influence ratings, 
knowledge of a partner’s task performance 
(the major source of information available to 
subjects) might have invoked additional at- 
tributions. Heider (1958) has noted the ways 
in which a person’s level of task performance 
may provide information about the joint relationships
among three factors: the person’s
ability, the person’s amount of effort, and 
the difficulty level of the task. Because ability 
level was manipulated and task difficulty was 
constant, only attributions about the part- 
ner’s effort remained as a possible alternative 
attribution to having influenced him. An item 
measuring effort was therefore included. Fi- 
nally, in the hope of learning more about the 
situation, an item assessing liking for the 
partner was also included. 


METHOD 
Subjects 


Fifty-eight males from the introductory psychology 
course at the University of North Carolina participated
in the experiment as a partial fulfillment of a
course requirement. Data from three subjects were 
discarded because of failure to follow instructions 
and/or incompletion of all experimental forms. Data 
from the remaining 55 subjects are included in the 
analysis. 


Procedure 


Subjects were run in groups of 2-6. When each 
subject arrived at the experimental room, he was 
greeted by the experimenter (the second author), 
who ushered him into a small room and told him 
to put on a pair of earphones. Each subject was in 
a different room. All instructions were tape-recorded 
and given through the earphones. 

When the last subject arrived, the experimenter 
started the tape, which contained a message explain- 
ing that the experiment was concerned with informa- 
tion processes and information exchange. A rationale 
was then given for studying the effects of limited 
information as measured by a test of social sensitivity.
Subjects were told that in the first part of the
experiment they would be given several trials on the 
social sensitivity test in order to help familiarize 
them with it. 

At the conclusion of these instructions, the experi- 
menter gave each subject a copy of the social sensi- 
tivity test which contained 20 items requiring him 
to make certain inferences about a given person.
Subjects filled out some biographical information on 
the cover of the test and then listened to the taped 
instructions explaining the administration of the test. 
The test was divided into four trials of five items 
each. Three minutes were allotted for each trial; the 
experimenter scored a trial in the subject’s presence 
by placing a red “X” next to the “incorrect” 
items according to a prearranged schedule. He then 
instructed the subject to proceed to the next trial.

Subjects were randomly assigned to conditions of 
high and low ability. Those in the high ability 
condition received the following scores: Trial 1, 4 
correct; Trial 2, 5 correct; Trial 3, 4 correct; Trial 4, 
5 correct. Subjects in the low ability condition received
a pattern of 1 correct, 2 correct, 1 correct, and
2 correct, respectively, for the four trials. 

After scoring the last trial, the experimenter asked 
each subject to complete a questionnaire that con- 
tained some filler items and a manipulation check 
as to the subject’s perception of his ability on the 
task. All questions were answered on an 11-point 
scale with appropriate anchors such as “very un- 
skillful” and “very skillful” at each end of the scale. 

Upon completion of the questionnaires, the subjects 
received instructions through the earphones that they 
would be divided into sender-recipient pairs. Through
a system of lights and switches, all subjects were led 
to believe that they were senders. They then received 
the following instructions: 


Each member of a sender-recipient pair will receive 
the second section of the test that you have just 
taken. The sender decides what he believes the 
correct answer to be for a given problem, writes 
it down, and then reads out that answer to his 
partner. For the purposes of this experiment, it is 
very important that the sender read out only the 
one word answer and nothing else. After receipt 
of this information, the recipient will decide what 
he believes the correct answer to be and will signal 
me according to the following instructions. 


The instructions were then rephrased and repeated. 
The test was divided into trials again, but “because 
of the experimental interest in the effects of limited 
information,” the recipients would not be given feed- 
back as to whether a specific problem was right or 
wrong but rather would get feedback after each 
section as a whole. The feedback came via the ear- 
phones which, “because of the wiring setup,” the 
sender would overhear. One trial was worked on at 
a time. It was made clear to the subject (sender) 
that his answers would not be known to the 
experimenter until the end of the experiment. 

The experimenter then brought the test to the 
subjects. In order to manipulate the subjects’ perceptions
of the target person’s state at Time 1, the
experimenter also gave each subject a form indicating 
his recipient’s score on the first test. Subjects were 
either assigned to a high-ability target person, who 
had 18 correct out of the possible 20 in the first series 
of trials, or to a low-ability target person, who had 
gotten 5 correct out of the possible 20. It was made 
clear to each subject that his target person did not 
know how well the subject had performed on the 
first test. 

Subjects then began the task. Upon a signal from 
the subject that a given trial had been completed, 
the experimenter gave each “recipient” feedback on 
his progress which was overheard, of course, by the 
appropriate subject. Subjects were again randomly 
assigned to conditions of the target person’s success 
or failure. Those in the success condition heard that
their target person scored five out of five on the 
first trial, four correct on the second, four correct 
on the third, and five correct on the last trial. Sub- 
jects in the failure condition, however, heard that 
their target person scored two correct out of five 
on the first trial, one correct on the second, one 
correct on the third, and two correct on the last trial. 

When all four trials of the test had been completed, 
the experimenter gave each subject a questionnaire 
containing five 11-point scale items dealing with the
target person’s ability, the subject’s own ability, the
subject’s perceived influence on his target person,
his estimation of the target person’s effort on the
second task, and a measure of his liking for the
target person.

Upon completion of the questionnaires, the experi- 
ment was terminated. Subjects were debriefed, decep- 
tions were explained, and promises not to reveal the 
nature of the experiment were elicited. 


RESULTS 


It should first be noted that there was no
manipulation check with respect to the sub- 
jects’ perceptions of the target person’s ability 
on the first experimental task or of the target 
person’s expected performance on the second 
task. Because each subject had the written 
statement of his target person’s previous 
score in front of him for the duration of the 
second half of the experiment (including the 
completion of the second questionnaire), it 
did not seem necessary, or even well advised, 
to include such measures. To do so might 
have aroused the subjects’ suspicions about 
the purpose of the experiment. The ratings of 
the target person’s actual performance on 
the second task indicated an adequate ma- 
nipulation of this variable. The means for the 
successful and unsuccessful target persons 
were 9.38 and 4.52, respectively (F = 79.0,
df = 1/47, p < .001). The subjects’ perceptions
of their own abilities were also adequately
manipulated. The self-ratings for the
high and low ability groups after the first task
were 8.39 and 2.96, respectively (F = 87.7,
df = 1/47, p < .001). Although the magnitude
of this difference was reduced for ratings
after the second task to 6.86 and 5.11, respectively,
the difference remained significant
(F = 0:1; df = 1/47, p < .002).


TABLE 1


MEAN INFLUENCE IN RELATION TO OWN CONFIDENCE,
TARGET PERSON’S ABILITY, AND SUCCESS


                     High target person’s   Low target person’s
                           ability                ability
Group                 Success   Failure      Success   Failure
High self-ability      6.33      6.00         7.71      4.57
   n                   6         8            7         7
Low self-ability       6.17      7.43         7.57      4.14
   n                   6         7            7         7


Note. Higher numbers indicate more influence.

The effects of the manipulations on the 
main dependent variable, perceived influence, 
are presented in Table 1. For convenience, 
the means for perceived effort and liking are 
presented in Tables 2 and 3, and a summary 
of a three-factor multivariate analysis of 
variance is presented in Table 4. 

The hypothesis that any change in the 
target person’s performance would induce a 
self-attribution of influence is tested by the 
interaction between the target person’s initial 
ability and his success or failure on the second 
task. As can be seen from Table 4, this effect 
is significant at p < .01. The means in Table 1 
indicate that the differences are in the predicted
direction. Self-attributions of influence
are greater when a high-ability target person
fails than when he succeeds, and more influence
is attributed when a low-ability target
person succeeds than when he fails.


TABLE 2


MEAN TARGET PERSON’S EFFORT IN RELATION TO
OWN CONFIDENCE, TARGET PERSON’S ABILITY,
AND SUCCESS


                     High target person’s   Low target person’s
                           ability                ability
Group                 Success   Failure      Success   Failure
High self-ability      9.00      7.13         6.14      6.43
Low self-ability       6.50      8.14         7.57      7.43


Note. Higher numbers indicate more effort.


TABLE 3


MEAN LIKING FOR THE TARGET PERSON IN RELATION
TO OWN CONFIDENCE, THE TARGET PERSON’S
ABILITY, AND SUCCESS


                     High target person’s   Low target person’s
                           ability                ability
Group                 Success   Failure      Success   Failure
High self-ability      7.83      6.38         7.71      5.71
Low self-ability       7.17      7.29         7.00      7.29


Note. Higher numbers indicate greater liking.
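The direction of this interaction can be checked descriptively from the Table 1 cell means. The following is a minimal sketch, not the authors' analysis: it pools the two own-ability groups with cell-size weights (an assumption about how to collapse the cells) and compares outcomes within each level of target ability. All variable and function names are illustrative.

```python
# Cell means and cell sizes transcribed from Table 1 (perceived influence).
# Keys: (target_ability, outcome, own_ability). Names are illustrative only.
CELLS = {
    ("high", "success", "high"): (6.33, 6),
    ("high", "failure", "high"): (6.00, 8),
    ("high", "success", "low"): (6.17, 6),
    ("high", "failure", "low"): (7.43, 7),
    ("low", "success", "high"): (7.71, 7),
    ("low", "failure", "high"): (4.57, 7),
    ("low", "success", "low"): (7.57, 7),
    ("low", "failure", "low"): (4.14, 7),
}


def pooled_mean(target_ability, outcome):
    """Cell-size-weighted mean over own-ability levels (an assumed pooling rule)."""
    rows = [
        (mean, n)
        for (target, result, _own), (mean, n) in CELLS.items()
        if target == target_ability and result == outcome
    ]
    total_n = sum(n for _mean, n in rows)
    return sum(mean * n for mean, n in rows) / total_n


def unexpected_minus_expected():
    """Influence for the unexpected outcome minus the expected one, per target ability."""
    high = pooled_mean("high", "failure") - pooled_mean("high", "success")
    low = pooled_mean("low", "success") - pooled_mean("low", "failure")
    return high, low


if __name__ == "__main__":
    high_diff, low_diff = unexpected_minus_expected()
    print(f"High-ability target, failure minus success: {high_diff:.2f}")
    print(f"Low-ability target, success minus failure: {low_diff:.2f}")
```

Both differences come out positive under this pooling, with the larger one for the low-ability target person, which matches the pattern described in the text. The cell sizes also sum to 55, consistent with the 47 error degrees of freedom reported for an eight-cell design.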

The main effect for success-failure (Table 4)
on attributed influence is also significant
(F = 5.61, df = 1/47, p < .02). If the target
person succeeds, the subjects believe themselves
to have been more influential than if
the target person fails. This result adds confirmation
to the second hypothesis, that influence
will be greater if the target person’s
attained state is evaluatively positive. The
result simultaneously qualifies the first hypothesis.
Discussion of the qualifications follows
before turning to the remainder of the
results.

FIG. 1. Effects of target person’s ability and success
on amount of influence. (Horizontal axis: performance on
2nd task, failure versus success; separate lines for
low-ability and high-ability target persons.)


TABLE 4


MULTIVARIATE ANALYSIS OF VARIANCE ON THE MEAN EFFECTS OF OWN CONFIDENCE, THE
TARGET PERSON’S ABILITY, AND SUCCESS ON INFLUENCE, LIKING, AND EFFORT


                                      Influence        Liking          Effort     Multivariate
Source                         df    MS      F        MS      F       MS      F        F
Target person’s ability (A)     1    4.40    .83      .73    .26     8.38   2.30      .96
Success-Failure (B)             1   29.80   5.61*    8.21   2.90      .01    .00     3.19*
Own ability (C)                 1     .42    .08     1.38    AO      1.69    AT       .25
A X B                           1   47.61   8.97**    .13    .04      .16    .04     3.08*
A X C                           1    3.21    .60      .36    .13    11.65   3.20     1.30
B X C                           1    1.36    .26    12.85   4.53*    7.62   2.09     1.88
A X B X C                       1    3.02    .57      .43    .15    13.28   3.64     1.48

*p < .05.
**p < .01.


The interaction effect of the target person’s
ability and success-failure on attributed influence
is shown graphically in Figure 1. It can
readily be seen in Figure 1 that the high-ability
target person induces a flatter slope
compared to the low-ability target person.
The meaning of an unexpected performance
will, of course, differ for high- and low-ability
target persons. The unexpected outcome for
a high-ability target person is failure; the
unexpected performance for the low-ability
target person is success. The unexpected
success of the low-ability target person induces
the attribution of influence because it
invokes two of the components theoretically
related to attributing influence. This circumstance
contains both the change and the positively
valued end-state components. The unexpected
failure of the high-ability target
person creates a circumstance in which two
of the components determining influence are
pitted against each other. The target person’s
performance is unexpected, but, in this instance,
it is not a positively valued performance.
Influence ratings of the high-ability
target person, as can be seen in Table 1, seem
to be affected by the subject’s ability level.
The low-ability subjects rate themselves as
having more influence over the high-ability
target person who fails than over the high-ability
target person who succeeds, while a
slight reversal is evident for the high-ability
subjects. The magnitude of this effect is not
sufficient to produce a triple interaction in
the absence of any effect for own ability on
ratings of the low-ability target person.
Ratings of the target person’s effort did not 
produce any significant effects. Two effects 
approached significance: the interaction of 
the target person’s ability and own ability 
(F = 3.20, df = 1/47, p < .08) and a triple
interaction (F = 3.64, df = 1/47, p < .06). It 
is clear from the patterns of the means that 
in neither case do ratings of effort serve as 
substitutes for attributions of influence. 
Liking for the target person was involved 
in only one significant effect. Table 4 shows 
that the subjects’ ability interacted with 
success-failure to determine the amount of 
liking. From Table 3 it can be seen that the 
subjects liked a target person who performed 
similarly to their own ability level. Thus, low- 
ability subjects liked a target person who 
failed more than one who succeeded, and 
high-ability subjects liked the target person 
who succeeded more than the one who failed. 
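The univariate F ratios in Table 4 can be cross-checked against their mean squares. The sketch below assumes each F is the effect mean square divided by a single error mean square per dependent variable (the usual fixed-effects arrangement; the article does not print the error terms), and uses only cells that are clearly legible in Table 4. The names are illustrative.

```python
# (effect mean square, F) pairs transcribed from legible cells of Table 4.
REPORTED = {
    "influence": [(29.80, 5.61), (47.61, 8.97)],
    "liking": [(8.21, 2.90), (12.85, 4.53)],
    "effort": [(8.38, 2.30), (11.65, 3.20), (7.62, 2.09), (13.28, 3.64)],
}


def implied_error_ms(pairs):
    """Error mean square implied by each pair, assuming F = MS_effect / MS_error."""
    return [ms / f for ms, f in pairs]


def spread(values):
    """Range of the implied values; small if one shared error term fits all rows."""
    return max(values) - min(values)


if __name__ == "__main__":
    for measure, pairs in REPORTED.items():
        implied = implied_error_ms(pairs)
        average = sum(implied) / len(implied)
        print(f"{measure}: implied error MS about {average:.2f} "
              f"(spread {spread(implied):.3f})")
```

Within each dependent variable the implied error terms agree to within rounding, which supports reading the columns of Table 4 as MS and F pairs sharing one error mean square per measure.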


Discussion 


The results provide confirmation for the
hypothesis that high influence would be attributed
when the target person’s performance
was unexpected, compared to when it was
expected. The interaction effect for the target
person’s ability and success-failure was significant.
The results were also consistent with
the hypothesis that a positively valued end
state would induce higher influence ratings
than a negatively valued end state. The results,
however, must be understood in terms
of the basic difference between the meaning
of unexpected performance for a high- and
a low-ability target person.

As has previously been noted, the un- 
expected performance of a low-ability target 
person is success. In such a circumstance, the 
implications of both components of the origi- 
nal formulation (positively valued end state 
and unexpected change) are in the direction 
of inducing a self-attribution of influence. The 
low-ability target person’s unexpected success 
is unabashedly attributed to influence, by 
both high- and low-ability subjects, while the 
low-ability target person’s expected failure is 
consistently interpreted as signifying consider- 
ably less influence. This part of the design and 
the attendant results are comparable to the 
study reported by Johnson et al. (1964). In 
distinction to their study, the present results 
could not be a consequence of ascribed status 
differences or the influence agent’s perception 
of superior ability. Status was held constant, 
and the subject’s own ability clearly had no 
effect on ratings of influence over a low-ability 
target person. The limitations of predicting in- 
fluence only from unexpected performance (or 
incongruent performance) are illustrated in 
the results for the high-ability target person. 

The consequences for a subject confronting 
a high-ability target person are quite different. 
His unexpected failure or his expected success 
each pit the change and the positive end-state 
components against each other. In this circum- 
stance the mean influence ratings for high- 
and low-ability subjects are reversed between 
the success and failure cells. Because the
Success-Failure X Own Ability interaction univariate
F is not significant, it is, unfortunately,
not legitimate to test the Success-Failure X
Own Ability interaction only for the high-ability
target person cells. In the present design
it is not possible to determine the stability
of the reversal. It still seems worthwhile
to attempt to understand the reversal. In
keeping with the change hypothesis, the low-ability
subjects attribute more influence for
unexpected failure than for the expected success.
In keeping with the evaluative end-state
hypothesis, the high-ability subjects rate,
albeit only slightly, more influence for expected
success than for unexpected failure.
Influence ratings are thus higher when the
target person’s performance is in the direction
of the subjects’ own ability. Manipulation of
the subjects’ own ability may have inadvertently
affected the expected strength of his
influence response, the second component
determining influence mentioned above.
It is then possible to suggest that when the
indications of influence provided by the other
two components are in conflict, even weak
differences in the strength of the influence
response make a contribution to the final influence
ratings.

No evidence was found to indicate that the 
perceived amount of the partner’s effort was 
used as an alternative to attributing influence 
over him. Not only were there no significant 
effects for amount of effort, but the pattern 
of means in Table 2 seems to follow lines 
determined by similarity between the subjects’ 
ability and the target person’s expected abil- 
ity. In any event, those cells which generate 
the highest influence ratings clearly do not 
contain the lowest ratings of effort. The sub- 
ject’s perceptions of the motivational compo- 
nents determining task performance in this 
situation were evidently sufficiently stable to 
preclude obtaining a negative relationship 
between influence and effort.

Liking for the target person does not 
appear to mediate any of the effects observed 
for influence. Instead, an interaction between 
own ability and success-failure was observed.
Low-ability subjects like target persons who
fail more than those who succeed, and high-ability
subjects like target persons who succeed
more than those who fail. This suggests
that people are attracted to others who perform
at a level that is similar to their own
ability. The effect, however, is largely determined
by the high-ability subjects. They
clearly prefer the target persons who succeed
over those who fail. The low-ability subjects ...
prefer success and to derogate failure might
imply self-depreciation to the low-ability subjects,
and yet failure is not itself a viable
basis for significant increments of liking.

The analysis of the influence process presented
in the present study seems to contain
some promise for being an appropriate framework
that can be applied to any interpersonal
interaction involving influence. The confirmation
provided by the experiment of the components
that are deemed critical in determining
influence should encourage further
applications of the analysis.


GENERATIONAL CONTINUITY AND DISCONTINUITY IN THE
UNDERSTANDING OF SOCIETAL REJECTION


JEANNE H. BLOCK


University of California, Berkeley


The study identified two groups of students differing in their generational
continuity-discontinuity: a group of students rejecting both societal institutions
and their parents (discontinuity group) and a second group of students who
have rejected their society while still positively regarding their parents (continuity
group). Some personological and socialization factors differentially associated
with generational continuity-discontinuity were evaluated. Adjective Q-sort
self- and ideal-self descriptions and the Child-Rearing Practices Report
were found to reliably distinguish the two groups, with the continuity group
generally advantaged in the comparisons. A replication of the socialization data
based on a sample of parents was reported and a controlled analysis was completed
to determine the contribution of both ideological and socialization
factors to generational continuity-discontinuity.


Investigators and observers of the student
movement concur in finding rejection of societal
values at the core of youthful dissent.
However, interpretations diverge regarding the
motivational basis underlying such contemporary
social protest. The present study
identifies two differently motivated kinds of
societal rejections and explores empirically
their respective antecedents and psychological
concomitants.

One interpretation of student protest and
activism is based on psychoanalytic theory.
Although they elaborate the essential thesis
differently, Feuer (1969), Bettelheim (1969),
and Rubinstein and Levitt (1969) coalesce in
ascribing radical student protest to unresolved
Oedipal conflicts wherein the adolescent is
rebelling against the parent, either directly
or symbolically. These observers of the contemporary
student scene agree about the
origins of rebellion (unresolved Oedipal conflicts
compounded by permissive child rearing),
the focus of social protest (on the
institutional surrogates for the parent), and
the psychopathological implications of social
rejection so motivated (little legitimacy is
accorded student protests addressed to society
and its institutions).

Another understanding of student social 
protest is represented in the studies of Flacks 
(1967), Keniston (1968, 1969), Block, Haan, 
and Smith (1969), and Smith, Haan, and 
Block (1970). The findings of these investi- 
gators have shown that parents of student ac- 
tivists are themselves politically both liberal 
and active and socialize their young in ways 
that predispose them to political-social in- 
volvement. The convergence of parent and 
child value systems indicates that identifica- 
tion with, rather than rebellion against, the 
parent can be a motivating factor in protest 
activities. The findings supporting this socialization
hypothesis suggest that, although the
origins of social protest are manifold, a central 
component for many activists involves rational 
processes applied in critical examination of 
contemporary society and a moral commit- 
ment to societal change. Both this rationality 
and this moral commitment may be encouraged
by socialization processes emphasizing
individuation, authenticity, and humanitarian 
values (Block, Haan, & Smith, 1968; 
Haan, Smith, & Block, 1968). According to 
the socialization-toward-involvement interpre- 
tation, the focus of student protest is pri- 
marily against particular societal institutions 
and values rather than primarily against the 
parents. There is the additional implication 
in this view that some degree of legitimacy 
attends the moral protests of the young, 
which are directed against dehumanizing insti- 
tutional practices and special societal values. 
The two types of societal rejection repre- 
sented by these contrasted interpretations can 
be readily operationalized and comparatively 
evaluated. Students seeking societal and insti- 
tutional change must first be identified. Then, 
this subset of students must be further sub- 
divided into those who reject their parents as 
well as the larger society (the discontinuous 
individuals) versus those who identify with 
their parents, although fundamentally critical 
of the broader society (the continuous individuals).
Clearly, these two types of societal
rejection have different implications for the
quality of personal development, the nature
of consequent political-societal protest, and
the response of societal institutions. To explore
the basis for these two routes to societal
disaffection, the present inquiry was
undertaken.

Three primary comparisons were developed. 
First, samples of discontinuous and continu- 
ous individuals were compared to ascertain 
their differences with respect to their self- 
percepts, their ego ideals or value systems, 
and their views of their upbringing. Second, 
the parents of two additional samples of dis- 
continuous and continuous individuals were 
compared with respect to their child-rearing 
orientations to replicate and to extend rela- 
tionships found between socialization practices 
and generational discontinuity or continuity. 
Third, a sample of continuous conservatives
(college students conservative in their political
outlook and also in political agreement with
their parents) was identified so that the child-rearing
attitudes of their parents could be
studied to clarify some questions raised by
earlier comparisons.
STUDY I: THE SELF-PERCEPTS AND EGO
IDEALS OF CONTINUOUS VERSUS
DISCONTINUOUS STUDENTS


Method 


Selection of comparison groups. The subject pool 
contained 1,051 subjects from the University of 
California, Berkeley, and San Francisco State Col- 
lege, representing a broad spectrum of political 
ideologies and socioeconomic and educational levels, 
as well as different orientations on political-social 
issues. The subjects were studied in 1964. Details of 
data collection and sample constitution may be
found in Smith et al. (1970). From this pool of 
subjects, two groups of subjects with different pat- 
terns of rebellion were selected on the basis of their 
questionnaire responses: (a) those rejecting both their
parents and society and (b) those rejecting society 
but basically identifying with their parents. 

The criterion for rejection of traditional societal 
values was based on the subject’s self-rating on an 
8-point continuum of political orientation, which 
ranged from extreme conservatism (1) to radicalism 
(8). The subjects who rated themselves as liberal-radical
or radical (Steps 7 or 8, positions one standard
deviation or more above the mean of the total
sample) were considered to have rejected traditional
societal values. Using this criterion, 147 subjects were 
identified. 

The subjects meeting this first criterion were then 
evaluated with respect to their rejection or acceptance 
of parental values. Six scores reflecting agreement
with or influence by the parents were available for
each subject: (a) the agreement of the subject with
the political party preference of the mother; (b) the
agreement of the subject with the political party
preference of the father; (c) a composite score
reflecting agreement of the subject with his mother
on six political-social and personal issues (e.g., civil
rights, student demonstration, religion); (d) a composite
score reflecting agreement of the subject with
his father on these six political-social and personal
issues; (e) a composite score reflecting the subject’s
judgment of his mother’s influence on his personal,
ethical, and political development; and (f) a composite
score reflecting the subject’s judgment of his
father’s influence on his personal, ethical, and
political development.

A subject was identified as rejecting parental values 
when (a) there was a discrepancy of 4 or more 
points (on the 8-point continuum) between the subject’s
self-rated radicalism or liberal radicalism and
his rating of the ideological position of his mother
and of his father (i.e., both parents were viewed as
moderate conservatives or more extreme conservatives),
and (b) the subject manifested low agreement
with and influence by his mother and his
father (scores one standard deviation or more below
the agreement and influence means of the total
sample). Thus, the young people meeting these criteria
rejected their parents’ political and value orientations
and, more important, denied a positive influence
of their parents on their developing personalities.
These subjects are designated the discontinuity group
to emphasize their estrangement from their parents’
values as well as those held by society at large.
Twenty-five males and 15 females met these selection
criteria.

A subject was identified as accepting parental 
values if the discrepancy between his political position
and that of each of his parents did not exceed
3 points on the rating scale (the mean discrepancy 
proved to be 1.24 and typically, these parents were 
viewed as liberals) and, also, the scores on the agree- 
ment and influence indexes for both parents exceeded 
the means characterizing the total sample. The subjects 
meeting these criteria attributed a positive influence 
to their parents and manifested little discrepancy 
between their own beliefs and values and those of 
their parents. They are designated as the continuity 
group, which includes 37 males and 22 females. 

Forty-eight subjects who were radical or liberal- 
radical in their political persuasions failed to obtain 
scores on the parental agreement and parental influ- 
ence dimensions that qualified them for the continuity 
and discontinuity groups. These subjects were not 
included in the subsequent analyses. 
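The two-stage selection rule described above can be summarized as a small sketch. This is an illustration under stated assumptions, not the study's scoring procedure: the agreement and influence indexes are represented here as z-scores relative to the total sample, the six parental scores are collapsed to one agreement and one influence index per parent, and every field and function name is hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Subject:
    """Hypothetical record; the 8-point scale runs from 1 (extreme conservatism) to 8 (radicalism)."""
    self_rating: int
    mother_rating: int  # subject's rating of his mother's position
    father_rating: int  # subject's rating of his father's position
    # Assumed encoding: indexes standardized against the total sample.
    mother_agreement_z: float
    father_agreement_z: float
    mother_influence_z: float
    father_influence_z: float


def classify(subject: Subject) -> Optional[str]:
    """Return the comparison group, 'unclassified', or None if not selected at all."""
    if subject.self_rating < 7:
        return None  # Steps 7 or 8 were required for rejection of societal values.

    gaps = (
        subject.self_rating - subject.mother_rating,
        subject.self_rating - subject.father_rating,
    )
    indexes = (
        subject.mother_agreement_z,
        subject.father_agreement_z,
        subject.mother_influence_z,
        subject.father_influence_z,
    )

    # Discontinuity: 4 or more points from both parents, and indexes at least
    # one standard deviation below the sample means.
    if all(gap >= 4 for gap in gaps) and all(z <= -1.0 for z in indexes):
        return "discontinuity"

    # Continuity: no more than 3 points from each parent, and indexes above
    # the sample means.
    if all(abs(gap) <= 3 for gap in gaps) and all(z > 0.0 for z in indexes):
        return "continuity"

    return "unclassified"


if __name__ == "__main__":
    example = Subject(8, 3, 4, -1.2, -1.5, -1.0, -2.0)
    print(classify(example))
```

The three outcomes correspond to the counts reported in the text: 40 discontinuity subjects, 59 continuity subjects, and 48 who met neither set of criteria, which together account for the 147 subjects passing the first criterion.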


The findings to be reported are based on
three data sources available from a more
extensive study of student activism (Block et
al., 1968; Block, Haan, & Smith, 1969; Haan
et al., 1968; Smith, Haan, & Block, 1970).
Specifically, each subject contributed descriptions
of both his perceived self and ideal self
using a 63-item adjective Q set. For each Q
item, the difference between the two comparison
groups was evaluated by the use of t tests.
The sexes were analyzed separately because
of the expectation that the psychology of
radicalism in men can differ from the psychology
of radicalism in women.
Self-descriptions. With respect to the self-characterizations
of these young people as
reflected by adjective Q descriptions, the male
continuity individuals describe themselves,
comparatively, as somewhat less unconventional
(p < .05) and as more responsible
(p < .01), masculine (p < .05), orderly
(p < .05), and practical (p < .10) than their
counterparts in the discontinuity group, for
whom amusing (p < .10) and creative
(p < .10) were distinguishing adjectives.
The women in the continuity group characterized
themselves, comparatively, as having
vitality (p < .05) and confidence (p < .05)
and as being significantly more independent
(p < .01), assertive (p < .01), talkative
(p < .05), informed (p < .05), perceptive
(p < .10), and responsible (p < .10). The
discontinuity young women, alienated from
both societal and parental values, appear to
be troubled, insecure, anxious girls, as
shown by the differentiating adjectives, rebellious
(p < .01), doubting (p < .05), shy
(p < .05), self-denying (p < .05), stubborn
(p < .05), needs approval (p < .10), and
worrying (p < .10).

Ideal-self descriptions. In the context of
rejection of traditional societal values, young
continuity people, who have a mutually respecting
relationship with their parents, assert
values for themselves that might be considered
comparatively “straight,” while the
students in the discontinuity group proclaim
a “counterculture” orientation. These conclusions
derive from the value implications
of the ideal-self description that the subjects
conveyed. In their ideal-self descriptions, the
male continuity subjects comparatively emphasize
the values of foresight (p < .05), of
being self-controlled (p < .05), critical
(p < .05), and argumentative (p < .05). The females
in the continuity group place greater
value on being logical (p < .01), considerate
(p < .05), and foresightful (p < .05).

By contrast, the distinguishing adjectives
for the women in the discontinuity group reveal
a concern for adventurousness (p < .05),
keeping one’s cool (aloof and uninvolved,
p < .05; calm, p < .05; reserved and shy,
p < .05), and being free and not “hung-up”
(p < .10). This constellation of adjectives
suggests that the discontinuity women zealously
aspire to an interpersonal independence.
The men in the discontinuity group, on the
other hand, expressed ideals that are both
more classically feminine and consonant with
the nominal ethic of the love-generation community:
they aspire to being genuine and
authentic (p < .01), creative (p < .05), artistic
(p < .10), playful (p < .10), and loving
(p < .10).

These results suggest an interesting reversal 
of emphasis in the sex roles sought by dis- 
continuity subjects: the discontinuity women 
project ideal-self-images that seem typically 
more masculine, while the discontinuity males
place greater emphasis on achieving qualities
conventionally viewed as feminine. 

Perceptions of parental child-rearing values. 
Socialization antecedents associated with dif- 
ferent political-social orientations were as- 
sessed by the students’ retrospective descrip- 
tions, using the Child-Rearing Practices 
Report (CRPR; Block, 1965), a 91-item Q 
set focusing on the parental values and prac- 
tices by which they had been reared. Descrip- 
tions of both maternal and paternal child- 
rearing values were obtained. Obviously, these 
data can involve some degree of distortion 
and rationalization and are imperfect regis- 
tries of the actual child-rearing behaviors of 
the parents. However, they are valid indi- 
cators of the perceptions or understandings 
held by the subjects of the ways in which 
they were reared, and, as such, they are interesting
in their own right. Because the percep-
tions of the more alienated discontinuity 
group might involve some post hoc rational- 
ized justification for their current estrange- 
ment from their parents, thus affecting the 
meaning of the child-rearing findings, the 
analysis of CRPR data from the actual par- 
ents of the discontinuity and continuity stu- 
dents, reported in the next section, becomes 
especially relevant. 


Tables 1 and 2 present the differences in
the socialization practices experienced by the
two groups, as reflected in the CRPR. The
CRPR analyses were done separately and
combined for the sexes. Since the results were
generally equivalent for the sexes, only the
results of the combined analyses were presented
to save space. The items reported are
all significant at or beyond the .01 level (34%
of the items for the mothers and 44% of the
items for the fathers were significant at or
beyond the .05 level).

With respect to both mothers and fathers
a consistent pattern emerges. The parents of
the continuity group were described as encouraging
of reflection, of curiosity, and of
questioning. They appeared to find greater
satisfaction in their parental roles, to respect
the child’s opinions, and to be interested in
and appreciative of his accomplishments. The
individuation of the child was emphasized,
according to the perceptions of these young
people, in that they were helped by their
parents to develop a sense of autonomy. It is
interesting to note that young people in the
continuity group were not “permissively”
reared, a term that has unfortunately been
misinterpreted to mean license, indulgence,
and freedom from limits. Rather, the parents


TABLE 1


STUDENT Q-SORT DESCRIPTIONS OF MOTHERS’ CHILD-REARING ATTITUDES SIGNIFICANTLY
DIFFERENTIATING CONTINUITY AND DISCONTINUITY GROUPS AT OR
BEYOND THE .01 LEVEL


CRPR item 


My mother: 
Respected my opinions 


Didn’t think young children should see each other naked 
Wished my father were more interested in his children 

Kept me away from families who had different ideas or values 
Encouraged me to wonder and think about life 

Felt I should have time to think and daydream 

Let me make decisions myself 

Talked it over and reasoned with me when I misbehaved 

Gave me a good many responsibilities 

Thought children had to take chances as they grew up 
Encouraged me to be curious 

Explained things by using the supernatural 

Expected me to be grateful 

Let me know when she was angry with me 

Let me know how much she sacrificed for me 

Let me know how ashamed and disappointed she was when I misbehaved 
Encouraged me to be independent 

Found it interesting to be with her children 

Controlled me by warning of all the bad things that could happen 


Continuity | Discontinuity 
group mean group mean trati 
(n = 55) (n = 36) 
5.78 4.33 3.87 
2.73 3.64 3.01 
3.20 4.31 2.67 
1.71 IAL 5.37 
5.04 3.19 4.92 
4.85 3.78 2.74 
5.87 4.50 3.62 
4.89 3.64 3.13 
4.58 3.06 3.96 
5.13 3.89 3.03 
5.55 3.12 5.15 
1.85 3.58 4.37 
4,58 5.81 3.44 
5.69 4.58 3.24 
3.27 492 3.70 
3.09 442 3.42 
4.82 3.64 2.65 
5.00 3.69 3.76 
2.51 417 4.14 


Note.—The Q items are in somewhat abbreviated form, Seven categories of response were used, with “7 
most characteristic and "1" indicating items least characteristic, Means in italics are more characteristic, 


indicating items 
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STUDENT Q-Sort DESCRIPTIONS or FATHERS’ 
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TABLE 2 


CHID-REARING ATTITUDES SIGNIFICANTLY 


DIFFERENTIATING CONTINUITY AND DISCONTINUITY GROUPS AT OR 
BEYOND THE 0l LEVEL 


7 could be described as “responsibly respon- 
sive.” They appeared to have clear expecta- 
tions for their children, and they encouraged 
maturity, responsibility, and independence. 
x The misbehaviors of the child were neither 
ignored nor indulged, but instead were ac- 
knowledged and discussed with the child. Stu- 
i dents in the continuity group are further dis- 
„tinguished by the emphasis in their families 
on rationality, They were encouraged by both 
parents to confront dilemmas and to arrive 
at their own conclusions. The response to 
wrongdoing was rational discussion rather 
than physical punishment. 

Young people rebelling against both paren- 
tal and societal values present a different pic- 
ture. The parents of subjects in the discon- 
tinuity group were described as emphasizing 
appearances and being concerned about the 
impression others might have of their children. 
The child-rearing regime was described as sup- 
pressive and unresponsive to even the legiti- 
mate needs of the child. Socialization ap- 
peared to be concerned with attempts to 
circumscribe the children’s thinking, reactions, 
and behaviors. Guilt and anxiety arousal were 
he telied upon as mechanisms of control, ac- 


Ve 


Continuity | Discontinuity 
CRPR item group mean group mean t ratio 
x (n = 50) (n = 34) 
My father: 
me Respected my opinions 5.88 4.12 4.75 
y Helped when I was being teased 3.96 2.91 FL 
Gaye me comfort when I was upset 4.70 3.41 3.10 
E. Kept me away from families who had different ideas and values 1.70 3.26 SaS 
; Thought children should be seen and not heard 2.06 3.68 4.05 
Encouraged me to wonder and think about life 5.06 3.50 7.11 
= Felt I should have time to think and daydream 4.98 3.32 4.57 
l Y Let me make decisions myself 6.16 5.03 3.36 
Felt I was a bit of a disappointment 2.52 4.00 3.37 
Was easygoing and relaxed 4,90 3.65 2.78 
Talked it over and reasoned with me when I misbehaved 5.10 3.71 3.36 
> Gave me a good many responsibilities 4.80 3.76 2.68 
Thought children had to take chances as they grew up 5.92 4.68 3.74 
Encouraged me to be curious 5.92 4.03 5.36 
Aon Let me know he appreciated my efforts 5.74 4.59 2.93 
Dreaded answering my questions about sex 3.08 4.38 3.38 
Wanted me to make a good impression on others 4.84 5.97 3.90 
Found it interesting to be with his children 4.76 3.53 2.90 
Instructed me not to get dirty while I was playing 1.94 2.79 2.90 
= Controlled me by warning of all the bad things that could happen 2.50 38: 2.94 
ge Didn’t want me looked upon as difficult 3.48 4.68 2.92 
£ Note.—The Q items are in somewhat abbreviated form, Seven categories of response were used, with “7” indicating items most 
characteristic and “1” indicating items least characteristic, Means in italics are more characteristic. 
= 


cording to the CRPR descriptions of the dis- 
continuity group. Although there is the pos- 
sibility of a retrospective rationalization of 
their alienation by the young people in the 
discontinuity group, the quality of child rear- 
ing ascribed to the parents by their children 
is consistent with the relatively more con- 
servative political philosophy known, by 
virtue of selection criteria, to be held by these 
parents. Further, the child-rearing character- 
istics attributed to their parents by the dis- 
continuity students are in accord with the 
child-rearing orientations of the actual parents 
of the discontinuity individuals, which will be 
reported later. 

Because it was anticipated that continuity 
would be facilitated when the child-rearing 
values of both mother and father were con-
gruent, the similarity of the child-rearing 
orientations of parent pairs as perceived by 
their child was evaluated by correlating the 
CRPR Q descriptions of mother and father to 
develop an index of agreement. The mean 
agreement score for the continuity group 
(r = .546) was significantly higher than the 
mean agreement for the discontinuity group 
(r = .351), a difference significant beyond the 
.001 level (one-tailed test). 
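
The index of agreement described above can be illustrated with a minimal sketch. It assumes that each student's agreement score is the Pearson correlation between his Q-sort placements for mother and for father, and that a group's mean agreement is the simple average of those correlations; the text does not state whether any transformation was applied before averaging. The function names and the short 7-item sorts are hypothetical (the CRPR itself has 91 items).

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of Q-sort placements."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sort")
    return sxy / sqrt(sxx * syy)


def mean_agreement(pairs):
    """Average the mother-father correlations over the students in one group.

    Assumption: a plain arithmetic mean of the correlations.
    """
    scores = [pearson_r(mother, father) for mother, father in pairs]
    return sum(scores) / len(scores)


# Hypothetical placements on a 7-point scale, for illustration only.
mother = [7, 6, 5, 4, 3, 2, 1]
father_similar = [6, 7, 5, 4, 3, 1, 2]
father_opposed = [1, 2, 3, 4, 5, 6, 7]

print(round(pearson_r(mother, father_similar), 3))
print(round(mean_agreement([(mother, father_similar), (mother, father_opposed)]), 3))
```

A higher group mean indicates that students in that group described their two parents in more similar terms.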


STUDY II: THE CHILD-REARING ORIENTA-
TIONS OF PARENTS OF CONTINUOUS AND 
DISCONTINUOUS STUDENTS 


Because of the possibility that the students’ 
perceptions of parental child-rearing values 
might be affected by self-serving distortions, 
particularly on the part of the more alienated 
discontinuity group, it seemed desirable to 
evaluate this alternative by undertaking a 
second study that would focus on students’ 
parents. 


Method 


Selection of comparison groups. In 1967, the 
CRPR and an attitude questionnaire were mailed 
to the 536 available parents of a randomly selected 
sample of 322 Berkeley undergraduates, participating 
in a study conducted by Robert Somers. Sixty-six 
percent of the parents completed the test materials. 
The questions that were asked of the students 
and of their parents corresponded closely with 
the questions that were included in the 1964 study; 
thus, it again was possible to classify subjects with 
respect to the continuity-discontinuity categories, 
using criteria similar to those used earlier. However, 
the selection criteria, while similar in nature, were 
necessarily less stringent in the present study for 
two reasons: (a) The available subject pool from 
which extremes could be selected was smaller (1,051 
subjects in 1964 and only 322 subjects in 1967), and 
(b) in the 1964 study, an explicit strategy involved 
the oversampling of activist students, who were more 
likely to be radical in their political commitments, while 
subjects in the 1967 study were a randomly selected 
sample. Since there were fewer students at the ex-
tremes, it was necessary to relax the selection criteria 
to achieve comparison samples of a statistically useful 
size, a compromise that results in less separation of 
the 1967 continuity and discontinuity groups, making 
statistical tests more conservative. 

All students rating themselves as politically radical, 
liberal-radical, or, unlike the 1964 study, liberal 
(Steps 8, 7, and 6 of the continuum) were identified 
(the N proved to be 95). The further criteria applied 
for inclusion in the discontinuity group were (a) the 
existence of a discrepancy of 3 points or more (not 
4 points or more, as in the 1964 sample) between 
the student’s rating of his political preference and the 
ratings by his parents of their political preferences, 
and (b) the overall index of agreement between 
parent and child on such contemporary issues as the 
Vietnam war, legalization of marijuana, civil rights, 
patterns of dress, etc., was below the mean agree- 
ment (not one sigma below as in the 1964 sample) 
characterizing the total sample. 

The criteria applied for inclusion in the continuity 
group were as follows: (a) There was a discrepancy 
of no greater than 1 point between the student’s 
rating of his political preference and the ratings by 
his parents of their political preferences (the average 
discrepancy proved to be .54), and (b) the overall 
index of agreement between student and parent 
exceeded the mean of the total sample. No data 
reflecting parental influence on the students were 
available, and this consideration, used previously, 
could not be invoked. 
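
The 1967 selection rules can be summarized in a short sketch. Only the thresholds come from the text (Steps 6 to 8 of the political continuum, a discrepancy of 3 points or more for discontinuity and of no more than 1 point for continuity, and agreement below or above the sample mean). The function name, its arguments, and the use of the mean of the parents' ratings to compute the discrepancy are assumptions made for illustration; the text does not say how the two parents' ratings were combined.

```python
def classify_1967(student_rating, parent_ratings, agreement, sample_mean_agreement):
    """Return 'discontinuity', 'continuity', or None (unclassified).

    student_rating: self-rating on the 8-point political continuum
        (8 = radical, 7 = liberal-radical, 6 = liberal).
    parent_ratings: the parents' self-ratings on the same continuum.
    agreement: the parent-child index of agreement on contemporary issues.
    sample_mean_agreement: mean of that index over the total sample.
    """
    # Only students rating themselves liberal or further left were considered.
    if student_rating < 6:
        return None

    # Assumption: discrepancy is measured against the mean parental rating.
    parent_mean = sum(parent_ratings) / len(parent_ratings)
    discrepancy = abs(student_rating - parent_mean)

    if discrepancy >= 3 and agreement < sample_mean_agreement:
        return "discontinuity"
    if discrepancy <= 1 and agreement > sample_mean_agreement:
        return "continuity"
    return None


# Hypothetical cases, for illustration only.
print(classify_1967(7, [4, 4], agreement=0.30, sample_mean_agreement=0.50))
print(classify_1967(6, [6, 5], agreement=0.60, sample_mean_agreement=0.50))
print(classify_1967(7, [5, 5], agreement=0.30, sample_mean_agreement=0.50))
```

Students who satisfy neither pair of conditions stay unclassified, which corresponds to the residual category reported in the counts that follow.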

Applying these criteria to this pool of students, 
13 women and 15 men were included in the con-
tinuity group, 8 women and 9 men were placed in the 
discontinuity group, and 23 women and 27 men 
either could not be classified or had parents who did 
not participate. 

In contrasting the discontinuity and the continuity 
groups in the two samples, the typical discontinuity 
or continuity student in the 1964 sample defined 
himself as a radical, with only a sprinkling of liberal-
radicals in the group (mean ratings are 7.90 and 7.64 
for the discontinuity and the continuity groups); the 
1967 discontinuity and continuity groups contained 
liberals and liberal-radicals in about even numbers, 
with only a few radicals present (mean ratings are 
6.75 and 6.40 for the discontinuity and continuity 
groups). The mean agreement scores of the 1964 
discontinuity and continuity groups were separated 
by more than two standard deviations, while the 
1967 discontinuity and continuity groups were 
separated by less than one standard deviation. 
These restrictions of range characterizing the 1967 
sample unfortunately attenuate the comparisons to 
be reported. 2 


Results 


Before comparing the parents of the con-
tinuity and the discontinuity groups with re-
spect to their socialization values and prac-
tices, their comparability was assessed with 
respect to educational achievement and occu-
pational status, indexes that could influence 
socialization attitudes and confound the inter-
pretation of results. No differences even ap-
proaching significance were found between the 
parents of the continuity and the discontinuity 
groups with respect to either of these indexes; 
the parents of the discontinuity group were 
slightly better educated, while the occupa-
tional status of the parents of the continuity 
group was somewhat higher. 

Tables 3 and 4, below, present the CRPR 
items reliably differentiating between the par-
ents of the discontinuity students and the 
parents of the continuity students. The results 
are reported for the sexes combined in order 
to simplify presentation, since sex-separated 
analyses reveal generally equivalent orienta- 
tions in the parents of male and female 
children classified in a like manner. 
TABLE 3


MOTHERS’ CRPR SELF-DESCRIPTIONS SIGNIFICANTLY DIFFERENTIATING THE CONTINUITY (C), 
DISCONTINUITY (D), AND CONSERVATIVE CONTINUITY (CC) GROUPS 


CRPR item | Continuity group mean (n = 30) | Discontinuity group mean (n = 22) | t ratio, C vs. D | Conservative continuity group mean (n = 26) | t ratio, CC vs. C | t ratio, CC vs. D 
»-. I encourage my child always to do his best 6.13% 6.14 ET 6.69 | 2.67** 1.94 
I often feel angry with my child 3.578 2.59 1.87 2.65 1.87 — 
I punish my child by isolation 3.45 3.50% a 2.50 | 2.03* 1.81 
I watch closely what and when my child eats 4,538 3.91 — 3.73> | 1.97 = 
I wish my husband were more involved with the 
children 2.27 Bios 2.43* 2.69 — — 
I try to stop my child from playing rough games where 
t E Aek wet hurt p 3.678 3.36 = 2.88> | 1.68 = 
T think it is good practice for a child to perform 
before others 3.57 3:50. — 4.198 | 1.72 1.69 
I gain some of my greatest satisfactions from 
Bh ny child 5.47 | 614 | 1.76 | 5.96 a = 
* Tencourage my child to wonder and think about life 5.972 5.59 — 5.15 | 2.18* — 
I teach my child that punishment will “find” him 2.67% 3.55 1.96 3.19 Te — 
~ I don’t allow my child to get angry with me 2.57 3.00" — 2.11 = 2.32s 
I give up some of my own interests for my child 4.47 4.32» — 5.048 = 1,69 
I have never caught my child lying 3.27 3.648 — 2,54» ST 2:21% 
T talk it over and reason with my child when he 
misbehaves 5272 5.54 6.158" |2.82** 1.83 
I have strict, well-established rules for my child 3.23 3.14 on 4.08" | 1.81 1.72 
al encourage my child to be curious 5:93 6.14% TE, 5.50% = 1.68 
7% I sometimes feel I am too involved with my child 3.57 SALAA = 2.77% | 1.78 2.03* 
I believe children should be weaned early 3:134 2.36 172: 3.00 — — 
4 I believe in early toilet training 3.40 2.86% — 4.198 >= 2.66* 
x~ I sometimes use the supernatural in explaining 
things to my child 3.00 3.59% = 2.65» aes 1.79 
P I encourage my child to talk about his troubles 6.508 5.82» 22S 6.31 — — 
I dread answering my child’s questions about sex 2.07 2.778 1.91 2.54 — -> 
I think my child should be encouraged to do better 
than others 3.30! 3.59 = 4.08" | 2.02* — 
I punish my child by taking away his privileges 4.10 3236F = 4.698 = 3.13** 
` T give my child extra privileges when he behaves well | 4.27% 3.45» 1.92 4.19 — 1.90 
I sometimes tease and make fun of my child 2.97 2.00” 2.27* | 3.088 = 2.37* 
I teach my child that he is responsible for what 
happens to him 4.67» 5.50% 1.74 4.69 = 1.72 
ere is a good bit of conflict between my child ea Ane Seamer 
and me A a = a -9 1.95 
I do not allow my child to question my decisions 2A: 2.954 2.13* | 2,08» eS 21h 
x [feel it is good for my child to play competitive 
games 4.80» 5.64 2.46* 5.88" | 3.66** = 
I let my child know how ashamed and disappointed 
I feel when he misbehaves 4.00 3.95» = 4.77 | 1.89 1.76 
I want my child to make a good impression on others | 5.17 4.825 = 5.54 =e 1,84 
I encourage my child to be independent 6,208 5.82 = 5.61 |. 1.77 — 
T think it is important for my child to play outside 
in fresh air 5.10 5.41 — 5.96 | 2.75** — 
s <T believe it is unwise to let my child play without 
supervision 3.272, 4.458 3.01**| 3.46 — 2.12* 
Note. The Q items are in somewhat abbreviated form. Seven categories of response were used, with “7” indicating the most 
characteristic and “1” indicating the least characteristic. 
a Means that are most characteristic. 
b Means that are least characteristic. 
* p < .05. 
** p < .01. 


Comparing the content of the distinguishing 
items, the parents of the discontinuity stu- 
dents appeared to be more concerned with 
conventional values than were the parents of 
the continuity group. Competition and the 
importance of maintaining a good impression 
were given greater emphasis by the discon-
tinuity parents. Further, the discontinuity 
parents appeared to be more authoritarian, 
restrictive, and suppressive in their socializa- 
tion practices, as shown by their more fre- 
quent endorsement of items reflecting physical 
TABLE 4


FATHERS’ CRPR SELF-DESCRIPTIONS SIGNIFICANTLY DIFFERENTIATING THE CONTINUITY (C), 
DISCONTINUITY (D), AND CONSERVATIVE CONTINUITY (CC) GROUPS 


CRPR item | Continuity group mean (n = 28) | Discontinuity group mean (n = 17) | t ratio, C vs. D | Conservative continuity group mean (n = 24) | t ratio, CC vs. C | t ratio, CC vs. D 
I respect my child’s opinions 6.46" 5.76 2,20* Si58P 2:52" = 
I help my child when he is being teased 4.48" 3.41> | 2.38* 4.21 += 1,78 
T expect my child to handle his problems by himself 3.14 4.06 | 1.94 2.79» -= 2.29* 
I don’t think young children of different sexes 
should see each other naked 2.89 4.00" | 2.04* 3.21 — — 
I think physical punishment is the best method 
of discipline 1,78» 2.82 2.52* 3.33s | 3.549 — 
T think it is good practice for my child to perform 
before others 3.710 5.00" | 2.84** | 4.00 —- 1.83 
I express affection by hugging, kissing, and holding 
my child 4.11 4,358 — 3.338 — 2.66* 
T encourage my child to wonder and think about life 6.11" 5.47 — 4.71 | 3.57** — 
I find it difficult to punish my child 5.25" | 4.59 —_ 3.83 | 2.88%" | — 
I don’t allow my child to say bad things about | 
his teachers. 2.93» 4.12 PANY fal 4.13" | 3.14** => 
I am easy going and relaxed with my child 4.86" 3.716 | 2,20* 4,42 — — 
I give up some of my own interests for my child 5.25 441b | 2.02* 5.428 — 2.20* 
I tend to spoil my child 4.648 3.41» | 1.92 3.92 — == 
I have never caught my child lying 3.00? 4.358 | 2.04* 4.12 2.05* — 
I have strict, well-established rules for my child 2579 3.00 FR 3.71" | 1.85 — 
I expect my child to be grateful 2.71 4,598 | 3.79** | 3.96 2:51. — 
I sometimes feel I am too involved with my child 2.938 2.82 = 2.21> | 1.80 — 
I believe in early toilet training 3.21? 3.59 = 4.25% | 2.34* — 
I think praising a child when he is good gets better a 
results than punishing bad behavior 5.79% 4.88> | 1.77 5.21 1,92 —- 
I don’t think children should have secrets from ‘ly 
their parents 2.825 | 4.248 | 3.10** | 3.00 hs 2.34* 
I try to keep my child from fighting 4.93% 3.5303 0S 383 A2 — 
I dread answering my child’s questions about sex 2.43» 2.82 = 3.29" | 1.68 = 
I punish my child by taking away his privileges 4,07 3.53b — 4,838 — 2.62* 
I believe my child should be aware of all 
my sacrifices 1.96 | 2:35 2 BEES leg oar AE aoe a 
I sometimes tease and make fun of my child 2.93 3.298 — 2.002- | 1.77 2,22* 
There is a good deal of conflict between my child 
and me 2.32 2.94a — 1.92> _ 1.80 
I don’t allow my child to question my decisions 2.18 2.71 — 3.04" | 2.17* — 
I feel it is good for my child to play competitive 
games 5.14 512b => 5.88" | 2.08* —= 
I like to have some time to myself away from 
my child 5.00" | 4.94 — 4.25> | 1.72 _ 
T let my child know how ashamed and disappointed 
I feel when he misbehaves 3.68 3.47 = 4.372 — 1.83 
I want my child to make a good impression on others | 4.86 5.65 | 1.77 5.21 — — 
I find it interesting to be with my child for long 
eriods 4.578 4.35 — 3.71> | 1.98 — 
I think jealousy and quarreling between siblings 
should be punished 3.14 | 3.65 — 45215 | 2.30* — 
I think children should learn early not to cry 2.07 2.53 -5 2.918 | 2.23* — 
I don’t think a child should be given sex information 
until he can understand everything 2.54 3.23 = 3.54" | 2.24* — 
I enjoy seeing my child eating well and enjoying 
his food 5.68" 4.82 | 1.83 5.29 — = 
I don’t allow my child to tease and play tricks 
on others 4.39 3.71 — 4.588 — 1.95 


Note. The Q items are in somewhat abbreviated form. Seven categories of response were used, with “7” indicating the most 
characteristic and “1” indicating the least characteristic. 

a Means that are most characteristic. 

b Means that are least characteristic. 

* p < .05. 
** p < .01. 
punishment and the “thou shalt nots” of child 
training. Children were not allowed to ques-
tion decisions, to be critical of their teachers, 
to have secrets from their parents, or to play 
unsupervised. Finally, there is some sugges-
tion of greater anxiety among the parents of 
the discontinuity students with respect to ade-
quacy in the parental role. They indicated 
greater concern about coping with the sexual 
development of their children, demanded 
gratitude and appreciation from their children, 
and tended to require that their children as-
sume the responsibility for solving their own 
problems. The earlier noted tendency of par-
ents to use anxiety and guilt-arousing mecha-
nisms of control was reproduced here in items 
reflecting gratitude and the invocation of 
threats of punishments from an impersonal, 
unpredictable, but implacable source. The 
results for the discontinuity parents seem to 
cohere with those of the earlier study. 

Turning now to the descriptions of child-
rearing practices provided by the parents of 
the continuity students, the results are some-
what more equivocal, particularly on the part 
of the continuity mothers. These parents ap-
peared to have relationships with their chil-
dren that both respected the child and en-
couraged his individuality, as witnessed by 
items valuing the child’s opinions, encouraging 
discussion of problems with the child, and 
accepting the child’s privacy needs. Inter-
actions between mother and child also were 
described more candidly, since the continuity 
mothers were freer in admitting, or denied 
less vehemently, their negative feelings and 
their less exemplary socialization practices. 
Although the continuity parents described 
themselves as somewhat more indulgent and 
protective, they also were demanding of their 
children; the mothers believed in early wean-
ing, and the continuity fathers expected their 
children to assume household and family re-
sponsibilities and were significantly more con-
cerned with the control of aggressive behaviors 
than were fathers of the discontinuity group. 
The 1967 child-rearing descriptions by parents 
did not emphasize the importance of encour-
aging reflectivity, curiosity, and introspection 
as was true of the continuity students’ percep-
tions of parental practices. 

As before, the agreement between parent 
pairs in their child-rearing attitudes was 
evaluated by correlating their respective 
CRPR Q descriptions. The findings replicate 
those of the 1964 study based on perceptions 
of parental agreement. Again, the parents of 
the continuity individuals displayed greater 
agreement in their orientations toward child 
rearing (r = .552) than did the parents of the 
discontinuity individuals (r = .450), a differ- 
ence significant beyond the .025 level (one- 
tailed test). 

Considering the CRPR results for both the 
1964 and 1967 studies, there seems to be a 
good deal of consistency in the relationships 
observed, despite the different contexts of 
application and the differences in sampling 
constraints. Unlike the 1964 study, wherein 
the unflattering portrayal of the discontinuity 
parents conceivably could be interpreted as a 
retrospective justification by the discontinu- 
ity subjects for their alienation from parents, 
the 1967 study, employing the parents of the 
discontinuity individuals, cannot be so inter- 
preted. Indeed, in the latter study, one 
would expect the parents of the discontinuity 
subjects to justify, retrospectively, their 
estrangement from their children by portray- 
ing their socialization practices more posi- 
tively in an attempt to minimize their respon- 
sibility for the subsequent alienation of their 
children. In this case, distortion in the more 
positive direction would make significant dif- 
ferences between the continuity and dis- 
continuity parents more difficult to achieve. 

On the basis of both studies, the constella- 
tion of significant items suggests that con- 
tinuity parents experienced greater comfort in 
their parental roles, were more candid, and 
encouraged the individuation of their children, 
placing less emphasis on authoritarian con- 
trol and discipline per se. Greater congruence 
of parental child-rearing philosophies also 
characterized the continuity parents in both 
studies. The discontinuity parents, on the 
other hand, appeared more tense in the 
parent-child relationship and were more con- 
cerned with establishing and maintaining 
control over their children. The parents were 
in less agreement about their child-rearing 
values, which must result in inconsistent feed-
back to the child. Viewed in this light, the ali-
enation of the discontinuity students may rep-
resent an attempt to escape excessive demands 
and intrusive, authoritarian controls imposed 
by inconsistent parents who often disagree 
with each other about their child-rearing 
attitudes.3 


STUDY III: UNCONFOUNDING IDEOLOGY AND 
GENERATIONAL CONTINUITY 


To further understand the forces of sociali- 
zation influencing the continuity and discon- 
tinuity individuals, whose parents differ 
widely in their political (and social) orienta- 
tions, a third group of students was identified 
from the 1967 subject pool that shares with 
the continuity group a continuity across gen- 
erational boundaries and that shares with the 
discontinuity group the conservative political 
affiliation of their parents. This group, the 
continuous conservative group, was composed 
of students whose self-rated political position 
placed them in the conservative or moderate 
categories (of the 8-point political continuum) 
and whose self-rated ideological position did 
not deviate more than one step from the rat- 
ings by their parents on the same political 
continuum. These parents are slightly more 
conservative than their children and are simi- 
lar in their degree of political conservatism to 
the parents of the discontinuity individuals. 
Twenty-eight subjects (18 males and 10 fe- 
males) were identified by these criteria. Of 
this group, further information revealed their 
acceptance of the societal status quo, since 
26 had not engaged in student demonstrations; 
the remaining two subjects had actively op-
posed student demonstrations.4 

3 Although discontinuity parents disagree with each 
other in terms of their child-rearing attitudes, it is 
important to note that they did not manifest greater 
disagreement in the political-social sphere. When 
mothers’ and fathers’ ratings of endorsement are com-
pared on 12 political-social dimensions (attitudes 
about student protests, drugs, effectiveness of Black 
Power, etc.), the mean standardized agreement scores 
were 49.4 and 49.0 for parents in the continuity and 
discontinuity groups, respectively, and 48.8 for the 
continuous conservative group (see below). 

4 No liberal discontinuity types could be found in 
the subject pool, an observation supporting the find-
ings of Middleton and Putney (1963), who also 
noted that conversion to a conservative political 
ideology by young people reared in a liberal political 
tradition is a rare phenomenon. 


The CRPR Q descriptions of the parents of 
the continuous conservative students were 
contrasted with those of the parents of the 
continuity individuals, to identify the items 
differentiating parents holding conservative 
versus liberal political positions where their 
children have identified with parental ideology 
in each case. The CRPR Q descriptions of 
the parents of the continuous conservative 
students were also compared with those of the 
parents of the discontinuity individuals, to 
identify the items differentiating parents who 
rear children identifying with them from 
parents rearing children who have become 
disaffiliated where political ideology is held 
constant (at a conservative position). 

The results of these comparisons are also 
included in Tables 3 and 4 in the last two 
columns. The results are reported for the 
sexes combined to save space, since the results 
are not importantly associated with gender. 

Socialization Differences Associated with 
Conservative versus Radical-Liberal Ideology 


The child-rearing values of the continuous 
conservative group appear consistent with the 
conservative political attitudes characterizing 
the group. Contrasted against the liberal par-
ents of the continuity students, the continuous 
conservative parents emphasized discipline and 
self-control in child upbringing. Fathers in the 
continuous conservative group described them-
selves as more authoritarian with their chil-
dren, emphasized power and status differences 
between parent and child, encouraged the 
suppression of impulse and affect, and encour-
aged guilt. Mothers in the continuous con-
servative group appeared more concerned 
with competition and achievement. 

Parents in the continuity group diverged 
from the continuous conservative parents in 
being less concerned with the schematizing of 
experiences and the suppression of impulse. 
Rather, these parents fostered introspection 
and encouraged their children to confront 
experience. The salient items suggest that the 
liberal parents of the continuity individuals 
emphasize the child’s individuation, are candid 
in their interaction with the child, are less 
concerned with obedience, and place less em- 
phasis on impulse control in areas other than 
aggression, where they are the most restrictive. 
The socialization items differentiating the 
two continuity groups suggest that both power 
assertion (authoritarianism) and concern with 
the schematizing of experience are related to 
parental ideology; however, neither of these 
dimensions appears to be associated neces- 
sarily with alienation from parents, since 
parents placing high on these dimensions as 
well as those placing low both have adoles-
cent young who manifest generational conti-
nuity. The average agreement score for par-
ents of the continuous conservative students 
was .516, and it did not differ significantly 
from the continuity group mean. 


Socialization Differences Associated with 
Acceptance versus Rejection of Parents 


Comparisons of the two ideologically con- 
servative groups of parents who differ with 
respect to the generational continuity-discon-
tinuity shown by their children suggest that 
the child-rearing orientations of the parents 
of the discontinuity students are less coherent 
than those of the parents of the continuous 
conservative students. Mothers in the discon- 
tinuity group described themselves as overly 
involved with their children, they insisted that 
their children were always truthful, and they 
did not press for early training. Conjointly, 
these mothers emphasized authoritarian con- 
trol—they forbade their children to question 
their decisions, they did not allow the expres- 
sion of anger, and they felt most strongly that 
children should not be left to play without 
adult supervision. In contrast, mothers in the 
continuous conservative group appeared con- 
sistent, clear, and certain in their maternal 
roles—they were more accepting of anger, 
they had well-defined expectations for their 
children, and they dealt with transgressions. 

The fathers in the discontinuity group char- 
acterized their relationships with their chil- 
dren as relatively more ambivalent and con- 
flicted. They described themselves as wanting 
their children to handle their own problems 
but, also, as not wanting their children to 
have secrets from them. They ascribed less 
salience to items reflecting responsible parent- 
hood (providing direction, limits, guidance, 
and support), while emphasizing physical interactions
with their children. In contrast, the
fathers in the continuous conservative group
appeared more conscientious, helpful, and 
consistent in their parental roles. 

In searching for patterns of child rearing 
that may be associated with adolescent re- 
jection of parental values, the data suggest 
that acceptance of the child and conscientious, 
consistent, rational parenthood are more 
critical qualities or emphases than the degree
of control imposed on the child. The child- 
rearing orientations of parents in the discon- 
tinuity group are, comparatively, less co- 
herent and less rational. The discontinuity 
mothers tended to be permissive in some areas 
and demanded submission and compliance in 
others, while the discontinuity fathers were 
less conscientious and consistent in their pa- 
ternal roles. The data on parental agreement 
also tended to reflect the greater inconsis- 
tency of the discontinuity parents, since the 
average agreement score for the discontinuity 
parents (.450) was lower than that of the 
continuous conservative parents (.516), a 
difference significant at the .07 level (one- 
tailed test). 


Discussion 


The results of this study are in accord with
the observations of other investigators who 
have concerned themselves with generational 
continuities and discontinuities. Watts and 
Whittaker (1968) concluded that the different 
styles of life characteristic of Berkeley activists
and Berkeley “street people” were critically
determined by the extent to which the
intellectual, religious, political, and social 
values of the parents are relevant to and 
incorporated by the young. From his assess- 
ment of radical and militant youth, Leibert 
(1969) developed a typology in which he distinguished
between “idealistic” and “nihilistic” radicals.
He found that clear discontinuities
in the moral and political value
systems between parents and adolescents were 
associated with both “nihilistic” radicalism 
and poor ego integration, while the values of 
the more integrated “idealistic” radicals had 
a strong similarity to those of their families. 
Leibert’s conclusions fit well with the findings 
reported here. Finally, these results are consistent
with Keniston’s findings (1965, 1968)
where uncommitted, alienated youth were
found to be estranged from their families and 
to have rejected parental value systems, while 
committed young radicals were found to hold 
values similar to their parents and to evidence 
considerable identification with their parents. 

Clearly, the findings surrounding the conti- 
nuity group challenge the monolithic inter- 
pretations of student activism offered by 
Feuer (1969) and Rubinstein and Levitt 
(1969), wherein protest is seen as a universal, 
intemperate expression of Oedipal rebellion. 
The results of the present study further docu- 
ment the heterogeneity of the left and argue 
against a single, simplistic theory of political- 
social protest, equally applicable to all critics 
of contemporary society. Parental acceptance 
of, and even impulsion toward, radical social 
action is characteristic of the continuity 
subjects in whom identification, rather than 
rebellion, appears to be the child-parent 
leitmotif. 

We must note, however, that it is possible
to view the findings for the discontinuous
individuals, who have come to their political- 
social commitments in the absence of a liberal 
family tradition and without parental encour- 
agement, as offering some support for the 
Oedipal thesis. The conflict attributed to the 
parent-child relationship, the rejection of pa- 
rental values, and the tendency toward sex- 
role reversal suggested by the self-descriptions 
of the discontinuity group all can be fitted to 
an interpretation in terms of unresolved 
Oedipal conflicts. But alternative explanations 
of the discontinuity subjects are possible as 
well, since generational discontinuities cannot 
be taken as prima facie evidence of Oedipal 
rebellion. In any event, the heterogeneities 
found to characterize societal rejections 
clearly challenge attempts to ascribe protest to 
a single, universal cause. 

The data further suggest that rebellion 
against parental values, as distinct from rebel- 
lion against societal views, may be viewed as 
a reaction of the young adult to the absence 
of consistency and coherence in his parents’ 
child-rearing philosophies. Loevinger (1959) 
suggested that the battle between the genera- 
tions is never more vicious than when parental 
reactions to a child are an expression of unmodulated
impulse rather than determined by
reason. She proposed that any parent-held
theory of child rearing, even a theory that
might fail to pass the muster of psychologists,
is better than no theory at all, because it
provides the child with a parental example of
the principle of self-regulation for the welfare
of another. Loevinger suggests that it is the
socializing intention of the parent more than
the kind of socializing that civilizes the child.
The data tend to support her observation,
since the child-rearing attitudes of the parents
of the discontinuity subjects appear to be less
guided by principle and to be more determined
by parental conflicts and anxieties. Although
the value systems reflected in the socialization
practices of parents of the continuity and continuous
conservative subjects diverge, each appears
to have articulated and coherent notions
about what is important in child rearing.
The greater intraparent consistency that
appears to characterize both the continuity
and the continuous conservative groups is reflected
in greater interparent consistency as
well, since there tends to be greater agreement
between the parents about child-rearing
values for both the continuity and the continuous
conservative groups than for parents
in the discontinuity group. It is interesting
that agreement or disagreement between parents,
with respect to political-social views, is
not necessarily related to adolescent acceptance
or rejection of parental values as noted
above, perhaps because ideological differences
between parents are manifested relatively abstractly
and impersonally. Indeed, differences
in parental political attitudes may contribute
to an enriched intellectual environment for
the developing child. Differences in socialization
values, however, center upon the child,
who consequently can be buffeted by his disagreeing
parents and be torn in his loyalties.
Agreement between parents with respect to
child-rearing values, on the other hand, adds
to the overall consistency of the child’s socialization
experiences, reduces contradictory
feedback, and contributes to the child’s
sense of coherence and predictability in his
environment.

It has long been recognized that some de- 
gree of continuity across generational bounda- 
ries is essential to personality integration. 
Erikson (1965) suggested, “To enter history,
each generation of youth must find an identity
consonant with its own childhood and consonant
with an ideological promise in the perceptible
historical process [italics added, p.
24].” Continuity establishes some basis for
communication and allows for ego-syntonic 
ways of negotiating conflicts. The continuity 
subjects appear to be finding a way to main- 
tain the integrity of their own contemporane- 
ously defined but historically influenced value 
positions—to be achieving a clearer sense of 
identity—without the need to reject their 
parents or to sacrifice familial ties.

Finally, we must note that the data from 
the studies reported here were gathered in an 
earlier political age, a time when protest was
generally peaceful and demonstrators were
generally good-natured and somewhat optimistic
about the usefulness of their efforts. It
may well be that the significance of our findings
for the current political arena is modified
appreciably by the recent widespread view
within the Left that political change will
come too slowly to prevent societal disaster.
Although our data cannot speak directly to
the present, some extrapolations seem possible.

The discontinuity group is composed of 
radical, restless, searching young people centrally
concerned with issues of personal expressiveness.
Their political radicalism seems
more a manifestation of a personal orientation
toward immediacy than a considered and sustained
commitment to social change. Accordingly,
they would come quickly to accept the
view that societal change is not feasible when
the system does not respond immediately and
their goals are frustrated. Two alternative behavioral
consequences can be anticipated for
the discontinuous individuals when their disenchantments
are reinforced by an unresponsive
society. They can seek their gratifications
in the nonpolitical realm by “dropping out,”
or they can aggress against the source of their
political frustration by violent means. It seems
likely that the continuous individuals will be
less pulled by either of these two alternatives.
Unfortunately, research documentation of
these conjectures may be long in coming,
because research participation by today’s Left
is becoming more difficult to achieve.


RACE AND BELIEF:
A FURTHER ANALYSIS OF THE SOCIAL DETERMINANTS OF
BEHAVIORAL INTENTIONS 1


MARVIN GOLDSTEIN 2 AND EARL E. DAVIS 3


Graduate Faculty of Political and Social Science 
New School for Social Research, New York 


In a replication and extension of the Triandis and Davis study, 150 white 
student subjects responded on behavioral differential scales to stimulus persons 
who varied in race and two beliefs. The subjects also evaluated civil rights 
issues, along with the stimulus-person beliefs. Using multidimensional scaling 
and multimode factor analysis, two subject factors were isolated. These two 
subject types, called race rejectors and belief rejectors, differed systematically 
with respect to the importance attached to the race or belief characteristics of 
the stimulus persons. Furthermore, race was increasingly important for all sub- 
ject types as the response continuum became more intimate. A cube of data 
consisting of stimulus person characteristics, response continuum, and subject 
type was presented in interpreting the race versus belief controversy. Finally,
the hypothesis that race rejectors would perceive their important others as more
disapproving of interracial contacts than would belief rejectors was confirmed.


In a previous article, Triandis and Davis
(1965) attempted to resolve a conflict in the
literature between Triandis (1961) and Rokeach
(1961) concerning the relative importance
of belief versus racial or ethnic group
membership in the acceptance or rejection of 
stimulus persons. Rokeach, Smith, and Evans 
(1960) and Rokeach (1961) held that non- 
acceptance of stimulus persons was largely 
determined by perceived dissimilarity of their 
beliefs, rather than by their racial or ethnic 
membership. Triandis (1961) argued that 
prejudice consisted of more than nonaccep- 
tance of friendship behaviors; his data showed 
that when “large degrees of social distance,” 
such as excluding a person from one’s neigh- 
borhood, were involved, race rather than be- 
lief was the major determinant of behavioral 
intentions. Stein, Hardyck, and Smith (1965), 


1 This article is based, in part, on a dissertation
presented by the first author in partial fulfillment of 
the PhD degree at the Graduate Faculty of Political 
and Social Science of the New School for Social Re- 
search, under the supervision of the second author. 
The authors are indebted to H. C. Triandis for his 
valuable comments on an earlier version of the manu- 
script. 

2 Now at Rider College. 

3 Requests for reprints should be sent to Earl E. 
Davis, who is now at the Health Services Mobility 
Study, Research Foundation of the City University of 
New York, 346 Broadway, New York, New York 
10013. 


in a further study, sought to resolve these
differences. In general, their findings tended 
to confirm the earlier findings of Rokeach et 
al. (1960), that belief rather than race was 
the major determinant of social acceptance or 
rejection. However, their study did show that 
with “sensitive” items on a social distance 
scale, representing “institutionalized” areas of 
prejudice, race became a very significant de- 
terminant, in some cases more significant than 
belief. These authors, though basically supporting
the Rokeach position, seemed to
go a step in the direction of reconciling the 
differences by showing that the Triandis posi- 
tion held with respect to certain “sensitive” 
items. Beyond this suggestion, however, these 
authors offered no systematic explanation of 
the markedly contradictory findings of Rok- 
each and Triandis, and the reader was left 
with the feeling that the controversy had not
been resolved. The question of the relative 
importance of race and belief seemed far from 
being “an open and shut case.” 

The Triandis and Davis (1965) article, 
utilizing data gathered as part of a larger 
study of intercultural negotiations (Davis & 
Triandis, 1965), presented a model which 
showed that the apparently contradictory find- 
ings of Triandis and Rokeach were compati- 
ble if two previously ignored parameters were 
taken into account.

In the first place, in the earlier work, as in
most previous work in attitude measurement, 
an implicit assumption of unidimensionality 
of attitude structure was made. Triandis 
(1964), in a factor analytic study of statements
of behavioral intentions, demonstrated
that such items are multidimensional in nature.
He found five clearly interpretable factors,
and subsequent research has demonstrated
the relative stability of most of these
factors. Triandis (1967) provides a partial
review of research using this technique, which
is known as the behavioral differential. Triandis
and Davis (1965) systematically manipulated
this parameter, that is, the type of item
or the nature of the response continuum.


Eight stimulus persons were utilized, generated
by all possible combinations of the
characteristics Negro-white, male-female, and
pro- versus anti-civil-rights legislation. This
constituted a 2 × 2 × 2 factorial design of
stimulus characteristics. Six clusters of items
or response continua were utilized, including
the evaluative factor of the semantic differ- 
ential (Osgood, Suci, & Tannenbaum, 1957) 
and the five factors of the behavioral differ- 
ential, isolated earlier by Triandis (1964). 
One of Triandis and Davis’ (1965) general 
findings was that subjects generally responded 
more on the basis of belief, as far as the
semantic differential and the least “intimate”
factor of the behavioral differential were concerned,
but that when responding on the more
intimate factors (e.g., those involving such
items as “would accept as close kin by marriage,”
“would exclude from my neighborhood,”
etc.), race controlled a significantly
greater percentage of the variance than did
belief. This finding would seem to be consistent
with the Stein et al. (1965) results,
which showed a greater race effect on “sensitive”
items. However, instead of merely finding
certain exceptions to an otherwise general
belief effect on the basis of certain sensitive
items, Triandis and Davis were able to show
that when their six response continua were
ordered in terms of increasing degrees of “intimacy,”
there was a direct linear relationship
such that the effect of race increased as the
response continua became more intimate.
The second parameter that Triandis and 
Davis (1965) considered, and one that has
been largely ignored in subsequent reviews of
their findings, is that of the nature of the
subjects involved. Using a modified version
of the Tucker and Messick (1963) procedure 
for factoring an individual differences matrix, 
they isolated two subject factors, or “idealized 
subject types.” High-loading subjects on both 
of these subject types evaluated civil rights 
issues negatively; however, one of the types 
tended to reject stimulus persons primarily
on the basis of race and rated civil rights
issues as relatively unimportant, while the
other type tended to reject the stimulus persons
primarily on the basis of belief and rated
civil rights issues as relatively important. Subsequent
analyses of variance, performed separately
for subjects with high loadings on the
two different subject factors, confirmed that,
for all response continua, the “race prejudiced”
subjects showed a greater effect due to
race than did the “belief prejudiced” subjects;
conversely, the belief-prejudiced subjects
showed a greater effect due to belief than did
the race-prejudiced subjects. Neutral subjects,
that is, those without high loadings on
either of the two subject factors, were intermediate
with respect to the other two types,
although they, along with the other subject
types, showed the same general trend of an
increasing race effect as the response continua
became more intimate.

Thus, the Triandis and Davis (1965) find- 
ings suggest a model (as illustrated in Figure 
1), whereby any given response (e.g., acceptance
or rejection) may be conceived of as a
point in a three-dimensional space, determined
not only by the characteristics of the stimulus
person (e.g., racial or belief characteristics),
but also by the other two parameters, namely,
the response continuum (e.g., intimate versus
nonintimate) and the characteristics of the
subjects (e.g., race prejudiced versus belief
prejudiced types).

[Figure 1 axes: Stimulus Persons; Subject Characteristics; Behavioral Intentions (Response Continua)]
FIG. 1. Schematic representation of the relationship
between subjects, stimulus persons, and behavioral
intentions.

Of course, it is recognized that other stimu- 
lus characteristics, aspects of the response 
continua, and characteristics of subjects may 
be important determinants. Furthermore, 
dimensions other than the three just men- 
tioned should be considered (e.g., situational 
variables, the effects of social norms, etc.). 
Thus, a more complete model would involve 
an n-dimensional space with each “dimension” 
being, in turn, multidimensional. However, in 
the context of the present discussion, the 
three-dimensional model suggested in Figure 1 
would seem to be the minimal level of complexity
required, given available information,
to deal with the questions raised by the
Rokeach-Triandis controversy. Any attempt 
to state flatly that either race or belief is 
“the” most important determinant of social 
acceptance or rejection, thus taking into ac- 
count only the characteristics of the stimulus 
person—but ignoring the other two param- 
eters—would seem to be a misleading and 
unnecessary oversimplification. 

The purposes of the present study were 
essentially twofold. One was an attempt to 
replicate and generalize the Triandis and 
Davis (1965) findings. The other was to 
investigate the role of social norms in de- 
termining statements of behavioral intentions. 

In pursuing the first objective, a possible 
criticism of the previous study was taken into 
account. A point could be made that the belief 
relating to civil rights legislation was not 
orthogonal to attitudes toward race, thus pos- 
sibly confounding these two variables. The 
present study utilized two beliefs, both of 
which were empirically determined to be 
orthogonal with respect to racial issues, as 
will be discussed below. 

In investigating the role of social norms as 
determinants of behavioral intentions, the 
present authors take as a point of departure 
the theoretical interpretation offered by 
Triandis and Davis (1965, p. 724), and
it is furthermore hypothesized that race-prejudiced
subjects perceive their important
others as being more disapproving of interracial
contacts than do belief-prejudiced
subjects.

METHOD 
Subjects 


One hundred and fifty white undergraduate stu- 
dents from Rider College, Trenton, New Jersey, and 
New York Community College and Hunter College, 
New York City, served as subjects.


Questionnaire 


The investigators were interested in obtaining the 
subjects’ responses to various stimulus persons on a
variety of scales, or response continua. They were
also interested in the subjects’ attitudes toward a
number of social issues. A further concern had to do
with the perception of social norms, particularly as
these related to interracial behaviors. Finally, several
“personality” characteristics were studied. The questionnaire
can thus be briefly described in terms of
these elements.


Stimulus Persons 


Three stimulus person characteristics were manipulated
in a 2 × 2 × 2 analysis of variance design,
generating eight stimulus persons. In addition to
race (Negro and white), the stimulus persons were
presented as either favoring or opposing two belief
propositions. One of these was that “there should be
a guaranteed minimum annual income for all families
living in the United States” (income issue); the
other was that “the United States should abide by
any decision made by a majority of the members
of the United Nations” (United Nations issue).
Furthermore, each stimulus person was presented as
being a college student of the same sex as the
subject.
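
As an illustration only, the 2 × 2 × 2 design described above can be enumerated in a few lines of Python. This is a minimal sketch, not the authors' procedure: the dictionary keys and the function name `stimulus_persons` are hypothetical labels introduced here, and only the factor levels themselves come from the text.

```python
from itertools import product

# Factor levels taken from the text; the key names are illustrative labels only.
FACTORS = {
    "race": ("Negro", "white"),
    "income_issue": ("favors", "opposes"),
    "united_nations_issue": ("favors", "opposes"),
}


def stimulus_persons(factors=FACTORS):
    """Return every combination of factor levels as a list of dicts."""
    names = list(factors)
    return [dict(zip(names, levels)) for levels in product(*factors.values())]


persons = stimulus_persons()
for person in persons:
    print(person)
```

Crossing three two-level characteristics yields the eight stimulus persons mentioned in the text, each a distinct combination of race and the two belief positions.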


Scales 


The primary interest was in obtaining the subjects’
responses to the stimulus persons on behavioral differential
scales. However, instead of relying on previous
factor-analytic studies of the behavioral differential,
a pretest was utilized specifically for this
study. In the pretest, a group of subjects comparable
to those used in the main study responded on 16
behavioral differential scales to 32 stimulus persons,
similar to the ones used in the main study. On the
basis of these data, a factor analysis of the scales
was performed, and three clearly interpretable factors
emerged. The first factor was called acquaintance
acceptance-rejection and was characterized by items
such as “would accept as a speaking acquaintance.”
The second factor was called friendship acceptance-rejection,
with high loadings for items such as “would
go on a trip with.” The third factor was clearly the
ubiquitous social distance factor, consisting of items
such as the classical “would exclude from my
neighborhood.” 4

It is intuitively obvious that these three factors,
though factor analytically orthogonal, form a continuum
along the dimension of intimacy, with the
acquaintance factor being least intimate, the friendship
factor being intermediate in intimacy, and the
social distance factor being most intimate (the opposite
pole of “exclude from my neighborhood” is obviously
intimate; also, this factor contains high loadings
for such items as “would introduce to a relative
on a blind date”).


In addition to the behavioral differential, a form
of the semantic differential, which has been called the
personality differential by Osgood and Ware (Osgood,
1962), was used. The scales utilized were drawn
largely from a previous study by Davis (1966);
however, as was the case with the behavioral differential,
the personality differential scales were also
refactored in a pretest conducted specifically for this
study. Five factors emerged from this analysis. Due
to time limitations, only four stimulus persons were
responded to on these scales, namely, those characterized
by differences in race and one of the beliefs
(income issue). These data did not turn out to be
useful in differentiating subjects, and, hence, the
factors will not be described in greater detail here.


Issues 


In addition to person stimuli, the subjects were 
asked to respond on six semantic differential scales 
to 10 stimuli which were designations of social 
issues. The scales were selected from a previous 
study by Davis (1966), designed to develop semantic 
differential scales specifically relevant to judging 
issues. The six scales constituted the three factors of 
evaluation, familiarity, and importance. Of the 10 
issues that were rated, 8 of them pertained to civil 
rights; the other 2 were the 2 issues (income issue 
and United Nations issue) that were used to opera- 
tionalize the beliefs of the stimulus persons. Taking 
the subjects’ composite scores on the evaluation 
factor for each of the issues, the 10 issues were 
factor analyzed. Five factors emerged, four of which 
were related to civil rights or race. The fifth factor 
contained the income issue and the United Nations 
issue. Thus, the 2 issues used as beliefs were clearly 
orthogonal to civil rights issues, thereby avoiding 
the previously mentioned difficulty in the original 
Triandis and Davis (1965) study. Indeed, the fact 
that these 2 issues factored together is probably due 
merely to the fact that they were the only 2 issues 
not related to race or civil rights. Had other non- 
racial issues relating to domestic and foreign policy 
been used, it is quite probable that these items would 
have factored out separately. This interpretation is
further supported by the relatively low correlation
between the two issues (r = .21), which would seem
to indicate a high degree of independence.

4 For each of the three factors, the three highest
loading items were selected, and composite scores,
based on the three items, were computed for each
factor. For the complete factor-analytic results of
the pretest and a listing of the actual items used, see
Goldstein (1969).
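
Footnote 4 states that, for each behavioral differential factor, the three highest loading items were selected and combined into a composite score. The sketch below illustrates one way such a step could be carried out. It rests on assumptions: the text does not say how the three items were combined, so averaging is assumed here, and the item names, loadings, and ratings are hypothetical values invented for the example, not data from the study.

```python
def top_items(loadings, k=3):
    """Return the k item names with the largest absolute factor loadings."""
    return sorted(loadings, key=lambda item: abs(loadings[item]), reverse=True)[:k]


def composite_score(ratings, items):
    """Average one subject's ratings over the selected items.

    Averaging is an assumption; the source does not specify the rule.
    """
    return sum(ratings[item] for item in items) / len(items)


# Hypothetical loadings of five scale items on one factor.
loadings = {"item_a": 0.81, "item_b": 0.12, "item_c": 0.77, "item_d": 0.69, "item_e": 0.30}
# Hypothetical ratings by one subject of one stimulus person.
ratings = {"item_a": 6, "item_b": 2, "item_c": 5, "item_d": 7, "item_e": 3}

selected = top_items(loadings)
score = composite_score(ratings, selected)
print(selected, score)
```

A sum rather than a mean would rank subjects identically; the mean simply keeps the composite on the same scale as the individual items.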


Norms 


A major purpose of this study was to investigate 
the role of social norms as correlates of behavioral 
intentions. Two types of information concerning 
social norms were obtained. The subjects were presented
with six social groups, designated as follows:
your parents, your home community, your school 
friends, your close friends, American society in gen- 
eral, and your friends in general. They were asked to 
rate on a 7-point scale the extent to which they per- 
ceived these groups as being important in guiding 
their interactions with others. They were also asked 
to rate on a 7-point scale the extent to which they 
felt that each of these groups would approve or dis- 
approve of their having specific behavioral contacts 
with certain persons. The stimulus persons involved 
were college students of the same sex as the subjects, 
and they were either white or Negro. For each of the 
two racial stimulus persons, there were three behavioral
contacts: “eat with,” “take a trip with,” and
“introduce to a relative on a blind date.” These three 
behaviors were representative of each of the three 
behavioral differential factors. 


Summary of Variables 


In line with the attempt to replicate the Triandis 
and Davis (1965) findings, the next step was to 
compute subject factors. In this part of the study, 
the data on norms relating to the subjects’ percep- 
tions of their important others’ approval or dis- 
approval of interracial contacts were omitted, since 
these were to be independently related to the subject 
types. In summary, a total of 68 variables emerged 
for use in further analysis. These may be accounted 
for as follows: 24 from the behavioral differential 
(eight stimulus persons rated on three factors), 20 
from the personality differential (four stimulus per- 
sons rated on five factors), 15 from the issues (five 
issues rated on three semantic differential factors), 
6 from the importance of group scale (one impor- 
tance rating for each of six groups), and 3 com- 
posite scores based on factors that emerged from 
Likert scales designed to measure “personality” char- 
acteristics, such as authoritarianism and conformity. 
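
The tally of 68 variables can be checked directly from the counts given in the paragraph above. The following is a small bookkeeping sketch; the dictionary keys are labels chosen here for readability, while the numbers are those stated in the text.

```python
# Counts as stated in the text: (number of stimuli or groups) x (number of factors or ratings).
VARIABLE_SOURCES = {
    "behavioral_differential": 8 * 3,   # eight stimulus persons, three factors
    "personality_differential": 4 * 5,  # four stimulus persons, five factors
    "issues": 5 * 3,                    # five issues, three semantic differential factors
    "importance_of_group": 6 * 1,       # one importance rating for each of six groups
    "personality_composites": 3,        # three composite scores from Likert scales
}

total = sum(VARIABLE_SOURCES.values())
print(total)
```

The parts (24, 20, 15, 6, and 3) sum to the 68 variables reported.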


RESULTS 
Subject Types 


The 68 variables described above, derived 
from the responses of the 150 subjects, were 
analyzed by means of a modified version of 
the Tucker and Messick (1963) procedure 
for factor analyzing an individual differences 
matrix. This technique results in subject fac- 
tors or “idealized subject types” determined 
by the covariances of the subjects’ responses 
to the variables. From the original covariance
matrix, this program results in two factor
matrices: one for subjects and one for vari- 
ables. Four principal-axis factors for subjects 
were extracted and rotated to simple struc- 
ture by varimax. An interpretation of the 
rotated subject factors was made possible by
an inspection of the transformed characteristic
vectors of the variable mode (a technique 
that may be loosely referred to as “counter 
rotation”). As was the case in the Triandis 
and Davis research, two of the subject fac- 
tors (Factors 1 and 3) were clearly interpre- 
table and relevant for this study. 
Subjects who showed high loadings on the 
first subject factor responded to the stimulus 
persons primarily on the basis of their beliefs; 
that is, they rejected stimulus persons who
held beliefs dissimilar to their own. Rejection
on the basis of racial dissimilarity was negligi- 
ble for this group. Those high on this factor 
also tended to have negative evaluations of 
civil rights issues. 

Those subjects who had low loadings on 
this first subject factor also tended to reject 
belief-dissimilar persons, but in contrast to 
the “highs,” they favorably evaluated civil 
rights issues. 

In contrast to this first subject factor, an 
interpretation of Subject Factor 3 indicated 
that subjects with high loadings on this factor 
rejected stimulus persons who were racially 
different from themselves and that they were 
only minimally affected by the beliefs held 
by the stimulus person. Like the highs on the 
first factor, however, subjects with high load- 
ings on this third subject factor also tended 
to evaluate civil rights issues negatively. 

In contrast to the highs, the “lows” on this 
subject factor showed no rejection of racially 
dissimilar stimulus persons. Similar to the lows 
on the first subject factor, they also rated 
civil rights issues favorably. 

In summary, the following picture emerges. 
The highs on both subject factors are 
“illiberal,” in that they tended to evaluate 
civil rights issues negatively; however, the 
highs on the first subject factor tended to 
reject stimulus persons primarily on the basis 
of their beliefs, whereas the highs on the third
subject factor tended to reject stimulus persons
primarily on the basis of race. The lows
on both factors were “liberals,” in the
sense that they positively evaluated civil
rights issues.

The problem of actually selecting
persons representative of the two subject
factors, or “idealized subject types,” was
complicated by the fact that there was a
certain degree of similarity between them. The
correlation over all of the 150 subjects be-
tween their loadings on Subject Factor 1 and
their loadings on Subject Factor 3 was .2°. 
This still shows a fair degree of independence 
(the corresponding figure in the Triandis & 
Davis study, 1965, was .54). The lower figure
in the present study is very likely due to the 
fact that nonracial belief items were utilized. 

In view of the correlation between the two 
subject types, the following procedure was 
adopted. The subjects with the highest and 
lowest loadings on each of the two factors 
were selected regardless of the overlap. This 
initial selection resulted in 30 subjects who
loaded high on Subject Factor 1, 25 subjects
who loaded high on Subject Factor 3, [number
illegible] subjects who loaded low on Subject
Factor 1, and 30 subjects who loaded low on
Subject Factor 3. Next, the 15 highs and 15 lows
on each subject factor who showed the least 
degree of overlap were selected. These were 
then considered as “pure highs” and “pure 
lows.” Finally, from the remaining subjects, 
we selected a group of “mixed highs,” that is,
subjects who had high loadings on both 
subject factors and a group of “mixed lows.” 
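The selection procedure just described can be sketched in code. The article does not say how "least degree of overlap" was measured, so the criterion below (smallest absolute loading on the other factor) is an assumption made only for illustration; the function name, the random loadings, and the pool sizes are likewise hypothetical.

```python
import random

def pure_highs(primary, other, pool_size, keep=15):
    """Return indices of `keep` subjects drawn from the `pool_size` highest
    loadings on `primary`, preferring those with the smallest absolute
    loading on `other` (a hypothetical stand-in for "least overlap")."""
    ranked = sorted(range(len(primary)), key=lambda i: primary[i], reverse=True)
    pool = ranked[:pool_size]
    return sorted(pool, key=lambda i: abs(other[i]))[:keep]

# Made-up loadings for 150 subjects on two subject factors (not the study's data).
rng = random.Random(0)
f1 = [rng.uniform(-1, 1) for _ in range(150)]
f3 = [rng.uniform(-1, 1) for _ in range(150)]

belief_rejectors = pure_highs(f1, f3, pool_size=30)  # pure highs on Factor 1
race_rejectors = pure_highs(f3, f1, pool_size=25)    # pure highs on Factor 3
```

The pure lows would be chosen the same way after reversing the sort on the primary factor.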

For convenience, we will refer henceforth to 
the subjects who were pure highs on Subject 
Factor 1 as belief rejectors and those subjects 
who were pure highs on Subject Factor 3 as 
race rejectors. 

It follows from the discussion of Figure 1 
above that our primary analysis is concerned 
with treating the subjects’ responses to the
characteristics of the stimulus persons as part 
of a cube of data in which, in addition to the 
stimulus persons’ characteristics, the two 
other parameters of subject characteristics 
and the nature of response continua play an 
important role. 


Weights Assigned to Race and Belief 


We have described the two different types 
of subjects that emerged from the preliminary 
analysis of the data. The response continua
on which we will focus here are the three
factors of the behavioral differential described
above.

In order to examine the relative weights 
given by the respondents to the race or belief 
of the stimulus persons, separate analyses of 
variance were carried out for each of seven 
subject types, utilizing data based on compos- 
ite scores derived from each of the three 
factors that constitute the response continua. 
The seven types may be described as follows: 
the pure highs on each of the two factors—the 
belief rejectors and the race rejectors; the 
pure lows on each factor; the mixed highs 
and mixed lows; and finally, “others,” that is, 
those remaining subjects who fall into none 
of the six categories.

Tables 1—4 present these analyses of vari- 
ance results, including sums of squares, per- 
centages of variance, and significance levels 
of F ratios for all subject types across the 
three response continua. 
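As a reading aid for the tables, the sketch below assumes that each percentage of variance is the effect's sum of squares divided by the total sum of squares; the article does not state the formula, so this is an inference. The total is not printed in the tables, so it is back-computed from one row of Table 1 (race rejectors, social distance factor) and then used to reproduce the other two rows of the same column.

```python
def percent_of_variance(ss_effect, ss_total):
    """Share of the total sum of squares attributable to one effect, in percent."""
    return 100.0 * ss_effect / ss_total

# Table 1, race rejectors, Factor III (social distance): SS and % for race.
ss_race, pct_race = 396.05, 76.94

# Total SS implied by that single row (an inference, not a printed value).
implied_total = ss_race / (pct_race / 100.0)

# The same total should reproduce the remaining rows of the column.
pct_un = percent_of_variance(11.20, implied_total)     # United Nations issue
pct_income = percent_of_variance(1.34, implied_total)  # income issue
print(round(pct_un, 2), round(pct_income, 2))
```

Under this assumption the recomputed values agree with the printed 2.18 and .26, which suggests the reading of the table is internally consistent.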

The results presented in Table 1, comparing 
the belief rejectors with the race rejectors, 
show two clearcut patterns. One is that, for 
both subject types, the percentage of vari- 
ance controlled by race increases as the 
response continuum becomes more intimate. 

The second pattern lies in a systematic 
difference between the subject types. For any 
given response continuum, race controls more 
variance for the race rejectors and belief 
controls more variance for the belief rejectors. 

Thus, for the belief rejectors, race controls 
no significant amount of variance on the ac- 
quaintance factor (the least intimate of the 
three), whereas belief—in particular the 
“income” issue—controls a significant amount 
of variance. For the race rejectors, on the 
other hand, even on this less intimate factor, 
race controls a significant amount of vari- 
ance—more than four times that controlled by 
the most important belief variable. 

For the friendship factor, which may be 
considered intermediate in intimacy, race still 
does not control any significant amount of 
variance for the belief rejectors, whereas 
belief—the income issue—again controls a 
significant amount of variance. For the race 
rejectors, on the other hand, race controls 
more than 15 times as much variance as the 
highest belief variable and more than twice 
TABLE 1

SUMS OF SQUARES AND PERCENTAGES OF VARIANCE
CONTROLLED BY RACE AND TWO BELIEFS ON THREE
RESPONSE CONTINUA: COMPARISON OF SUBJECTS
WITH HIGH LOADINGS ON EACH OF THE
TWO SUBJECT FACTORS

                                  “Pure high ones”:     “Pure high threes”:
                                  belief rejectors      race rejectors
                                  (n = 15)              (n = 15)
Source                   df       SS        %           SS         %

Factor I: Acquaintance acceptance-rejection
  Race                    1      1.20      .48        17.13     14.87**
  United Nations issue    1       .09      .04         3.56      3.09
  Income issue            1     27.40    10.95**       1.63      1.42

Factor II: Friendship acceptance-rejection
  Race                    1      3.33     1.04       133.71     35.52***
  United Nations issue    1       .63      .19         8.18      2.17
  Income issue            1     33.43    10.41**       6.53      1.74

Factor III: Social distance
  Race                    1     36.29    13.23**     396.05     76.94***
  United Nations issue    1       .30      .11        11.20      2.18*
  Income issue            1     30.67    11.18**       1.34       .26


Note.—F ratios (omitted here) were calculated by dividing 
the mean squares for main effects by the mean-square error, 
The F ratios were tested for significance with 1 and N — § de- 
grees of freedom, and the levels of significance are shown next 
to the percentages of variance, 

*p < .05.
**p < .01.
***p < .001.

as much variance as in the case of the ac-
quaintance factor, thus illustrating the dif-
ferences due both to subject type and intimacy
of the response continuum.

Finally, on the social distance factor, the 
most intimate of the three, race controls a 
significant amount of variance even for the 
belief rejectors, but belief continues to con- 
trol a very significant amount of the variance. 
For the race rejectors, the previously ob- 
served trend continues, with race controlling 
more than 35 times as much variance as the 
highest belief variable, and again more than 
twice as much as it did on the previous factor. 
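The "times as much variance" statements in the preceding paragraphs can be checked against the race rejectors' percentages in Table 1. The short calculation below is only such a check; the dictionary layout and variable names are illustrative.

```python
# Percentages of variance for the race rejectors, as printed in Table 1:
# (race, largest belief effect) on each response factor.
race_rejectors = {
    "acquaintance": (14.87, 3.09),
    "friendship": (35.52, 2.17),
    "social distance": (76.94, 2.18),
}

# Race effect relative to the largest belief effect on the same factor.
ratios = {name: race / belief for name, (race, belief) in race_rejectors.items()}

# Growth of the race effect from one factor to the next more intimate one.
growth_friendship = 35.52 / 14.87
growth_social = 76.94 / 35.52

for name, value in ratios.items():
    print(f"{name}: race/belief = {value:.1f}")
print(f"growth: {growth_friendship:.2f}, {growth_social:.2f}")
```

The ratios come out above 4, 15, and 35, and both growth factors exceed 2, in line with the wording of the text.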

Table 2, which compares the pure lows on 
the two subject factors, shows generally in- 
significant sums of squares and percentages 
of variance, as might be expected. 

The major exception is the significant per- 
centage of variance controlled for by race for 
the “pure low ones” on the social distance 
TABLE 2

SUMS OF SQUARES AND PERCENTAGES OF VARIANCE
CONTROLLED BY RACE AND TWO BELIEFS ON THREE
RESPONSE CONTINUA: COMPARISON OF SUBJECTS
WITH LOW LOADINGS ON EACH OF
THE TWO SUBJECT FACTORS

                                  “Pure low ones”        “Pure low threes”
                                  (n = 15)               (n = 15)
Source                   df       SS        %            SS        %

Factor I: Acquaintance acceptance-rejection
  Race                    1       .89   [illegible]      .24       .23
  United Nations issue    1       .58      .48           .00       .00
  Income issue            1       .07      .06           .01       .01

Factor II: Friendship acceptance-rejection
  Race                    1       .45      .38          1.01       .56
  United Nations issue    1      9.26   [illegible]      .16       .09
  Income issue            1       .03      .03           .89       .49

Factor III: Social distance
  Race                    1     50.70    26.17***       2.04      1.12
  United Nations issue    1      8.18     4.22*          .67   [illegible]
  Income issue            1       .37      .19           .89       .49

*p < .05.
**p < .01.
***p < .001.


factor. This illustrates an overriding tendency 
for race to be important on this most intimate 
response continuum. This tendency does not 
show up for the “pure low threes” because 
they are by definition the opposite of the 
race rejectors. 

Table 3 compares the results of the mixed 
highs and mixed lows. As might be predicted, 
the mixed highs show significant main effects 
for both race and belief; however, it is inter- 
esting to note that the race effect is substan- 
tially greater. These subjects also illustrate, 
once again, the general tendency for race to 
control more variance as the response con- 
tinuum becomes more intimate. In contrast to 
the mixed highs, the mixed lows show no 
significant effects for either race or belief. 

Table 4 presents the corresponding results 
for all “other” subjects, that is, those who 
were not selected as representative of either 
“pure” or “mixed” highs or lows on either of 
the two subject factors. The results obtained 
from these subjects primarily illustrate the 
previously observed tendency for race to con-
trol increasingly more variance as the response
continuum becomes more intimate. Although
race is significant on all three response factors,
the percentages of variance (and corresponding
F ratios) are lower than for the race rejectors
or mixed highs. (The high significance levels
are partially a function of the larger n for
this group of subjects.) The fact that the
main effects for belief are insignificant would
seem to indicate that for these subjects, who
in a way represent the “average” or “neutral”
subject, belief is simply not salient as a de-
terminant of behavioral intentions.

The results presented in Tables 1-4 would 
thus seem to represent a replication of the 
Triandis and Davis (1965, p. 722) findings, 
as illustrated in Tables 3 and 4 of their article. 


The Effects of Norms 


The present study was also concerned with
investigating differences between race and
belief rejectors in terms of their perceptions
of social norms relating to interracial contacts.


TABLE 3

SUMS OF SQUARES AND PERCENTAGES OF VARIANCE
CONTROLLED BY RACE AND TWO BELIEFS ON THREE
RESPONSE CONTINUA: COMPARISON OF SUBJECTS
WITH HIGH OR LOW LOADINGS ON
BOTH OF THE TWO SUBJECT FACTORS

                                  “Mixed highs”          “Mixed lows”
                                  (n = 13)               (n = 13)
Source                   df       SS        %            SS        %

Factor I: Acquaintance acceptance-rejection
  Race                    1     28.39    14.97**         .13       .10
  United Nations issue    1      3.72     1.96           .31       .24
  Income issue            1     10.05     5.30*          .67       .52

Factor II: Friendship acceptance-rejection
  Race                    1     42.22    22.77**         .04       .02
  United Nations issue    1       .27      .09          1.23       .18
  Income issue            1     24.69     7.78*         3.59      2.27

Factor III: Social distance
  Race                    1    198.47    48.28***       3.60      3.27
  United Nations issue    1      3.00      .13          1.09      1.00
  Income issue            1     23.72     5.77*         3.85      3.50

*p < .05.
**p < .01.
***p < .001.

TABLE 4

SUMS OF SQUARES AND PERCENTAGES OF VARIANCE CON-
TROLLED BY RACE AND TWO BELIEFS ON THREE
RESPONSE CONTINUA: RESULTS FOR SUBJECTS
WITH NEITHER HIGH NOR LOW LOADINGS
ON EITHER OF THE TWO SUBJECT
FACTORS

Source                   df       SS        %

Factor I: Acquaintance acceptance-rejection
  Race                    1     20.32     3.04
  United Nations issue    1      1.76      .27
  Income issue            1      1.00      .15

Factor II: Friendship acceptance-rejection
  Race                    1     60.28     6.65
  United Nations issue    1      2.48      .29
  Income issue            1      2.77      .33

Factor III: Social distance
  Race                    1    305.77    28.47***
  United Nations issue    1      1.96      .25
  Income issue            1      1.29   [illegible]

Note.—For the subject type others, n = 64.
*p < .05.


The subjects were asked to rate six social 
groups on a 7-point scale of importance in 
terms of their influence on the subjects’ social 
behavior. All subject types rated parents as 
most important (6.9 for race rejectors and 6.7 
for belief rejectors) and close friends as 
second most important (6.3 for both race 
rejectors and belief rejectors). After ascer-
taining the relative importance of the groups,
the subjects were asked to indicate on a 7-
point scale the extent to which these reference
groups would approve or disapprove of their
engaging in three behaviors (representing the
three factors of the behavioral differential)
with two stimulus persons represented by a
white and a Negro college student of the same
sex as the subject.

For both race rejectors and belief rejectors
the perceived approval was generally greater 
for the white stimulus person than for the 
Negro stimulus person, though this difference 
was more marked for the race rejectors. 
Table 5 presents a comparison of the de- 
grees of approval that the two subject types 
attribute to their two most important refer-
ence groups with respect to the three behaviors
toward the Negro stimulus person. 

As may be seen from Table 5, the race 
rejectors perceive greater disapproval (or less 
approval) on the part of both reference groups 
for all three behaviors relative to the belief 
rejectors. For the least intimate item (“eat 
with”), this difference is significant for parents 
and in the expected direction, though non- 
significant, for close friends. For the inter- 
mediate behavior (“go on a trip with”), the 
difference is significant both for parents and 
close friends. For the most intimate behavior 
(“introduce to a relative on a blind date”), 
the difference is significant at the .05 level 
for parents and in the expected direction, but 
not significant, for close friends. The fact 
that the magnitude of the differences is less 
for this intimate item is a reflection of the 
fact that for both groups, the disapproval 
increases as the behavior becomes more inti- 
mate, thus introducing a certain “floor” effect. 
Thus, these results confirm the hypothesis put 
forward earlier that race-prejudiced subjects 
would perceive their important others as being 
more disapproving of interracial contacts than 
would belief-prejudiced subjects. 


TABLE 5

MEAN DIFFERENCES BETWEEN BELIEF REJECTORS AND
RACE REJECTORS IN TERMS OF PERCEPTIONS OF
THEIR IMPORTANT REFERENCE GROUPS’ AP-
PROVAL-DISAPPROVAL OF THEIR SOCIAL
CONTACTS WITH NEGRO COLLEGE
STUDENTS

                          “Pure high ones”:     “Pure high threes”:
                          belief rejectors      race rejectors
                          (n = 15)              (n = 15)              Mean
Factor                    X̄         SD          X̄         SD          difference    t

I: Eat with
  Parents                 6.00      1.21        4.13      1.93        1.87          3.12**
  Close friends           5.33      2.06        4.06      1.67        1.27          1.81

II: Go on a trip with
  Parents                 4.53      1.83        1.93      1.66        2.60          4.00**
  Close friends           4.40      2.09        2.13      1.88        2.27          3.24**

III: Introduce to a relative on a blind date
  Parents                 2.00      1.86        1.00       .00        1.00          2.05*
  Close friends       [illegible]   2.02        1.33      1.02        1.93          1.58

Note.—1 = maximum perceived disapproval; 7 = maxi-
mum perceived approval. df = 28 in each case.
*p < .05.
**p < .01.
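The t values in Table 5 can be approximated from the printed means and standard deviations. The sketch below assumes an ordinary independent-samples t test with a pooled variance and equal group sizes, which matches the stated df of 28 (15 + 15 - 2); the article does not spell out the exact computation, so the function is an illustration rather than the authors' procedure.

```python
import math

def t_from_summary(mean_a, sd_a, mean_b, sd_b, n):
    """Independent-samples t for two groups of equal size n (df = 2n - 2)."""
    pooled_var = (sd_a ** 2 + sd_b ** 2) / 2.0  # equal n, so a simple average
    std_err = math.sqrt(pooled_var * 2.0 / n)
    return (mean_a - mean_b) / std_err

# "Eat with," parents: belief rejectors vs. race rejectors (values from Table 5).
t_parents_eat = t_from_summary(6.00, 1.21, 4.13, 1.93, n=15)
df = 2 * 15 - 2
print(round(t_parents_eat, 2), df)
```

This gives a value of about 3.18, close to but not identical with the printed 3.12; the small gap may reflect rounding of the printed summary statistics or a slightly different variance estimate.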


DISCUSSION 


The results of the present study would seem 
to constitute a rather close replication and 
extension of the earlier findings of Triandis 
and Davis (1965). The data for the present 
study were collected 4 years after the earlier 
study, from subjects in a different part of the 
country, utilizing somewhat different mea- 
sures of behavioral intentions and completely 
different belief stimuli. Thus, within the gen- 
eral framework of the culture from which the 
subjects were drawn, these findings can 
probably be safely generalized. 

Naturally, it would be desirable to seek 
to replicate these findings under widely dif- 
fering circumstances, such as with subjects 
from different cultures. Tan and de Vera 
(1970) reported findings obtained from 
University of the Philippines subjects that 
replicate the findings of the Triandis and 


Davis (1965) study. These results show that
race becomes more important when intimate
behaviors are involved; these authors, further-
more, interpret their findings as being in direct
contradiction to the findings of Rokeach et al.
(1960). Further replication studies are obvi-
ously desirable, particularly ones that would
take into account the differences among 
subject types. 

The present study was concerned with 
investigating the relative importance of stimu- 
lus-person characteristics of race and belief 
in determining responses of social acceptance 
or rejection. In addition to stimulus-person
characteristics, at least two other important 
parameters should be considered. 

One of these is the nature of the response 
continuum—in this case, scales representing 
behavioral intentions. It was shown that when
such scales are subjected to factor analysis
and the resulting factors are ordered according
to degree of intimacy, the importance of
race increases as the behaviors become more
intimate.

The second parameter has to do with the 
nature of the subjects. Using multidimen- 
sional scaling techniques, two subject types 
were identified, and the highs on these two 
subject factors were designated as belief re-
jectors and race rejectors. These two subject
types are very close replications of the two 


MARVIN GOLDSTEIN AND EARL E. DAVIS


types obtained by Triandis and Davis 
(1965). It was shown that, in general, over 
all response continua, for the race rejec-
tors, race was the most important stimulus
characteristic, while for the belief rejectors, 
belief was the most important characteristic. 
However, for both subject types, race became 
more important as the response continuum 
became more intimate. 

Rokeach et al. (1960) stated that their 
theory, emphasizing the preeminence of belief, 
applies “insofar as psychological processes are 
involved . . . [p. 135].” Stein et al. (1965) 
stated that their “sensitive areas,” in which 
they found race to be of importance, are 
examples of “institutionalized racial prejudice 
outside the framework of his [Rokeach’s]
theory [p. 288].” We find it a bit strange 
that only certain behaviors should be ex- 
plained on the basis of “psychological proc- 
esses,” whereas others are explained on the 


basis of “institutionalized racial prejudice,”
with the implication that some sort of contra-
distinction is involved. Don’t “psychological
processes” mediate between behaviors and 
social norms? More generally, if a theory dis- 
misses data that does not fit it as merely being 
“outside the framework” of the theory, it is of 
limited usefulness. The model presented by 
Triandis and Davis and the present authors, 
while not claiming to constitute a theory in 
any strict sense, does permit the explanation 
of a more generalized set of behaviors. 

We have shown that, whereas belief is im- 
portant for certain subject types, there are 
other subject types for whom this is not true. 
In the present study, the race rejectors show 
a greater race effect than belief effect on all 
dimensions, including the least intimate one. 
In fact, as an inspection of Tables 3 and 4 
reveals, the mixed highs as well as the neu-
tral or “other” subjects also show a greater
race effect than belief effect, even on the least 
intimate dimension. Thus, it is not possible 
to dismiss these findings as mere excep- 
tions. Rather, any theory or model must 
systematically account for them. 

In an effort to further understand the dif- 
ference between the subject types, the authors 
investigated the role of social norms as cor- 
relates of behavioral intentions. The hypothe-
sis that race rejectors would perceive their
important others as being more disapproving 
(or less approving) of their having interracial 
contacts than would belief rejectors was 
confirmed. 

There is no reason to doubt that the same 
“psychological processes” are involved in deter- 
mining the behavioral intentions of both race 
rejectors and belief rejectors on the different 
response continua. Among the psychological 
processes involved in explaining the effect of 
belief as a determinant of social acceptance or 
rejection are cognitive processes governed by
principles such as balance theory and dis-
sonance theory. It is quite plausible to assume 
a tendency on the part of many subjects to 
accept belief-similar persons and reject belief- 
dissimilar persons, since this would be in line 
with predictions of cognitive theories. It is 
equally plausible, however, to assume that 
subjects would reject persons of whom their 
important others disapprove and accept per-
sons of whom their important others approve.
Engaging in certain behaviors with (or even 
expressing positive behavioral intentions 
toward) persons of a different racial or ethnic 
group would certainly be dissonance pro- 
ducing if one perceived one’s important others 
as strongly disapproving of such behaviors. 
Thus, it is clear that the same psychological 
processes are operating for all subjects. These 
processes depend, among other things, on the 
subjects’ perceptions of social norms, specifi- 
cally the perceived approval of their important 
others. 

The present study has gone one further step 
in the direction of explaining apparently con- 
tradictory findings in an area that is of both 
theoretical and practical significance. Obvi-
ously, however, further research needs to be
done. Do subjects just perceive their impor-
tant others as disapproving of interracial con-
tacts or do the important others actually
disapprove of such contacts? A study of both 
the subjects and their important others would 
clarify this question. Furthermore, additional 
research needs to be done to understand more 
fully the differences between the two subject 
types isolated in this and the previous study. 
The results of continuing research along these 
lines would do much to help explain the basic 


nature of “prejudice”; that is to say, such 
results would help to answer the fundamental 
question of what the bases are on which 
human beings express social acceptance or 
rejection of their fellows. 
MODELING CUES ON


BRAUN 2


A comparative analysis of performances in extinction was conducted, varying
the schedule of reinforcement (variable ratio 20% and variable ratio 80%),
the type of reinforcement (direct or vicarious), and the nature of the model’s
verbal cues (persistent relevant and nonpersistent irrelevant) during acquisi-
tion. A total of 72 undergraduate males operated specially constructed elec-
tronic apparatus by choosing one of four manipulanda. The two types of rein-
forcement produced essentially comparable performances in extinction in terms
of resistance to extinction, rate of responding, extent of imitation, and expec-
tancy of reinforcement. The model’s verbal cues had minor effects on perform-
ance. A modified operant paradigm incorporating contiguity-mediational prin-
ciples was offered to account for the effects of vicarious reinforcement on
performance.


Although the concept of vicarious reinforce-
ment is basic to a full understanding of imita-
tion (Bandura & Walters, 1963), vicarious
reinforcement is itself not well understood.
Bandura (1969) cited studies that, he main-
tained, indicate that “vicarious reinforcement
processes are governed by variables such as
the percentage (Bisese, 1966; Kanfer, 1965),
intermittance (Rosenbaum & Bruning, 1966),
and magnitude (Bruning, 1965) of reinforce-
ment in essentially the same manner as when
they are applied directly to a performing sub-
ject [p. 30].” Bandura (1969) further specu-
lated that, as with direct reinforcement, the
motivation produced by vicarious reinforce-
ment “is most likely to affect the speed, in-
tensity and persistence with which matching
responses are executed [p. 31].” However, it
is perplexing to find that Flanders (1968, p.
321) cited one of the above studies (Bisese,
1966) as evidence that vicarious and direct
reinforcement do not have the same effect on
performance.

Jenkins and Stanley (1950) and Lewis
(1960) have surveyed the literature on partial
direct reinforcement, and, together, their


1 The research was conducted while the author was
at the University of Missouri, Columbia.

2 The author is indebted to his advisor, June
Chance, for her valuable assistance and support.

Requests for reprints should be sent to Stephen
H. Braun, Arizona State Hospital, 2500 East Van
Buren Street, Phoenix, Arizona 85008.


articles provide a comprehensive summary up 
to 1960. However, the literature dealing with 
the effects of partial vicarious reinforcement 
(cf. Bourdon, 1970; Flanders, 1968) is not 
as substantial. Four princip e of research 
warrant consideration herein: the general 
nature of responding during acquisition, re- 
sistance to extinction, rate of responding in 
extinction, and expectancy of reinforcement 
during extinction. 


Responding during Acquisition 


It is clear that, with direct reinforcement, 
responding during acquisition is not neces-
sarily identical to responding during extinction
(Jenkins & Stanley, 1950; Lewis, 1960). There
is no persuasive argument against a similar 
statement with regard to vicarious reinforce- 
ment. Nevertheless, the majority of studies of 
partial vicarious reinforcement have investi- 
gated performance in acquisition but not in 
extinction, yet they have often implied that 
conclusions would apply with equal validity 
to both. This error frequently occurs in in- 
vestigations of the competence of the model, 
wherein competence is varied in terms of the 
percentage of the model’s responses that are
reinforced, while the observer’s performance
is sampled only during acquisition. It should
be noted that the broad generalizations that
imitation is a direct function of increasing
percentages of vicarious reinforcement or that
more competent models will be more exten-
sively imitated are not supportable until they
have been empirically demonstrated to hold 
true in extinction. 


Resistance to Extinction


The partial reinforcement effect is one of 
the most firmly established generalizations 
arising from the literature on partial direct 
reinforcement. In contrast, only two studies 
have dealt with the resistance to extinction 
of a vicariously reinforced response. Lewis and
Duncan (1958) employed a lever-pulling re-
sponse, for which subjects received either 25%
or 100% reinforcement. The authors con-
cluded that both direct and vicarious rein-
forcement produced the partial reinforcement
effect, but the latter did so only when ob-
servers actually received the reinforcement
earned by the model. However, these con-
clusions are questionable for at least three
reasons: (a) the unsatisfactory operational
definition of vicarious reinforcement in the
“Watch and Win” and “Explain and Win” 
groups; (b) the limited number of acquisition 
trials (eight) to which the subjects were ex- 
posed (cf. Lewis, 1960); and (c) the avail- 
ability of only one operant response of the 
subjects which was identical to that made by 
the model and was therefore described as 
“imitative.” 

In contradiction to Bandura’s (1969) inter- 
pretation, Flanders (1968) noted that Bisese 
(1966) failed to obtain the partial reinforce-
ment effect with vicarious reinforcement. How-
ever, several methodological faults of the
study render the support it offers to either
position rather tenuous. First, the task re-
quired skill, and, in view of the data obtained
by James and Rotter (1958), for example,
it is probably more surprising that Bisese
obtained the partial reinforcement effect with
direct reinforcement than it is that he did not
obtain it with vicarious reinforcement. Sec-
ond, reinforcement was dispensed according
to preselected schedules which took no account 
of the nature of the response on which rein- 
forcement was presumably contingent. The 
lack of definable contingencies precludes any 
learning and performance other than that 
which must be described as superstitious. 
Third, all subjects received both direct and
vicarious reinforcement during acquisition,
and the study provided no adequate means of
separately analyzing their relative effects on
performance during acquisition or extinction.
Fourth, since all subjects received vicarious 
reinforcement during “extinction,” it might 
reasonably be argued that the procedure failed 
to conform with the traditional operational 
definition of extinction. Finally, limiting ex- 
tinction to 10 trials is an unorthodox proce- 
dure of doubtful merit since it restricted the 
investigator’s scrutiny to a seriously insuffi- 
cient sample of behavior following exposure 
to variable ratio schedules of reinforcement. 


Rate of Responding during Extinction 


Although rate of responding is regarded as 
a basic datum in operant research, there have 
been no investigations of the effects of 
partial vicarious reinforcement on the rate 
of responding in a free operant situation. 
Rosenbaum and Bruning (1966), however, 
investigated the speed of discrete responses in 
a controlled-responding situation, requiring 
children to move an upright handle from left 
to right along an 18-inch slot. Although per- 
formance was similar following either direct 
or vicarious reinforcement, the performance 
curves obtained represented a reversal of the 
usual trends (cf. Lewis, 1960). The reversal 
was explained in terms of the social demands
implicit in the research that the child con- 
tinue to respond despite any appropriate tend- 
encies to stop. The study, in other words, 
failed to provide a completely satisfactory in- 
dication of the effects of experimentally con- 
trolled direct and vicarious reinforcement 
unencumbered by the effects of extraneous 
variables.


Expectancy of Reinforcement during 
Extinction 


In the first study of partial reinforcement 
using human subjects, Humphreys (1939) 
argued in favor of the concept of expectancy 
to explain his data. Lewis and Duncan (1957) 
found no evidence to support Humphreys’ 
(1939) original notion with direct reinforce- 
ment procedures. Lewis (1960) stated that, 
although it is possible that every study cited 
in his review of partial reinforcement could
be reinterpreted according to an expectancy 
notion, the concept is apparently moribund 
as an explanation of the partial reinforcement 
effect since there is no way of disproving it. 

Only one study has investigated expectancy 
of reinforcement following exposure to sched- 
ules of both direct and vicarious reinforce- 
ment. Lewis and Duncan (1958) found that 
the expectancy curve after 100% reinforce- 
ment fell off faster and ended lower than did 
the curve after 50% reinforcement, when the 
reinforcement was direct. Similar effects were 
found when observers also participated in the 
model’s winnings, but no significant trends 
were reported when the observers received 
only vicarious reinforcement. 

Bisese (1966) neither manipulated nor 
measured expectancy, but he concluded that 
his data were most consistent with an inter- 
pretation based on mediational processes in- 
volving expectancy. As previously noted, 
however, the data are of limited value. 

The present study is primarily directed 
toward the goal of providing data concerning 
performance in extinction following exposure 
to intermittent schedules of vicarious or di- 
rect reinforcement. Specifically, the interest 
is in assessing resistance to extinction, imita- 
tion, rate of responding, and expectancies; in 
comparing the performances following the two 
types of reinforcement; and in determining 
the influence of selected verbal modeling cues 
on performance. Access to these data is essen- 
tial to a complete understanding of vicarious 
reinforcement and its role in imitation. 


METHOD 
Subjects 


The subjects were 72 undergraduate males enrolled 
in an introductory psychology course at the Uni- 
versity of Missouri. A total of 80 subjects were 
exposed to the experimental procedure. Of these, 
8 were excluded from the data analysis: 5 were 
eliminated because they failed to operate the machine 
as instructed, 2 were eliminated due to malfunction 
of the event recorder, and 1 was eliminated both 
because he had moral objections to gambling and 
because he failed to complete the series of acquisition 
trials. 


Apparatus 


A brightly painted, electronically operated machine,
shaped roughly like a slot machine, 55.88 centimeters
high, 50.80 centimeters wide, and 76.20 centimeters
deep at the base, was constructed for the study.
The slanted (25-degree) panel facing the subject con-
tained (a) four black “response” buttons paired with
red lights that lighted when the corresponding button
was pushed; (b) five red “expectancy” buttons paired
with amber lights and labeled from left to right
“Sure Not Win,” “Fairly Sure Not Win,” “?,” “Fairly
Sure Win,” and “Sure Win”; and (c) a green “play”
light that signaled the subject to respond. The vertical
panel facing the subject contained (a) the word
“WIN!” in large, bright letters; (b) three blue and
two green lights that flashed contiguously with the
delivery of reinforcement; and (c) a lighted score-
board that displayed the total number of points
accumulated by the subject. A prize list, prominently
displayed on the table next to the machine, indicated
the amount of money won corresponding to the
number of points accumulated, with a maximum
of $5.

A separate control panel, located in another room,
allowed the experimenter to monitor all responses,
administer reinforcement, and reset the machine after
each trial. All responses by the subject and operations
by the experimenter were automatically recorded by
an Esterline-Angus 20-channel event recorder.


Procedure 


Twelve subjects per group were brought individu- 
ally to the experimental room, seated at the machine, 
and given the following instructions to read: 


You will be taking part in a study of concept 
formation. We are studying the way in which 
people learn and form concepts under various 
experimental conditions. 

On the desk is the equipment you will be using. 
It works very much like a slot machine, except 
that you can outguess it. When you have learned 
the correct concept (or outguessed the machine, 
in other words), you will win money. How much 
money you win is determined by how many points 
you accumulate. You get 10 points every time
you outguess the machine and make a correct
choice. A list on the table shows how much you
can win: anywhere from 5¢ for 10 points to $5 for
1,000 points or more.

Here is how you play the machine: We have
previously chosen and programmed into the ma- 
chine a number of concepts. That is, there are 
certain buttons that have already been selected 
and programmed into the machine to win. If you 
figure out how the machine is programmed and 
what button to press on each trial, then you 
will win. 

What you will do on each trial is to choose one 
of four buttons. (We will be using only buttons
numbered 1, 2, 4, and 5—choose only one of
these.) When the green light in the lower left
corner of the machine comes on, that is your
signal to choose a button. It is up to you to figure
out which one button will pay off on any given
trial. You can press only one button on each trial. 

After you have made your choice, you will 
choose one of the five buttons on the top row. 
You will notice that these buttons are labeled 
according to how sure you are of winning on each 
trial. We are using these buttons only to indicate
how sure you are of winning. These buttons have 
absolutely nothing to do with whether or not you 
win. We need to know how you feel about your 
choices as you learn to play the machine, so please 
try to make your choices of these buttons an 
accurate indication of how you really do feel. 

Remember that this machine works very much 
like a slot machine. You probably will not win 
every time, and you may get to a point where the 
machine is quite difficult to outguess. 

There is no time limit to how long you can play. 
You can quit whenever you want to. In fact, it is 
important that you quit whenever you feel you 
can no longer outguess the machine. When you do 
decide to stop playing the machine, we will go on 
to the second part of the research. Quitting is not 
a reflection on you or your personality; we are 
not interested in that. We are interested only in 
the manner in which people in general learn
and form concepts under various experimental
conditions and programs.


During acquisition, pressing Response Button 
Number 1 was designated, unknown to the subject, 
as the “correct” response to be reinforced according 
to either a variable ratio 20% or a variable ratio 
80% schedule. Directly reinforced subjects were told 
to “go ahead and play the machine.” The experi- 
menter seated himself about 10 feet away and quietly 
read until the subject had emitted 30 “correct” re- 
sponses. The experimenter then announced that he 
had to leave, asked for questions, and returned to 
resume reading after 2 minutes. Vicariously rein- 
forced subjects observed the model respond as would 
a naive subject during acquisition, after which the 
model asked for questions, instructed the subject to 
play the machine, and left for 2 minutes. During 
vicarious acquisition, the model provided the verbal 
cues presented in Table 1. Similar verbal cues were 
not provided during the direct reinforcement acqui- 
sition procedure in order to avoid the extraneous 
vicarious influence of a verbal model. Procedural 
extinction was terminated when the subject indicated 


TABLE 1
MODEL’S VERBAL CUES DURING ACQUISITION


Persistent-relevant cues

I’ve gotten pretty good at outguessing this
machine, so I’ll run through a few trials just to
show you how it works. I’d like for you to
watch and see what I do so we’ll be sure you
know how to play it.

I really enjoy showing people how to play this
thing—it’s fun to try to outguess it.

When you find a button that wins, you sometimes
have to press it several times in a row to win.

If a button doesn’t win, eventually you can forget
it, but you have to keep at it to figure it out.

This is about how often it pays off when you
figure it out. If you don’t win as often as I have
been winning, you can forget it—the program is
too hard to beat.

If you can see that you aren’t winning, it’s prob-
ably because the program is too complicated to
beat. I always quit when I can tell I’m not
going to win, but you really have to keep at it
before you can tell.

I didn’t do badly at all on that one.


Nonpersistent-irrelevant cues

I never know if I can beat it, but I’ll run through a
few trials just to show you how it works. I’d like
for you to watch to see what I do so we’ll be sure
you know how to play it.

Sometimes you get a really tough program—one
that’s hard to figure out.

This machine will fool you—you go for a while
without winning and think you can’t outguess it,
but if you keep at it, sometimes it eventually
pays off, and sometimes it doesn’t.

There are so many combinations of buttons that it
gets hard to figure out what’s right and what’s
not.

Sometimes there’s no way of knowing what the
program is.

You really have to work to figure this thing out.
Sometimes I wonder if it’s worth the effort.

OK, that’s enough; I quit.

his desire to stop responding. The experimenter then
paid the subject his winnings (subjects who received
only vicarious reinforcement were given 25¢ as an
unexpected “consolation prize”), explained the opera-
tion of the machine, and asked him not to discuss
the details of his performance with anyone.
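
The acquisition phase described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the article does not say how the variable ratio schedules were programmed (the experimenter administered reinforcement from the control panel), so the code approximates a schedule by reinforcing each "correct" press with a fixed probability. The function name `simulate_acquisition` and the parameter `p_correct` (the chance that a simulated subject presses the designated button) are hypothetical and do not come from the study.

```python
import random


def simulate_acquisition(p_reinforce, p_correct=0.5, criterion=30, seed=0):
    """Simulate acquisition until `criterion` correct responses are emitted.

    Assumption: each correct press is reinforced independently with
    probability `p_reinforce` (0.2 or 0.8), a random-ratio stand-in for
    the article's variable ratio 20% and 80% schedules.
    """
    rng = random.Random(seed)
    trials = correct = reinforced = 0
    while correct < criterion:
        trials += 1
        if rng.random() < p_correct:        # subject pressed Button 1
            correct += 1
            if rng.random() < p_reinforce:  # schedule pays off on this press
                reinforced += 1
    return {"trials": trials, "correct": correct, "reinforced": reinforced}


lean = simulate_acquisition(0.2)
rich = simulate_acquisition(0.8)
print(lean, rich)
```

Under this approximation the percentage of all trials that are reinforced is the product of the percentage of correct presses and the schedule percentage, which is the same relationship that holds between the corresponding columns of Table 2.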


RESULTS AND DISCUSSION 


Performance during Acquisition 


The analysis of performance during acqui-
sition is not of primary concern, but it does
provide a more precise definition of the inde-
pendent variables. Table 2 presents the mean
acquisition performance data by groups. The
performances of the models were similar to
the performances of the subjects who received
direct reinforcement during acquisition, but
they were not identical. Nevertheless, these
differences do not diminish the relevance of
the data, since the ultimate concern is not
with variable ratio 20% and variable ratio
80% schedules specifically, but rather the
more general effects of rich and lean schedules
of direct and vicarious reinforcement.


Resistance to Extinction


Table 3 presents the mean number of all
responses, the mean number of imitative re-
sponses, and the mean total time in extinction
for each group. The term “imitative” refers
here to the use during extinction of the manip-
ulandum on which reinforcement was contin-
gent during acquisition. The subjects who ex-
perienced direct reinforcement during acquisi-
tion were not imitating a model at any time,
and the figures presented for these subjects
serve merely as a basis for comparisons of
the relative extent to which directly and vi-
cariously reinforced responses were main-
tained during extinction. Table 4 summarizes
an analysis of variance based on the mean
number of all responses emitted by each group
during extinction, while Table 5 summarizes
an analysis of variance based on the mean
number of imitative responses emitted by each
group during extinction.

In terms of the total number of all re-
sponses emitted during extinction, a variable
ratio 20% schedule produced greater resis-
tance to extinction than did a variable ratio
80% schedule, regardless of the type of rein-
forcement. The variable ratio 20% schedules
also produced greater resistance to extinction
than did the variable ratio 80% schedules in
terms of the total number of “imitative” re-
sponses, that is, those reinforced during ac-
quisition, regardless of the type of reinforce-
ment. The differences in the total time elapsed
during extinction further reflect the greater
resistance to extinction produced by the varia-
ble ratio 20% schedules. These data are con-
sistent with the typical nature of responding
under direct reinforcement conditions (cf.
Lewis, 1960). However, they are in decided
contrast to the generally accepted premise
that more frequently reinforced (or more
“competent”) models are more extensively
imitated (cf. Flanders, 1968), a phenomenon
that may occur during acquisition, but ap-
parently is not maintained during extinction.

Although the direct reinforcement acquisi-
tion procedures required more time and a



TABLE 2
MEAN ACQUISITION PERFORMANCE DATA BY GROUPS


Group | Actual % reinforcement | Trials to criterion | Time to criterion | Correct responses (%) | Rate (responses per minute) | Expectancy at criterion
Direct reinforcement 20% | 7.39 | 85.00 | 4'20" | 36.95 | 20.72 | 2.58
Direct reinforcement 80% | 51.07 | 48.58 | 3'01" | 63.86 | 17.72 | 3.29
Vicarious reinforcement-persistent relevant 20% | 12.69 | 48.00 | 1'40" | 63.43 | 28.67 | 3.33
Vicarious reinforcement-persistent relevant 80% | 56.56 | 42.58 | 1'42" | 70.69 | 25.50 | 3.67
Vicarious reinforcement-nonpersistent irrelevant 20% | 12.17 | 50.50 | 1'38" | 60.82 | 31.08 | 3.29
Vicarious reinforcement-nonpersistent irrelevant 80% | 60.67 | 39.67 | 1'36" | 75.83 | 25.34 | 3.70


TABLE 3

MEANS AND STANDARD DEVIATIONS OF THE TOTAL OF ALL
RESPONSES TO EXTINCTION, THE TOTAL OF IMITATIVE
RESPONSES TO EXTINCTION, AND THE TOTAL TIME IN
EXTINCTION


Group | Total responses | Total imitative responses | Total time
Direct reinforcement 20%
  M | 225.166 | 80.500 | 520.833
  SD | 116.821 | 39.322 | 299.043
Direct reinforcement 80%
  M | 74.416 | 29.666 | 236.250
  SD | 35.477 | 15.370 | 189.042
Vicarious reinforcement-persistent relevant 20%
  M | 453.416 | 140.583 | 660.000
  SD | 205.892 | 68.241 | 260.855
Vicarious reinforcement-persistent relevant 80%
  M | 116.083 | 31.416 | 239.167
  SD | 49.592 | 16.484 | 118.702
Vicarious reinforcement-nonpersistent irrelevant 20%
  M | 251.250 | 76.833 | 395.000
  SD | 96.142 | 29.120 | 174.317
Vicarious reinforcement-nonpersistent irrelevant 80%
  M | 123.666 | 35.750 | 225.833
  SD | 64.333 | 19.620 | 132.044



greater number of trials to criterion than the
vicarious reinforcement acquisition procedures,
the latter procedures tended to increase the
behavior’s resistance to extinction in terms of
the total number of all responses and in terms
of the total number of imitative responses. It
is, of course, conceivable that the vicarious
reinforcement procedures would not continue
to be more efficient if the vicarious reinforce-
ment and direct reinforcement acquisition pro-
cedures were to be equated in terms of the
precise percentages of reinforcement dispensed
during acquisition, or if fewer acquisition
trials were employed. Nevertheless, the pres-
ent data support Bandura and Walters’
(1963) observation that certain behaviors
may be acquired with relatively greater ease
through modeling as opposed to operant pro-
cedures.
The persistent-relevant cues enhanced the 
resistance to extinction of imitative respond- 
TABLE 4


SUMMARY OF ANALYSIS OF VARIANCE OF TOTAL
NUMBER OF ALL RESPONSES TO EXTINCTION


Source of variation | df | MS | F
Schedule of reinforcement (A) | 1 | 758090.8890 | 62.1904**
Type of reinforcement (B) | 2 | 116392.7915 | 9.5483**
A × B | 2 | 79345.0975 | 6.5091*
Within cell error | 66 | 12189.8384
Total | 71

* p < .01.
** p < .001.


ing when the observers were exposed to a 
variable ratio 20% schedule in acquisition 
(t = 2.98, p < .01), but not when they were 
exposed to a variable ratio 80% schedule (t
= .59). It may be speculated that the model’s
verbal cues in the variable ratio 20% condi- 
tions had a greater effect on the observers’ 
subsequent behavior both because the cues 
were more appropriate to the task observed 
and because the task observed was more simi- 
lar to the task later encountered by the ob- 
servers. 
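
The two t values reported above can be checked against the means and standard deviations of imitative responses in Table 3. The following is a minimal sketch, assuming an ordinary two-sample t test with 12 subjects per group and each group's own standard deviation; the article does not state which form of the t test was used, so this is an inference from the arithmetic rather than a description of the author's procedure. The function name `independent_t` is illustrative.

```python
from math import sqrt


def independent_t(mean_a, sd_a, mean_b, sd_b, n):
    """Two-sample t for two groups of equal size n, using each group's SD."""
    standard_error = sqrt((sd_a ** 2 + sd_b ** 2) / n)
    return (mean_a - mean_b) / standard_error


N_PER_GROUP = 12  # twelve subjects per group, per the Procedure section

# Imitative responses, persistent-relevant vs. nonpersistent-irrelevant cues
t_20 = independent_t(140.583, 68.241, 76.833, 29.120, N_PER_GROUP)  # 20% schedule
t_80 = independent_t(31.416, 16.484, 35.750, 19.620, N_PER_GROUP)   # 80% schedule

print(round(t_20, 2), round(abs(t_80), 2))  # 2.98 0.59
```

Both results agree with the values in the text to two decimal places, which suggests that the Table 3 figures and the reported t tests are mutually consistent.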

The resistance to extinction of a vicari- 
ously reinforced behavior may be reinter- 
preted in terms of the imitation of persistence 
at a task. On the basis of the present data, it 
is apparent that persistence will be imitated, 
a conclusion consistent with the results of 
closely related studies (e.g., Bandura, Grusec, 
& Menlove, 1967; Bandura & Kupers, 1964; 
Bandura & Mischel, 1965; Bandura & Wha- 
len, 1966) which indicate that delay-of-grati- 
fication patterns, self-evaluative responses, 
and standards of self-reinforcement displayed 
by a model will be imitated. 


TABLE 5


SUMMARY OF ANALYSIS OF VARIANCE OF TOTAL
IMITATIVE RESPONSES TO EXTINCTION


Source of variation | df | MS | F
Schedule of reinforcement (A) | 1 | 80869.0139 | 61.0797**
Type of reinforcement (B) | 2 | 7359.5415 | 5.5586*
A × B | 2 | 8133.1806 | 6.1429*
Within cell error | 66 | 1323.9912
Total | 71

* p < .01.
** p < .001.

TABLE 6
MEAN RATES OF RESPONDING IN RESPONSES PER MINUTE DURING EXTINCTION

Time | Direct reinforcement 20% | Direct reinforcement 80% | Vicarious reinforcement-persistent relevant 20% | Vicarious reinforcement-persistent relevant 80% | Vicarious reinforcement-nonpersistent irrelevant 20% | Vicarious reinforcement-nonpersistent irrelevant 80%
Overall | 27.31 | 22.48 | 41.09 | 31.23 | 41.30 |
Minute 1 | 33.25 | 25.00 | 37.00 | 33.25 | 39.25 |
Minute 2 | 25.17 | 21.87 | 37.58 | 30.77 | 38.92 |
Minute 3 | 26.00 | 17.86 | 38.75 | 25.87 | 37.70 |
Minute 4 | 26.33 | 17.22 | 41.25 | 30.70 | 38.60 |
Minute 5 | 27.27 | 14.61 | 40.27 | 26.34 | 39.50 |
Minute 6 | 25.89 | 8.25 | 45.00 | 23.60 | 38.10 |
Minute 7 | 24.56 | 14.00 | 41.90 | 18.75 | 37.57 |
Minute 8 | 27.60 | 12.00 | 45.85 | 19.00 | 38.00 |
Minute 9 | 19.77 | 8.00 | 46.50 | 20.00 | 39.50 |
Minute 10 | 21.25 | 8.00 | 47.74 | — | 28.13 |
Minute 11 | 20.15 | 9.00 | 40.05 | — | 29.54 | —
Minute 12 | 30.25 | 12.00 | 38.63 | — | — | —
Minute 13 | 23.67 | 12.00 | 35.67 | — | — | —
Minute 14 | 24.39 | — | 32.00 | — | — | —
Minute 15 | 17.00 | — | 26.33 | — | — | —
Minute 16 | 15.50 | — | 32.47 | — | — | —
Minute 17 | 25.20 | — | 44.57 | — | — | —
Minute 18 | 33.33 | — | 51.00 | — | — | —
Minute 19 | — | — | 67.00 | — | — | —
Minute 20 | — | — | — | — | — | —


Rate of Responding during Extinction 


Table 6 presents the mean overall rates of 
responding in terms of responses per minute 
during extinction and the mean local rates of 
responding within each minute of extinction. 
Table 7 summarizes an analysis of variance 
based on the mean overall rates of responding 
during extinction.

The variable ratio 20% schedules produced 
greater mean rates of responding, particularly 
after the early extinction trials, than did the 
variable ratio 80% schedules, whether the 
reinforcement was direct or vicarious. The
rates of responding decreased earlier for the
variable ratio 80% groups, generally between


TABLE 7


SUMMARY OF ANALYSIS OF VARIANCE OF MEAN OVERALL
RATE OF RESPONDING DURING EXTINCTION


Source of variation | df | MS | F
Schedule of reinforcement (A) | 1 | 798.9998 | 6.4436*
Type of reinforcement (B) | 2 | 1289.5174 | 10.3993**
A × B | 2 | 46.4623 | .3747
Within cell error | 66 | 123.9999
Total | 71

* p < .05.
** p < .001.


the second and fourth minutes of extinction. 
The rates of the variable ratio 20% groups 
were maintained at a higher level, and these
generally did not decrease until after the 
fourth minute of extinction at the earliest. 

The vicarious reinforcement procedures 
produced greater overall rates of responding
than did the comparable direct reinforcement
procedures in extinction, thus reflecting the 
rates of responding characteristic of the ac- 
quisition period. It is possible that, had the 
models responded at a lower rate, the observ- 
ers in the vicarious reinforcement procedures 
would have also. 

The verbal modeling cues tended not to pro-
duce significant differences either in the over-
all or in the local rates of responding. How-
ever, the variations in the local rates in the
two vicarious reinforcement variable ratio 
20% groups, particularly late in extinction, 
were quite consistent with the nature of the 
cues. 
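
As a rough cross-check on the rate measure, a responses-per-minute figure can be derived from the group means in Table 3. The sketch below assumes that the total time in Table 3 is expressed in seconds; the article does not state the unit, so this is an assumption, and the function name `responses_per_minute` is illustrative.

```python
def responses_per_minute(total_responses, total_seconds):
    """Convert a response count and a duration in seconds to a per-minute rate."""
    return total_responses / (total_seconds / 60.0)


# Direct reinforcement 20% group means from Table 3 (time assumed to be seconds)
rate = responses_per_minute(225.166, 520.833)
print(round(rate, 2))
```

The value obtained this way is close to, but not identical with, the overall rate listed for the same group in Table 6. A difference of this kind is to be expected if the tabled rate is the mean of the individual subjects' rates, because the mean of ratios generally differs from the ratio of the means.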


Expectancy of Reinforcement during
Extinction


Table 8 presents the mean overall expec-
tancy for each group as well as the mean local
expectancies expressed in the first, third, and
sixth block of 10 trials during extinction. In
calculating each subject’s expectancy, the five
expectancy buttons were assigned numerical
values of 1, 2, 3, 4, and 5, so that a numeri-
cally greater score represents a greater ex-
pectancy. Table 9 summarizes an analysis of
variance based on the mean overall expec-
tancies of reinforcement during extinction.

The variable ratio 80% schedules, as com-
pared to the variable ratio 20% schedules,
tended to produce greater expectancies of re-
inforcement during the first few extinction
trials. The expectancies of all groups tended
to decrease during the first 30 trials, but the
variable ratio 80% expectancies decreased
more rapidly and reached a lower level during
the early extinction trials. Thereafter, how-
ever, the course of expectancies became quite
complex, with all groups exhibiting immense
variability. There was no simple relationship
wherein the initially high variable ratio 80%
expectancies quickly decreased according to a
straight-line function to a low point at which
responding ceased, while the initially lower


TABLE 8
MEANS AND STANDARD DEVIATIONS OF EXPECTANCY OF
REINFORCEMENT OVERALL AND IN SELECTED
BLOCKS OF 10 TRIALS DURING EXTINCTION


Group | Overall | Block 1 | Block 3 | Block 6
Direct reinforcement 20%
  M | 2.33 | 2.79 | 2.31 | 2.37
  SD | | .31 | | .62
Direct reinforcement 80%
  M | 2.47 | 3.18 | 2.35 | 2.20
  SD | .39 | .60 | .16 | .33
Vicarious reinforcement-persistent relevant 20%
  M | 2.59 | 2.99 | 2.98 | 2.70
  SD | .56 | .10 | .30 | .61
Vicarious reinforcement-persistent relevant 80%
  M | 2.70 | 3.08 | 2.81 | 2.68
  SD | | | |
Vicarious reinforcement-nonpersistent irrelevant 20%
  M | 2.45 | 3.02 | 2.93 | 2.63
  SD | .56 | .22 | .43 | .65
Vicarious reinforcement-nonpersistent irrelevant 80%
  M | 2.61 | 3.00 | 2.81 | 2.41
  SD | .55 | .45 | .72 | .74
TABLE 9


SUMMARY OF ANALYSIS OF VARIANCE OF MEAN OVER-
ALL EXPECTANCY OF REINFORCEMENT
DURING EXTINCTION


Source of variation | df | MS | F
Schedule of reinforcement (A) | 1 | .3376 | 1.3045
Type of reinforcement (B) | 2 | .3637 | 1.4053
A × B | 2 | .0048 | .0184
Within cell error | 66 | .2588
Total | 71


variable ratio 20% expectancies slowly de- 
creased throughout extinction. In addition, 
there were no significant differences in the 
overall expectancies of the groups as a func- 
tion of the schedule of reinforcement. 
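
Each F value in the analysis-of-variance summaries is the ratio of an effect's mean square to the within-cell error mean square. The sketch below recomputes the F column of Table 9 from its MS column as a consistency check; the function name `f_ratio` and the dictionary layout are illustrative choices and are not part of the original analysis.

```python
def f_ratio(ms_effect, ms_error):
    """F statistic: effect mean square divided by error mean square."""
    return ms_effect / ms_error


MS_ERROR = 0.2588  # within-cell error mean square, Table 9 (df = 66)

table_9_ms = {
    "Schedule of reinforcement (A)": 0.3376,
    "Type of reinforcement (B)": 0.3637,
    "A x B": 0.0048,
}

for source, ms in table_9_ms.items():
    print(f"{source}: F = {f_ratio(ms, MS_ERROR):.4f}")
```

The recomputed ratios match the tabled F values to within rounding of the four-decimal mean squares, and the same check can be applied to Tables 4, 5, and 7.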

Although the direct and vicarious acquisi- 
tion procedures appeared to produce expec- 
tancies of reinforcement which followed gen- 
erally comparable courses throughout extinc- 
tion, the latter procedures tended to produce 
expectancies greater than those produced by 
the direct-experience procedures during the 
early extinction trials. However, there were
no significant differences in the overall ex-
pectancies as a function of the type of rein-
forcement. It is interesting to note that, in
general, theories of extinction that are based 
on the notion of expectancies would predict 
that the greater expectancies held by the 
vicarious-experience subjects would produce 
earlier extinction than the lower expectancies 
held by the direct-experience subjects. Such a 
prediction would not be supported by the re- 
sults of the present research. It seems unlikely 
that the expectancies alone served as the pri- 
mary determinant of the specific details of per- 
formance during extinction, or that relatively 
simple expectancy principles, such as those 
proposed by Humphreys (1939) and Bisese 
(1966), might explain the various details of 
the performances in extinction. 
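
The expectancy measure discussed in this section rests on a simple scoring rule: each of the five expectancy buttons is assigned a value from 1 to 5, and scores are averaged within blocks of 10 trials. A minimal sketch of that scoring follows. The button labels and the 1-to-5 values come from the Apparatus and Results descriptions, while the function name `block_means` and the sample sequence of presses are hypothetical.

```python
from statistics import mean

# Button labels in left-to-right order, scored 1 through 5 as in the article
EXPECTANCY_SCORES = {
    "Sure Not Win": 1,
    "Fairly Sure Not Win": 2,
    "?": 3,
    "Fairly Sure Win": 4,
    "Sure Win": 5,
}


def block_means(presses, block_size=10):
    """Mean expectancy score for each consecutive block of trials."""
    scores = [EXPECTANCY_SCORES[label] for label in presses]
    return [
        mean(scores[start:start + block_size])
        for start in range(0, len(scores), block_size)
    ]


# Hypothetical subject: confident for 10 trials, then increasingly doubtful
presses = ["Sure Win"] * 10 + ["?"] * 5 + ["Sure Not Win"] * 5
print(block_means(presses))
```

A subject's overall expectancy would be the mean over all of that subject's extinction trials, so subjects who stop early contribute fewer blocks; this may be one source of the variability in later blocks noted above.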


CONCLUSIONS 


The present research sustains several gen- 
eral conclusions that may be summarized in 
terms of the independent variables.


Direct and Vicarious Reinforcement 


The most significant conclusion that may 
be drawn from the data is that comparable 
schedules of direct and vicarious reinforce-
ment appear to produce essentially parallel
effects on behavior during extinction in terms 
of resistance to extinction, the extent to which 
the previously reinforced response is emitted, 
rate of responding, and expectancies of rein- 
forcement. Although the performances of the 
vicariously reinforced subjects were not pre- 
cisely identical with the performances of the 
directly reinforced subjects, the obtained dis- 
crepancies were not regarded as reflecting sig- 
nificant intrinsic differences between the two 
types of reinforcement. 

The primary practical implications of the 
research derive from the similarity of the 
effects produced by direct and vicarious rein- 
forcement. The pervasive extent to which 
modeled acquisition performances were imi- 
tated during extinction, and the fact that 
vicarious acquisition procedures may require 
less time and effort than direct-experience 
acquisition procedures with large groups of 
individuals learning complex behaviors, sug-
gest that procedures employing vicarious
acquisition may be of considerable value in
applied or therapeutic settings.

The primary theoretical implications of the
research also derive from the similarity of the
effects produced by direct and vicarious rein-
forcement. Bandura (1969) argued that the
traditional operant paradigm is not entirely 
applicable to vicarious reinforcement. How- 
ever, the present data do not obviate the ne- 
cessity of recourse to different sets of prin- 
ciples to account for the effects of the two 
types of reinforcement. A liberally interpreted 
operant paradigm appears viable. 

Within the operant model, S → R → S^r, S
may have reference not only to an overt dis- 
criminative modeled stimulus, but also to the 
covert discriminative modeled stimuli provided 
by the observer on a symbolic and imaginal 
level. Similarly, R may be construed as repre- 
senting covert responses as well as overt 
matching responses. Covert operant responses 
may be conveniently described as “coverants” 
(cf. Homme, 1965). Bandura (1969) has sug- 
gested that covert responding may entail the 
“symbolic coding and central organization of 
modeling stimuli, their representation in mem- 
ory, in verbal imaginal codes, and their subse-
quent transformation from symbolic forms to
motor equivalents [p. 127].” Finally, S^r may
denote either an overt reinforcing stimulus or
a covert stimulus present on a symbolic and
imaginal level.


Schedules of Reinforcement 


The effects of the schedules of reinforce-
ment were consistent with previous research on
direct reinforcement (Jenkins & Stanley,
1950; Lewis, 1960). The variable ratio 20%
schedules produced greater resistance to
extinction than did the variable ratio 80%
schedules, in terms of total responses and
imitative responses. The variable ratio 20%
schedules generally produced greater rates of
responding, and ultimately produced a greater
expectancy of reinforcement.

The data were in contrast to the findings
(cf. Flanders, 1968) that suggest that more
competent (or more frequently reinforced)
models are more extensively imitated. During
extinction, the variable ratio 20% schedules
of reinforcement produced more extensive
imitation of the models and a greater number
of imitative responses than did the variable
ratio 80% schedules. These results are, how-
ever, consistent with the partial reinforcement
effect obtained with schedules of direct rein-
forcement. Although the present research was
not designed to permit a complete evaluative
comparison of the various theories of extinc-
tion and the partial reinforcement effect, it
was apparent that the data could not be
completely explained in terms of a simple
expectancy theory.


Discriminative Modeling Cues


The discriminative or differentially informa-
tive properties of vicarious acquisition proce-
dures noted by Bandura (1969) occurred
primarily as a function of the nature of the
reinforcement contingencies, rather than as a
function of the model’s verbal cues or affec-
tive expressions. The model’s verbal cues ap-
parently served to enhance the partial rein-
forcement effect with vicarious reinforcement,
but only when the cues were more consistent
with the model’s performance and when the
model’s task was relatively more similar to the
task later encountered by the observer.
ERRATA


In the article “Can Self-Perception Theory Explain the Findings of Harvey and
Mills (1971)?” by Judson Mills and John Harvey in the May 1972 issue, there was
an error in the second paragraph; lines 9-12 should read:


“Our major finding was that subjects in our revocable-commitment conditions who
were given an opportunity to record a substitute speech at an inconvenient time and
place showed more attitude change toward consistency with the action than the
subjects in our irrevocable-commitment conditions who were not given the difficult
opportunity to revoke their counterattitudinal action.”


In the article “Risky Shift is Eminently Rational” by Eugene Burnstein, Harold
Miller, Amiram Vinokur, Stuart Katz, and Joan Crowley in the December 1971
issue, the tenth line of the first paragraph in the second column on page 463 should
read:


“The relation between pr and the utilities also shows what changes in utilities are
necessary for changes in risk level.”

MACHIAVELLIANISM, JUSTIFICATION, AND ATTITUDE
CHANGE FOLLOWING COUNTERATTITUDINAL
ADVOCACY


MICHAEL BURGOON,1 GERALD R. MILLER,2

Michigan State University

AND STEWART

General Motors Institute


This study investigated the following interaction hypothesis: When counter-
attitudinal messages are prepared for an uncommitted audience under condi-
tions of low justification, low Machiavellians will report significantly greater
self-persuasion than high Machiavellians; but when messages are prepared
under conditions of high justification, high Machiavellians will report signifi-
cantly greater self-persuasion than low Machiavellians. The subjects wrote
counterattitudinal essays advocating elimination of college student draft de-
ferments under conditions of either high or low justification, manipulated in
terms of the number of grade points awarded for participation. The results
supported the hypothesized interactive relationship between Machiavellianism
and justification. When the justification was low, attitude change was greater
for low than for high Machiavellians; given high justification, the opposite
relationship was observed. The findings are discussed in terms of their impli-
cations for recent dissonance theorizing, which stresses the importance of
aversive audience consequences as a determinant of attitude change following
counterattitudinal advocacy.


1 Now at San Fernando Valley State College.

2 Requests for reprints should be sent to Gerald R.
Miller, Department of Communication, Michigan
State University, East Lansing, Michigan 48823.


Recent dissonance theorizing (Aronson,
1968; Bodaken & Miller, 1971; Carlsmith,
1968; Cooper & Worchel, 1970; Helmreich &
Collins, 1968; Nel, Helmreich, & Aronson,
1969) holds that attitude change following
counterattitudinal advocacy results from the
advocate’s concern that his communication
behavior may produce aversive consequences
for his audience. Obviously, such concern is
likely to be accentuated or tempered by
numerous individual difference variables. The
present study stemmed from the assump-
tion that persons who score high in Machia-
vellianism will experience few ethical qualms,
and consequently little dissonance, about the
possible negative audience outcomes of their
counterattitudinal messages. Traditionally, the
Machiavellian is one who uses others for
his own purposes. Recently Christie and Geis
(1970) have posited that the high Machiavel-
lian views others as objects to be manipulated
rather than as people with whom to empathize
and that, in addition, he lacks concern about
conventional moral sanctions against lying,
cheating, and other forms of deceit. If their
analysis is accurate, high Machiavellians
should experience little dissonance because of
moral factors associated with untruthful
counterattitudinal encoding.

Moreover, prior research has indicated that
high and low Machiavellians do, indeed, re-
spond differently to dissonance-arousing situ-
ations. Bogart, Geis, Levy, and Zimbardo
(1970) found that low Machiavellians who
cheated in a high dissonance condition re-
ported less favorable posttest endorsements of
conventional morality, an effect not observed
for high Machiavellians. In a study involving
counterattitudinal advocacy, Feiler (1970)
reported that the dissonance theory prediction
of attitude change following counterattitudinal
encoding was supported for low Machiavel-
lians but not for high Machiavellians. Finally,
in a study similar to the present investigation,
Epstein (1969) also confirmed the prediction
that low Machiavellians would demonstrate
greater self-persuasion than high Machiavel-
lians following counterattitudinal advocacy.


However, her attempt to induce high and low
justification by manipulating the attractive-
ness of the sponsor of the counterattitudinal
request failed, since both sponsors were per-
ceived positively.

The present investigators examined post-
encoding attitude change by high and low
Machiavellians in a situation in which coun-
terattitudinal messages were ostensibly in-
tended for an uncommitted audience and were
prepared under high or low justifying circum-
stances. Given low justification, low Machia-
vellians should experience considerable dis-
sonance; for not only is justification for
counterattitudinal advocacy low, but in addi-
tion, the uncommitted audience enhances the
likelihood that the message will have the
deleterious effect of persuading others to ac-
cept a position the advocate himself does not
believe. By contrast, the preceding morally
centered consideration should result in little
dissonance for high Machiavellians. Moreover,
the absence of a high level of extrinsic justi-
fication should also result in less dissonance
arousal for high Machiavellians, since they
are likely to derive more satisfaction from
their probable success in manipulating others.
Thus, when compared to their low Machiavel-
lian counterparts, high Machiavellian coun-
terattitudinal advocates should experience less
dissonance and hence report less attitude
change following counterattitudinal encoding.

When high justification is provided for pre-
paring counterattitudinal messages for an un-
committed audience, the expected relationship
between Machiavellianism and self-persuasion
is altered. High justification should give rise
to an incentive rather than a dissonance ef-
fect (Elms & Janis, 1965; Janis, 1968;
Janis & Gilmore, 1965). Bogart et al. (1970)
have proposed that attitude change for high
Machiavellians may be predicted more accu-
rately by incentive theory. In addition, situa-
tions involving preparation of a counteratti-
tudinal message for an uncommitted audience
under highly justifying circumstances fit an
incentive paradigm more unequivocally for
high Machiavellians than for low Machiavel-
lians. Not only are such incentivist processes
as biased scanning heightened by high justi-
fication, but, in addition, high Machiavellians
should experience little conflict because of
their perceptions that they may persuade the 
audience to accept the counterattitudinal 
position. By contrast, low Machiavellians’ 
knowledge that they are being handsomely 
rewarded for counterattitudinal encoding may 
be balanced by the sobering, potentially nega- 
tive incentive that their behavior may result 
in aversive audience consequences. Thus,
when justification is high and the audience is 
uncommitted, high Machiavellian advocates 
should demonstrate greater self-persuasion 
than low Machiavellians. 

Taken as a whole, the preceding reasoning 
culminates in the central interaction hypothe- 
sis tested in the present study: When counter- 
attitudinal messages are prepared for an 
uncommitted audience under conditions of 
low justification, low Machiavellians will re- 
port significantly greater self-persuasion than 
high Machiavellians; but when messages 
are prepared under conditions of high justi- 
fication, high Machiavellians will report 
significantly greater self-persuasion than low 
Machiavellians. 


METHOD 
Subjects 


The subjects were 125 undergraduate students en- 
rolled in either a freshman communication course 
or a junior public address course at General Motors
Institute. The subjects were assigned to one of four 
experimental conditions or a control condition. 


Independent Variables 


The subjects were assigned to high or low Machia- 
vellian groups on the basis of their scores on a 
20-item, Likert-type instrument developed by Christie 
and Geis (1970). Each item is scored from 1 (dis- 
agree strongly) to 7 (agree strongly). A constant of 
20 is added to the total score for each person; thus, 
scores range from 40 (the lowest possible Machiavel- 
lian score) to 100, the scale midpoint, to 160 (the 
highest possible Machiavellian score). Christie and 
Geis report a split-half reliability of .79 for the 
instrument, which has been used in a number of 
studies. In the present study, subjects who scored
less than 100 were assigned to the low Machiavellian 
condition, while those who scored 100 or above were 
classified as high Machiavellian. 
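
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration of the arithmetic stated above (20 items scored 1 to 7, a constant of 20, a cutoff at the midpoint of 100). The function names are hypothetical, and the sketch assumes the item responses have already been keyed in the Machiavellian direction, a step the text does not describe.

```python
def mach_score(item_responses, constant=20):
    """Total score: the sum of twenty 1-7 item responses plus the constant of 20."""
    if len(item_responses) != 20:
        raise ValueError("the scale has 20 items")
    if any(r < 1 or r > 7 for r in item_responses):
        raise ValueError("each item is scored from 1 to 7")
    return sum(item_responses) + constant


def mach_group(score, cutoff=100):
    """Scores below the midpoint are classified low; scores at or above it, high."""
    return "low" if score < cutoff else "high"


lowest = mach_score([1] * 20)    # every item at 1
midpoint = mach_score([4] * 20)  # every item at the neutral point
highest = mach_score([7] * 20)   # every item at 7
```

Under these assumptions the three extreme cases reproduce the range given in the text: 40, 100, and 160.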

Justification was manipulated by varying the num- 
ber of grade points offered for engaging in counter- 
attitudinal advocacy. In the two courses from which 
subjects were drawn, final grades are calculated on a 
50-100 scale. Subjects assigned to the low justifica- 
tion condition were told that if they chose to write 
the counterattitudinal message, their final course
grades would be raised 1 point. By contrast, high 
justification subjects were promised a grade increase 
of 5 points for writing the message. To heighten 
their awareness of the magnitudes of justification, 
all of the subjects were told that some instructors 
were giving 5 points for participation, while others 
were giving 1 point. The subjects were then told 
how many points they would receive. 

To check the efficacy of the justification manipula- 
tion, all the subjects completed a postencoding 
semantic-differential-type item asking them to rate 
the adequacy of the reward offered for essay writing. 
The possible scores ranged from 1 (inadequate) to
7 (adequate).


Dependent Variable


To measure the subjects’ attitudes toward the ex-
perimental issue, four 7-interval semantic-differential-
type scales, bounded by the polar adjectives good-bad,
valuable-worthless, fair-unfair, and pleasant-unpleas-
ant, were employed. Since each scale was scored on
a 1-7 basis, a score of 4 reflects a maximally un- 
favorable attitude toward the experimental issue, 
while a score of 28 represents a maximally favorable 
attitude. The dependent measure used was the pretest 
through postencoding attitude change score. 
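
As a rough illustration of the dependent measure, the sketch below sums four 1-7 scale ratings and takes the difference between the postencoding and pretest totals. The function names and the example ratings are hypothetical, and the sign convention (postencoding minus pretest, so that positive values mean a more favorable attitude) is an assumption; the text does not state how the change score was signed.

```python
def attitude_score(ratings):
    """Sum of four 1-7 semantic-differential ratings (possible range 4 to 28)."""
    if len(ratings) != 4:
        raise ValueError("four scales were used")
    if any(r < 1 or r > 7 for r in ratings):
        raise ValueError("each scale is scored from 1 to 7")
    return sum(ratings)


def attitude_change(pretest, postencoding):
    """Assumed convention: postencoding total minus pretest total."""
    return attitude_score(postencoding) - attitude_score(pretest)


# Hypothetical subject: ratings on good-bad, valuable-worthless,
# fair-unfair, and pleasant-unpleasant at the two sessions.
change = attitude_change(pretest=[2, 2, 3, 1], postencoding=[4, 4, 4, 3])
```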


Experimental Procedures 


Nine weeks before the actual experiment, the sub- 
jects in nine sections of the two courses completed 
the pretest questionnaire, ostensibly to solicit student 
opinion on possible issues to be used in a project 
in another class. Included on the questionnaire were 
items to assess the subjects’ attitudes toward five 
controversial issues and the 20-item Machiavellian 
scale. Because almost all of the students were op- 
posed to the abolition of student draft deferments, 
that topic was selected as the experimental issue. 

Following the pretest session, each intact class 
section was randomly assigned to high justification, 
low justification, or control conditions. Four class 
sections composed the high justification condition, 
while three other sections were assigned to the low 
justification condition. Two classes became a control 
condition that only completed attitude measures at 
the pretest and experimental sessions. Moreover, the 
subjects were assigned to high or low Machiavellian 
groups in the manner previously discussed. Since 
intact class groups were assigned to create high and 
low justification conditions, it was necessary to en-
sure that differences in mean Machiavellian scores
did not exist between the justification conditions.
A t test indicated no difference between those clas-
sified as high Machiavellians in the high and low
justification conditions (t < 1, df = 44); similarly,
the low Machiavellians in the high and low justifica-
tion conditions did not differ on mean Machiavellian
scores (t < 1, df = 46).
At the subsequent experimental session, all subjects 
were given the proper form for their experimental 
condition of the following letter while the experi-
menter read the message aloud. Since classes were
assigned as intact groups to the treatment conditions,
each class was promised either 1 or 5 grade points
for preparing the essays:


For the past several weeks I have been working
for the International Communication Association
preparing for a youth conference in Detroit next
month. We are interested in what it takes to make
a persuasive message. What we do is get some
indication of the attitudes people have on certain
issues and then, at some later date, we present
certain arguments in the form of speeches, essays,
debates, various methods, which are designed to
get them to change or strengthen their attitudes.

Let me explain briefly what we are attempting
to do in this project that you will be helping
with. Sometime ago we administered questionnaires
to all of the high school students who would be
attending this youth conference in Detroit asking
them their opinion on the abolition of student
draft deferments. Analyses of these questionnaires
indicate that these people are undecided on the
issue.

We have asked over 10 students like you to
create messages that are for abolishment of stu-
dent draft deferments. We are looking for the
things in a message that are most likely to change
people’s attitudes. We are using this class since
it is a communication class, and you have studied
some of the techniques of persuasion.

I have talked to all of your professors about
rewarding you for helping me in this project.
About half of the faculty participating in this re-
search have agreed to raise the final grade of any
student who writes this essay 5 points; other
faculty members are raising the grade 1 point.

Your professor has said that he would raise your
grade 1 (5) point(s) if you choose to write the
essay. Thank you in advance for your help.


The letter contained several manipulations designed 
to increase the subjects’ interest in preparing counter- 
attitudinal essays arguing in favor of abolition of 
draft deferments for college students. To create the 
illusion of an uncommitted audience, the subjects 
were told that data on the high school students’ 
attitudes revealed that they were undecided as to 
whether or not student deferments should be abol- 
ished. To increase public commitment to the task 
and to foster the impression that each subject would 
be personally responsible for persuading the high
school students, the subjects were asked to sign their 
names to the essay “so that they could get credit 
for their ideas.” One high Machiavellian in the high 
justification condition and two in the low justifica- 
tion condition refused to write the counterattitudinal 
essay. Two low Machiavellians in each of the high 
and low justification conditions refused to complete 
the message. Therefore, all subsequent analyses 
were completed on the 118 subjects who wrote the
counterattitudinal essays. 


After the messages were completed, all subjects
filled out the postencoding questionnaire, which con-
tained the same four scales used to tap pretest
attitudes and the item used to check the manipula-
tion of the justification variable. The subjects were
then told the real purpose of the study. When ques-
tioned in the debriefing session, the subjects indicated
that they had not remembered taking the pretest
9 weeks earlier.


RESULTS 


For all statistical tests, the .05 level of 
significance was employed. Analysis of the 
data yielded the following results. 


Justification Manipulation Check 


Table 1 contains the mean ratings of ade- 
quacy of justification for subjects in the four 
experimental groups. The mean ratings for the 
two high justification conditions are above 
the midpoint (4) of the 7-point scale, while 
the mean ratings for the low justification 
conditions are below the midpoint. Thus, 
when compared to the prior study by Epstein 
(1969) ihe present manipulation appears to 
been more successful. Moreover, the 
two-factor analysis of variance yielded a sig- 
{nificant justification effect (F = 28.66, df = 
1/83), indicating that low justification sub- 
jects felt that the number of points awarded 
. for counterattitudinal advocacy was signifi- 
cantly less adequate than the number awarded 
their high justification counterparts. Thus, the 
general pattern of the results suggests that 
the justification manipulation was successful. 


Experimental and Control Group 
Attitude Change 


A simple analysis of variance of the pretest
attitude scores revealed that the five group
means did not differ significantly (F = 1.5,
df = 4/113). The overall attitude change for
the five conditions is shown in Figure 1. Com-
parison of each experimental condition to the
control using Scheffé’s test revealed that the
high justification-high Machiavellian and the
low justification-low Machiavellian conditions
differed significantly from the control group,
while the remaining two conditions did not.
It should be noted that the two conditions
that differ from the control are precisely those
predicted to show the greatest amount of
pretest through postencoding attitude change.


TABLE 1

RATINGS OF ADEQUACY OF JUSTIFICATION
FOR WRITING THE ESSAY

Group                                    n^a    M

High justification-high Machiavellian    23     5.01
High justification-low Machiavellian     20     5.21
Low justification-high Machiavellian     22     ZIS
Low justification-low Machiavellian      22     2.93

Note. The range of possible scores is 1 (inadequate) to 7 (adequate).
^a The 31 subjects in the control group completed only the
attitude measures.


FIG. 1. Amount of attitude change for subjects in
experimental and control groups.


Test of Hypothesis 


Two-factor analysis of variance of the atti- 
tude change scores for subjects in the four 
experimental groups yielded the predicted sig- 
nificant interaction between Machiavellian- 
ism and justification (F = 4.01, df = 1/83). 
Neither main effect was significant. Moreover, 
comparison of the appropriate pairs of means 
using t tests revealed that the pattern of the
interaction conforms closely with theoretical 
expectations. Low Machiavellian subjects in 
the low justification condition reported sub- 
stantially greater attitude change following 
counterattitudinal encoding than did high 
Machiavellian-low justification subjects
(t = 9.05, df = 42, one-tailed test). While the 
difference in attitude between high justifica- 
tion-high Machiavellian and high justifica- 
tion-low Machiavellian subjects is not as 
great, it is in the expected direction and is 
significant (t = 3.92, df = 41, one-tailed test).
Thus, taken as a whole, the findings provide
strong support for the interaction hypothesis 
tested in the study. 
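
The degrees of freedom reported in this section can be checked against the group sizes in Table 1. The sketch below is only a bookkeeping aid under the usual conventions (error df for an analysis of variance equals total N minus the number of groups; df for an independent-groups t test equals n1 + n2 - 2); the variable and function names are hypothetical.

```python
# Group sizes from Table 1, keyed as (justification, Machiavellianism).
cell_n = {
    ("high", "high"): 23,
    ("high", "low"): 20,
    ("low", "high"): 22,
    ("low", "low"): 22,
}
control_n = 31  # control subjects completed only the attitude measures


def anova_error_df(group_sizes):
    """Within-groups df: total N minus the number of groups."""
    return sum(group_sizes) - len(group_sizes)


def t_test_df(n1, n2):
    """df for an independent-groups t test."""
    return n1 + n2 - 2


two_factor_error_df = anova_error_df(list(cell_n.values()))
five_group_error_df = anova_error_df(list(cell_n.values()) + [control_n])
low_just_df = t_test_df(cell_n[("low", "low")], cell_n[("low", "high")])
high_just_df = t_test_df(cell_n[("high", "high")], cell_n[("high", "low")])
```

These values agree with the error df of 83 and 113 and the t-test df of 42 and 41 given in the text.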


Discussion 


Results of the present study constitute ad- 
ditional support for the theoretical distinctions 
between high and low Machiavellians posited 
by Christie and his associates. Consistent with 
expectations, high Machiavellians placed in a 
strongly dissonance-producing situation did not
alter their attitudes toward the advocated 
belief-discrepant position as much as their low 
Machiavellian counterparts. As indicated pre-
viously, this finding is consistent with the
results of earlier research by Bogart et al.
(1970), Feiler (1970), and Epstein (1969).
At least two reasons can be advanced for
the minimal self-persuasion observed in the
low justification-high Machiavellian condi-
tion. First, negative sanctions against lying
and deception may be less relevant to high
than to low Machiavellians. Such an interpre- 
tation is in keeping with the high Machiavel- 
lian’s proposed tendency to view people as 
objects of manipulation rather than fellow 
humans worthy of empathic response. Obvi- 
ously, such an amoral posture, which views 
lying and deception as irrelevant, would pre- 
vent the onset of dissonance caused by 
engaging in counterattitudinal advocacy.

But beyond the conceivable irrelevance of 
these negative factors, it is possible that the 
ostensible availability of an uncommitted, 
potentially persuasible audience may provide 
positive incentives for the high Machiavel- 
lians. Unlike the low Machiavellian counter- 
attitudinal advocate, high Machiavellian ad- 
vocates may derive satisfaction from their 
likely success in manipulating others, even 
though they do not accept the position they 
are championing. In short, certain dimensions 
of the behavior of high Machiavellians may 
closely correspond with individuals who have 
a high need to influence. For such persons, 
the role of devil’s advocate—particularly if 
chances for successful influence are high— 
may be intrinsically rewarding, regardless of 
the content of the argument. In fact, it might 
be useful to correlate scores of the Machiavel- 
lian scale used in the present study with 
scores on Uleman’s (1965, 1966) projective
need to influence test, the assumption being
that a positive relationship would be observed.

The reversal in the magnitude of self-
persuasion for high and low Machiavellians in
the high justification condition again attests
to the importance of carefully delineating
those conditions that will produce a disso-
nance-based relationship between justification
and self-persuasion and those that will result
in an incentive theory outcome. Just as Carl-
smith, Collins, and Helmreich (1966) and
Linder, Cooper, and Jones (1967) demon-
strated that variables such as the degree of
publicness of the counterattitudinal act and
the amount of decision freedom provided for
performing counterattitudinal advocacy influ-
ence whether or not an incentive or disso-
nance effect will occur, the present study sug-
gests that the individual difference variable
of Machiavellianism is an important deter-
minant of the relationship between justifica-
tion and self-persuasion. Given high Machia-
vellianism, self-persuasion is more likely to
occur under conditions calculated to produce
positive incentives; when Machiavellianism is
relatively low, dissonance-producing circum-
stances appear to exert strong pressures for
attitude change. As research continues to iso-
late these interactive relationships, it be-
comes increasingly apparent that within the
realm of counterattitudinal advocacy, incen- 
tive and dissonance theories are comple- 
mentary, not antagonistic. 

Several writers (Miller, in press; Miller 
& Rokeach, 1968) have argued for the impor- 
tance of individual difference variables as 
mediators of the self-persuasive effects of 
counterattitudinal advocacy. Prior studies by 
Hunt and Miller (1968), Rebach (1968), and 
Berger (1969) have established theoretically 
coordinate relationships between such person- 
ality variables as dogmatism and need to 
influence and attitude change following coun-
terattitudinal advocacy. The present study
indicates that Machiavellianism is also an 
important determinant of self-persuasion ef- 
fects. The extent to which its influence is 
heightened or reduced by other variables as- 
sociated with the counterattitudinal advocacy 
situation remains a matter for future research. 
A METHOD FOR INVESTIGATING AND REPRESENTING A
PERSON’S IMPLICIT THEORY OF PERSONALITY:


THEODORE DREISER’S VIEW OF PEOPLE¹


RUSSELL JONES 
University of Kentucky 


SEYMOUR ROSENBERG²


Rutgers University 


Terms and phrases that refer to relatively stable physical and psychological traits 
were extracted from Theodore Dreiser's A Gallery of Women. A measure of trait 
co-occurrence for each pair of traits, based on the proportion of times the two traits 
were attributed to the same character, was used as input for clustering and multi- 
dimensional scaling. The resulting structures were interpreted in terms of selected, 
independently measured properties of the traits. The most relevant properties for 
interpreting these trait structures are (a) sex of character, (b) conformity, (c) 
evaluation, and (d) potency (hard-soft). The first property is discussed in terms of 
Dreiser's involvements with women, and the second property is discussed in terms 
of his intense struggles with conformity pressures. Evaluation (which is not ortho-
gonal to potency in this study) is relatively weak by contrast with the usual finding
reported in the literature. Biographical data indicate that Dreiser was extremely 
evaluative in his personal life, and the weakness of this property in Gallery may 
simply reflect a novelistic style that attempts to avoid polarized impressions of the 
characters. The implications of these findings for identifying general dimensions
of person perception are discussed. The method is applicable to a variety of indi-
viduals, where it may not be desirable or even possible to administer rating scales
and checklists and where the format and content of such standard instruments may
not permit the individual to express his impressions adequately.


The concept of a person’s implicit theory of 
personality is used by psychologists to refer to 
(a) the categories that the person employs to 
describe the range of abilities, attitudes, inter- 
ests, physical features (and accoutrements), 
traits, and values that he perceives in himself 
and others and (b) the beliefs that the person 
holds concerning which of these perceived 
characteristics tend to go together and which 
do not. 

The “theory” is characterized as “implicit” 
because a person’s categories and beliefs are 
inferred from his spontaneous descriptions and 
expectations about particular individuals and 
groups rather than being stated by him as a 
formal theory. It is unlikely that an individual 
could make many of his categories and beliefs 
explicit or, given these basic elements, that he 
could organize them into a formal and parsi- 
moniously stated theory of personality. Im-
plicit theories, and there are presumably as
many such theories as there are individuals,
albeit with many overlapping elements and 
relations, have also been dubbed “common- 
sense,” “lay,” and “naive” to distinguish 
them from scientific theories of personality. 

How are a person’s categories and beliefs 
extracted from his descriptions of people and 
then represented in at least a quasi-formal 
manner? One proposal is that a person’s 
descriptions of others on a set of rating scales 
be thought of as defining a distribution of 
points in a multivariate space (Cronbach, 1955, 
1958). A person’s implicit theory is represented 
in terms of the means, variances, and covari-
ances of this multivariate space: the mean
may be regarded as the person’s stereotype,
the variance as the person’s tendency to dif- 
ferentiate along a given dimension, and the 
covariance as his perceived relation between 
variates or traits. 
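
To make the three statistics concrete, the following sketch computes them for one hypothetical rater who has judged four persons on two scales. The ratings, the scale names, and the use of the n - 1 divisor are all assumptions made for illustration; none of them comes from Cronbach's papers or from the present study.

```python
def mean(xs):
    """Average rating on one scale: the rater's 'stereotype' on that trait."""
    return sum(xs) / len(xs)


def covariance(xs, ys):
    """Sample covariance (n - 1 divisor): the perceived relation between traits."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def variance(xs):
    """Spread of ratings on one scale: the tendency to differentiate among persons."""
    return covariance(xs, xs)


# Hypothetical ratings of four persons by a single rater on two 7-point scales.
ratings = {
    "warm": [6, 2, 5, 3],
    "sociable": [7, 1, 5, 3],
}

stereotype_warm = mean(ratings["warm"])
differentiation_warm = variance(ratings["warm"])
relation = covariance(ratings["warm"], ratings["sociable"])
```

A positive covariance here would be read as the rater's belief that warm persons tend also to be sociable.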

The problem with these conventional statis- 
tics is that they are applicable primarily to 
rating scales, checklists, or other instruments 
that limit the respondent to a fixed format 
and to trait categories selected in advance by 
the investigator. When the respondent is per-
mitted to describe persons with terms of his
own choosing, conventional statistics usually
fail to capture the aspects of such “free re-
sponse” data that are of most interest to the
investigator.

The potential superiority of free response
protocols, and the reason their use has been
advocated by a number of writers (Hastorf,
Richardson, & Dornbusch, 1958; Kelly, 1955;
Rosenberg & Sedlak, 1972), is that they might
contain important information about an indi-
vidual’s categories and beliefs, information
not captured by a fixed-response format. More-
over, it is not desirable, feasible, or even
possible to administer a standard instrument
to some individuals of interest, for example,
writers, politicians, psychotherapy patients.
The present analysis of Theodore Dreiser is a 
good example. 

The techniques described in this paper were
used to extract the traits ascribed by Dreiser
to the characters in his A Gallery of Women
(1929), to measure the relations he perceived
among these traits, and to provide a parsimo-
nious structural representation of this system
of traits and beliefs. The advance over pre-
vious work, which focused simply on identify-
ing which general categories of description
(appearance, emotional adjustment, interests,
etc.) are found in free response protocols
(e.g., Beach & Wertheimer, 1961; Dornbusch,
Hastorf, Richardson, Muzzy, & Vreeland,
1965), was made possible partly by the applica-
tion of certain developments in clustering
(Johnson, 1967) and multidimensional scaling
(Kruskal, 1964a, 1964b; Shepard, 1962a,
1962b; Torgerson, 1958).

The application of clustering and multi- 
dimensional scaling starts with any set of 
objects—in the present case, the physical and 
psychological trait categories used by Dreiser 
in his descriptions of the characters in A 
Gallery of Women. The basic datum required 
for the clustering and scaling of the traits is 
a number for each pair of traits reflecting how 
frequently the two traits in a pair were per- 
ceived by Dreiser as going together in the same 
character. This datum was obtained for each
pair of traits, using a measure based on the 
proportion of times the two traits were actually 
ascribed to the same characters, relative to 
the total usage of the two terms. 
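
The exact formula for the co-occurrence measure is not given in this passage. As a hedged illustration only, the sketch below uses one plausible reading of "proportion of joint ascriptions relative to the total usage of the two terms": twice the number of characters given both traits, divided by the summed usage of the two traits. The function name, the formula, and the three example characters are assumptions, not the authors' procedure.

```python
def cooccurrence(trait_a, trait_b, characters):
    """Assumed measure: 2 * (characters with both traits) / (usage of a + usage of b).

    `characters` maps each character's name to the set of trait
    categories ascribed to that character.
    """
    both = sum(1 for traits in characters.values()
               if trait_a in traits and trait_b in traits)
    usage = sum((trait_a in traits) + (trait_b in traits)
                for traits in characters.values())
    return 2 * both / usage if usage else 0.0


# Hypothetical characters and trait sets, for illustration only.
characters = {
    "A": {"young", "lonely"},
    "B": {"young"},
    "C": {"young", "lonely", "tall"},
}

young_lonely = cooccurrence("young", "lonely", characters)
tall_lonely = cooccurrence("tall", "lonely", characters)
```

Under this reading the measure is 1 when two traits are always ascribed together and 0 when they never share a character.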
Clustering analysis provides what might be
considered a hierarchical typology of the im- 
plicit theory; that is, the nonoverlapping clus- 
ters of traits may each be considered to be a 
basic personality type in Dreiser’s view of 
people. Although the notion of hierarchical 
organization among personal constructs is 
prominently mentioned by Kelly (1955), there 
has been no empirical work on the usefulness 
of this type of structure. 
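
How nonoverlapping clusters can be formed from pairwise co-occurrence values may be sketched as follows. The example uses a simple single-link agglomerative rule cut at one similarity threshold; this is a generic stand-in chosen for brevity, not necessarily the clustering scheme applied in the study, and the trait names and similarity values are invented.

```python
def single_link_clusters(items, similarity, threshold):
    """Repeatedly merge the two most similar clusters until the best
    remaining link falls below `threshold`.

    The link between two clusters is the largest similarity between
    any member of one and any member of the other (single link).
    """
    clusters = [frozenset([item]) for item in items]

    def link(c1, c2):
        return max(similarity(a, b) for a in c1 for b in c2)

    while len(clusters) > 1:
        pairs = [(link(clusters[i], clusters[j]), i, j)
                 for i in range(len(clusters))
                 for j in range(i + 1, len(clusters))]
        best, i, j = max(pairs, key=lambda p: p[0])
        if best < threshold:
            break
        merged = clusters[i] | clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return set(clusters)


# Hypothetical co-occurrence values between four trait categories.
SIM = {
    frozenset(("young", "pretty")): 0.8,
    frozenset(("solemn", "grave")): 0.7,
    frozenset(("pretty", "solemn")): 0.2,
    frozenset(("young", "solemn")): 0.1,
    frozenset(("pretty", "grave")): 0.1,
    frozenset(("young", "grave")): 0.0,
}


def lookup(a, b):
    return SIM[frozenset((a, b))]


types = single_link_clusters(["young", "pretty", "solemn", "grave"], lookup, 0.5)
```

Lowering the threshold step by step and recording each merge would yield the nested, hierarchical arrangement referred to above.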

Multidimensional scaling, by contrast, gen- 
erates a configuration of traits, this configura- 
tion being interpretable in terms of a set of 
trait dimensions. The question then is: What 
are these dimensions? To what extent do 
they include, for example, the basic dimensions 
of evaluation, potency, and activity usually 
found in semantic spaces? 

Multidimensional scaling was first used by 
Rosenberg and Sedlak (1972) with a sample of 
free response descriptions solicited from college 
students. In that study, scaling solutions were 
obtained only for the trait co-occurrences 
summed over the entire subject sample. Analy- 
sis of individual subjects was not attempted 
because meaningful solutions require a much 
larger corpus of free response descriptions 
than an individual can usually give in an hour 
or two of experimental time. The choice of a 
prolific writer such as Dreiser is thus not only a 
matter of intrinsic interest in a writer of his 
stature, but also extremely convenient from 
the standpoint of providing sufficient material 
for analysis. 

Dreiser’s A Gallery of Women consists of 
separate “fictionalized” portrayals, or stories, 
of 15 different women known by Dreiser. 
Each story also contains descriptions of an 
assortment of other characters (family, friends, 
lovers) more or less oriented around the hero- 
ine. A Gallery of Women was chosen from 
among Dreiser’s writings because the focus is 
on character description rather than story 
telling—an ideal emphasis from our stand- 
point. Also, the description of any given char- 
acter was completely contained within 20-50 
printed pages—the length of a story—so that 
it was not necessary to search through an 
entire novel to extract all the descriptions 
concerning a person. Finally, the number of 
different persons described by Dreiser is con- 
siderably larger than that found in a typical 
novel, and the entire collection of the 15
portrayals is no longer than that of a single 
novel. The larger the number of persons de- 
scribed and the more diverse they are, the more 
likely it is that the trait categories and mea- 
sures of co-occurrence among these categories 
are representative of the elements and rela- 
tions in the individual’s implicit personality 
theory.


METHOD 


The several steps involved in representing Dreiser's 
view of people are described in detail in this section. 


Extraction of Character Descriptions 


An attempt was made to extract every linguistic unit 
that described a stable characteristic of each person 
mentioned in A Gallery of Women. 

The guidelines for identifying such linguistic units 
were based on procedures previously worked out by 
Rosenberg and Sedlak (1972) for coding the person- 
ality descriptions that they had solicited from college 
students.

Physical or psychological traits described by Dreiser
with one word were easily identified, for example,
young, interesting, tall, and lonely. Where Dreiser used a 
phrase or clause (and these were the more frequent 
units), a judgment was made as to whether the particu- 
lar phrase could be divided into two units, each de- 
scriptive of a different characteristic, or whether it 
could be listed as one unit. For example, the phrase
“enjoyed an expensive and wasteful life . . .” was
broken up into the two units: enjoyed an expensive— 
life and enjoyed—wasteful life. Similarly the phrase 
“—would most carefully and liberally provide for her 
child . . .”’ was listed as would most carefully—provide 
for her child and would most—liberally provide for her 
child. 

The guidelines used by Rosenberg and Sedlak (1972) 
to identify traits in solicited personality descriptions 
were found to be incomplete in dealing with narrative 
materials.

First, Dreiser often attributed human traits to a 
character's dwelling, work, clothes, and other posses-
sions and products, whereas college students never did.
These traits were included in the list associated with the 
character when they appeared to tell something about 
the character’s makeup; for example, her studio was 
really gay, her canvases were lush, smooth flowing graceful 
dresses, his house had a large stiff parlor. 

Second, Dreiser's narratives gave rise to many more 
questions concerning whether a characteristic of an in- 
dividual was indeed being described in a statement.
These questions were typically resolved by including 
the term or phrase in the list; for example, a roundness 
to her fancy was included, although it is not clear 
whether a personality characteristic is described by this 
phrase. The general principle was to include items in the
list when in doubt, but with the following exceptions:
(a) kinship terms, unless they were used metaphoric-
ally; (b) numerical age; (c) relationships with particular
other characters; (d) comparisons with particular other
characters (“A is more skillful than B”); and (e) any 
clearly temporary mood or state. 

The additional rules necessitated by the use of nar-
rative material were evolved by the present authors in
discussions following the independent reading of each
of three stories. The second author then extracted the
trait terms from the remaining 12 stories. T these
12 stories were also coded by six students in an under-
graduate seminar. Although no formal check was made,
it was our impression that the rules successfully com-
municated the coding procedure to the students.

The units were extracted verbatim. If the unit was
more than one word, the key word was recorded first,
followed by a comma and the rest of the unit:
un—to satisfy her lover in any way; sacrifice, wished to—
something to the great cause. If a unit contained two or
more key words, the unit was listed separately for each
key word: innocent artfulness and artful, innocent—ness.
Each trait was identified as to the character to whom it
was attributed and the person attributing the trait, usu-
ally the author (Dreiser), but often another character
in the story.

As an example, the units extracted for one of the
minor characters, Olive Brand's father in Olive Brand,
are listed below in the order in which they appeared in
the story:

SOURCE         DESCRIPTIVE UNIT

Dreiser        mathematics, occupied the chair of
Dreiser        professor
Dreiser        banker
Dreiser        elder in a church
Dreiser        vestryman in a church
Dreiser        baptist
Dreiser        quiet, his home was in a—street
Dreiser        solemn, his home was very
Dreiser        middle-western, his home was very
Dreiser        long face
Dreiser        grave face
Dreiser        bearded
Olive Brand    solemn
Olive Brand    profound
Olive Brand    conspicuous ornament of his church
Olive Brand    ornament, conspicuous—of his church
Dreiser        bearded
Dreiser        whiskerando
Dreiser        commonplace, represented all the—s
Dreiser        solemn, represented all the—ities
Olive Brand    commonplace


A total of 6,761 such units were extracted, attributed
to a total of 241 characters. The largest number of units
attributed to a character was 353 and the smallest was
1, with a median of 6 units.


Grouping Units into “Trait Categories” 


In order to reduce the number of discrete trait descriptions
to a manageable number of trait categories,
certain variations in phrasing were collapsed into one
trait category, which was then identified with a key trait
name. Table 1 is a list of the 99 most frequently occurring
trait categories, identified by their key trait names,
with their frequency in all 15 stories.

The rules for collapsing descriptive units into categories
were based on guidelines described by Rosenberg
and Sedlak (1972) and were as follows:


1. It was necessary but not sufficient that a descriptive
unit contain the basic morpheme or morpheme
compound in the name identifying the trait category.
Thus, different grammatical forms of the same morpheme
or morpheme compound were grouped together.
In addition, it was necessary that the meanings were
judged to be similar. For example, beauty was grouped
with beautiful, and sincerity with sincere, whereas artful
was not grouped with artistic.

2. Descriptions that were accompanied by modifiers
that did not change the meaning of the trait name to a
substantially different meaning were placed in the same
category. For example, quite intelligent, extremely intelligent,
and rather intelligent were all grouped with intelligent,
whereas not so remarkably intelligent was not
grouped with intelligent but rather with limited intelligence.

3. Traits that were otherwise qualified were also included
in the same category as the unqualified trait
name; for example, read interesting books; read much,
principally the classics; reader of much poetry were all
grouped with reads.

4. The negation of a trait with not or never was categorized
with the trait prefixed by un-, in-, or im-.


Measures of Trait Co-occurrence


Two separate indexes of trait co-occurrence were calculated.
One index, denoted s, is a measure of direct
co-occurrence between two traits. The second index,
denoted δ, is a measure of profile distance between two
traits and is based on the s measure.

The s measure was constructed so that a zero value
(psychologically close together) for a pair of traits
meant that when one of the traits was attributed to a
character, so was the other, and vice versa; a value near
1 (psychologically distant) meant that the two traits
were almost never attributed to the same characters.
The full theoretical range of the s measure is between
0 and 1, inclusive.


The δ measure includes a measure of direct co-occurrence
between traits i and j but also gives weight to the
fact that i and j may co-occur independently with a
third trait k. That is, one character may have been described
with traits i and k but not with j, whereas a
second character may have been described with traits
j and k but not i. Thus, the δ measure incorporates both
direct and indirect ties that exist between any given
pair of traits.

Clustering and multidimensional scaling were per- 
formed for each measure, and the results were com- 
pared in order to determine which method, if either, 
provided the better measure of psychological distance.
Only the traits in Table 1 were subjected to these analy- 


3 There are 784 trait categories with two or more
tokens, and a list of these categories may be obtained
from the authors.


TABLE 1

THE 99 MOST FREQUENTLY OCCURRING TRAIT CATEGORIES
IN A Gallery of Women

Trait category                                        Frequency

Young                                                    100
Beautiful                                                 67
Attractive                                                44
Charming, dreamer                                         41
Poetic                                                    39
Interesting, worker                                       32
Artistic                                                  29
[...]y                                                    26
Practical, romantic, writer                               25
Conventional, girl                                        24
Free, strong                                              23
Woman                                                     22
Unhappy                                                   20
Intellectual, radical, sensitive, sensual                 19
Kind, cold                                                18
Vigorous                                                  17
Able, drinker, generous, troubled                         16
Colorful, graceful, intelligent, poor, reads,
  religious, studies, tall                                15
Fool, good-looking, literary, pagan                       14
Aspirant, determined, good, had means, sad,
  society person, not strong, tasteful                    13
Ambitious, careful, defiant, different, erratic,
  genial, happy, lonely, man, varietistic                 12
Clever, genius, indifferent, Irish, manager,
  nice, old, pale, quiet, restless, serious,
  shrewd, sincere, successful, suffering,
  sympathetic, thin, understanding                        11
American, communist, crazy, critical,
  emotional, enthusiastic, fearful, fighter,
  forceful, great, handsome, hard, lovely,
  had money, painter, physically alluring,
  playful, repressed, reserved, skill,
  sophisticated                                           10


ses since 99 is the upper limit for the number of objects 
that the Rutgers computer could handle. 
The exact definition of s for any two traits i and j is

\[
s_{ij} = 1 - \frac{2 \sum_{l \in L} c_{ijl}}{\sum_{l \in L} f_{il} + \sum_{l \in L} f_{jl}},
\]

where f_{il} and f_{jl} are the number of times traits i and j,
respectively, were attributed to character l; L is the
set of characters described with the 99 traits used in the
analyses; and c_{ijl} = min(f_{il}, f_{jl}). By definition s_{ij} = s_{ji}
and s_{ii} = 0.
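
As an illustration only, the definition of s can be sketched in a few lines of Python. The function name and the attribution counts below are hypothetical and are not data from the study; each list holds the number of times one trait was attributed to each character, in the same character order.

```python
def s_measure(f_i, f_j):
    """Co-occurrence distance s_ij between two traits.

    f_i[l] and f_j[l] are the numbers of times trait i and trait j were
    attributed to character l (same character order in both lists).
    """
    shared = sum(min(a, b) for a, b in zip(f_i, f_j))  # sum over l of c_ijl
    total = sum(f_i) + sum(f_j)                        # sum of f_il plus sum of f_jl
    return 1.0 - 2.0 * shared / total


# Hypothetical counts for three characters (illustrative, not from the study).
solemn = [2, 0, 1]
bearded = [1, 0, 0]
kind = [0, 3, 0]

s_same = s_measure(solemn, solemn)      # a trait compared with itself
s_overlap = s_measure(solemn, bearded)  # traits sharing one character
s_disjoint = s_measure(solemn, kind)    # traits never attributed together
```

With these made-up counts, a trait paired with itself gives 0, the partially overlapping pair gives 0.5, and the pair that never co-occurs gives 1, matching the stated range of the measure.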

The δ measure is a profile distance measure based on
the s measure. That is, for any two traits i and j,

\[
\delta_{ij} = \sum_{k=1}^{99} (s_{ik} - s_{jk})^2 .
\]

Since s_{ij} = s_{ji} and s_{ii} = 0,

\[
\delta_{ij} = 2 s_{ij}^2 + \text{97 other squared terms}.
\]

The remaining 97 terms give a measure of the extent to
which i and j occur differentially with each of the 97
other traits.
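
The profile distance can be sketched the same way. The following is a minimal illustration, assuming a small symmetric matrix of s values with a zero diagonal; the 3 x 3 matrix is hypothetical, whereas the study used a 99 x 99 matrix.

```python
def delta_matrix(s):
    """Profile distances: delta[i][j] = sum over k of (s[i][k] - s[j][k]) ** 2."""
    n = len(s)
    return [
        [sum((s[i][k] - s[j][k]) ** 2 for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


# Hypothetical s matrix for three traits (symmetric, zero diagonal).
s = [
    [0.0, 0.5, 1.0],
    [0.5, 0.0, 1.0],
    [1.0, 1.0, 0.0],
]
delta = delta_matrix(s)
```

For traits 0 and 1 the two direct terms contribute 2 x 0.5^2 = 0.5 and the third trait contributes nothing, because both traits are equally distant from it; this is the decomposition into direct and indirect terms described in the text.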


The s and δ measures are each representative of a
large class of measures, any member of which yields the
same results when clustering and nonmetric multidimensional
scaling are used. Nevertheless, other indexes
can be constructed that emphasize somewhat different
features of trait co-occurrence, and hence may lead to
somewhat different structures. Also, asymmetric indexes
of trait implication, that is, where d_{ij} ≠ d_{ji}, have been
avoided because of the complications that then arise
in the application of clustering and scaling.


Measurement of Trait Properties 


Ratings of the 99 traits in Table 1—that is, those 
subjected to clustering and multidimensional scaling— 
were obtained on 12 different scales. The purpose of 
these ratings was to interpret the clustering and scaling 
results. In the case of clustering, the ratings can be used 
to determine the properties that traits within clusters 
tend to share and that traits in different clusters do not. 
For example, are some clusters composed primarily of
positive traits and other clusters of negative traits?
In the case of scaling, the interpretation of a trait space
involves the determination of which properties are significant
dimensions in the space. Rating scales representative
of the semantic differential were included in
the 12 scales, for example, to determine whether evaluation,
potency, and activity are present in Dreiser’s
multidimensional trait space as three more or less orthogonal
dimensions. Answers to these questions involved
the use of certain statistical methods that are described
when the clustering and scaling solutions are
presented.

Interpretations based on biographical information 
about Dreiser are taken up in the Discussion section. 
Such interpretations are more informal and subjective 
than those involving the trait ratings, and the general 


TABLE 2

FACTOR ANALYSIS OF 14 TRAIT PROPERTIES

                                           Factor
Property                        I            II           III          IV

Good-Bad*                      .90          .04          .24          .00
Interesting-Dull               .85          .12         -.19         -.08
Social Good-Bad*               .82          .15          .28         -.15
Intellectual Good-Bad*     [illegible]  [illegible]      .16          .18
Hard-Soft*                 [illegible]  [illegible]  [illegible]      .34

Active-Passive*                .05          .90          .00          .00
Introverted-Extraverted*      -.04         -.86         -.02         -.10
Impulsive-Inhibited*           .19          .86         -.11          .01
Dominant-Submissive*           .10          .80          .25          .24
Young-Old                      .10          .60         -.21         -.25

Conforms-Does Not Conform      .12         -.12          .87         -.04
Decided-Undecided*             .49          .17          .58          .10

Male-Female                    .47          .15         -.16          .72
Frequency of occurrence       -.20         -.06          .45          .70

Note.—Italics indicate highest loading for a given property,
that is, largest number in a given row.
* From Rosenberg and Olshan (1970).
question of interest is whether the clustering and scaling
results show evidence of Dreiser’s own personality and
preoccupations. Such evidence, even if it is intuitive,
would lend support to the methodological approach
used in this study in its ability to tap important features
of the individual being studied.

The raters were 144 Livingston College (Rutgers
University) undergraduates. Each rater was given [...]
the restriction that the same number of raters was assigned
to each scale. The 99 traits were listed in one of
two random orders.

The rating instructions were similar in format for all 
scales, the main difference being the labels on the ends 
of the scales. The good-bad property is used in the fol- 
lowing set of instructions to illustrate the format. If the 
property were active-passive, for example, these two 
terms would appear instead of good and bad. The
instructions were as follows:


We would like for you to rate each of the personality
traits listed on the attached sheet according to
whether a person who exhibited each of the traits
would be a GOOD or a BAD person. In making your
ratings use the following scale:


GOOD   1   2   3   4   5   6   7   BAD


Nine of the 12 scales were the same as those used in a
previous study of personality perception (Rosenberg &
Olshan, 1970), and they include the three semantic differential
dimensions: evaluation, potency, and activity.
The three other scales were suggested by an examination
of the five-dimensional configuration. The 12 scales
are listed in Table 2. Also listed are the two variables
male-female, which refers to the proportion of times
Dreiser used a trait to describe a female character, and
frequency, which refers to the overall frequency of occurrence
of a trait category in A Gallery of Women (see
Table 1).

The median rating was obtained for each of the 99 traits
on each of the 12 scales.4 Intercorrelations among the 14
properties were then obtained and subjected to a principal-axis
factor analysis. Factors with eigenvalues
greater than or equal to 1.00 were rotated by a varimax
rotation. Four factors emerged, and the loadings on
each factor are shown in Table 2.

Each property was placed into one of four groups according
to the factor on which it loaded most heavily.
The grouping is shown in Table 2, with the highest
loading for each property shown in italics.

The two groups with five properties in each group
are interesting in that they suggest that Factor I is
evaluation and that Factor II is activity. Except for
hard-soft, a potency scale, which loads about equally on
these two factors (and Factor IV to some extent also),
none of the properties load strongly on more than one
factor. The third and fourth groups in Table 2 have two
properties each. The presence of only two properties,
particularly in the third group, makes generalizations


4A paper by Rosenberg and Jones (1972) reports 
(a) median ratings of 406 trait terms used in various 
studies (including the 99 in this study) on nine of the 
rating scales in Table 2 and (b) factor analyses based 
on these rating data. 
about Factor III difficult. In the case of Factor IV, the
clustering of male-female and frequency of occurrence is 
partially explicable in terms of the fact that more lin- 
guistic units (tokens) were attributed to women than 
to men (A Gallery of Women is mostly about women). 

It should be noted that the factors in this analysis 
may or may not be relevant in the interpretation of the 
clustering and scaling solutions. The factors reflect
differences in the trait vocabulary that Dreiser used
but not necessarily the degree to which these traits
co-occur in the characters that he described. As already
noted, the question is which, if any, properties or fac- 
tors are present in the structures obtained from co- 
occurrence data. This question is taken up in the next 
section. 


RESULTS 
Clustering Analysis and Interpretation 


The clustering analysis was performed with
a computer program written by Johnson and
based on his 1967 Psychometrika article. The
program uses input distances (or proximities)
as a basis for partitioning the objects (here,
traits) into a set of nonoverlapping clusters.
The clustering is hierarchical, which means
that a sequence of clusterings is produced in
which each cluster of a succeeding clustering
is a merging of one or more clusters from the
preceding clustering. Each successive clustering
is produced when an increasingly larger distance
is taken as the criterion for putting
traits into the same cluster.

The tree diagram in Figure 1 shows three of 
the clusterings generated by Johnson’s (1967) 
diameter method. The leftmost part of the 
tree shows which traits merge when the maxi- 
mum distance (s measure) between traits 
within a cluster is .756. Thus, the three s 
values between fighter, forceful, and pagan are 
all less than or equal to .756, as are the six 
distance values between crazy, great, erratic,
and sincere. A cluster may consist of only one 
trait. For example, the distance between 
religious and all of the other traits is greater 
than .756. The next higher level in the diagram 
shows the clusters that are formed when the 
maximum distance permitted between traits 
within any cluster is .810. For example, two 
of the clusters at .756, one consisting of artistic, 


Fig. 1. Tree diagram of three levels of clustering of
the 99 traits used most frequently in A Gallery of 
Women. (Each level is optimal with regard to a trait 
property in distinguishing the trait clusters at that 
level.) 
unhappy, literary, generous, and had means and
the second consisting of intelligent and successful,
form a single cluster at .810. A number of
clusters remain unchanged in membership:
for example, the first six clusters in the tree.
The rightmost part of the diagram indicates
the set of trait clusters when the maximum s
within any cluster is .826.
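
The idea of cutting a diameter-method hierarchy at a maximum within-cluster distance can be sketched as follows. This is a minimal complete-linkage illustration written for this passage, not Johnson's program; the function name and the 4 x 4 distance matrix are hypothetical.

```python
def diameter_clusters(dist, threshold):
    """Merge clusters while the largest within-cluster distance stays <= threshold.

    dist is a symmetric matrix of distances between objects 0..n-1.
    At each step the two clusters whose union has the smallest diameter
    (largest pairwise distance) are merged, as in complete linkage.
    """
    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = max(dist[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        if best[0] > threshold:
            break  # any further merge would exceed the allowed diameter
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical distances among four traits (illustrative values only).
dist = [
    [0.00, 0.30, 0.70, 0.90],
    [0.30, 0.00, 0.80, 0.90],
    [0.70, 0.80, 0.00, 0.95],
    [0.90, 0.90, 0.95, 0.00],
]
low_cut = diameter_clusters(dist, 0.50)
mid_cut = diameter_clusters(dist, 0.80)
high_cut = diameter_clusters(dist, 1.00)
```

Raising the threshold from 0.50 to 0.80 to 1.00 in this toy example yields three clusters, then two, then one, which parallels the way successive levels of the tree diagram are read.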

The clustering program actually produced a 
larger number of clusterings than the three 
diagrammed in Figure 1, the first at a distance 
value of .359 and the last at 1.000, at which 
value all traits merge into a single cluster. 
The three clusterings in Figure 1 were selected 
on the basis of statistical analyses designed to 
discover which trait clusterings in the hierar- 
chical sequence have the greatest psychological 
interpretability in terms of the various trait 
properties measured in this study.

The method consists of separate but parallel 
analyses for each property. Consider the hard- 
soft ratings, for example. These ratings can 
be grouped to correspond with the way the 
traits are grouped at any given level of clustering.
Thus, each level of clustering provides
a way of partitioning the total sum of squares
of the 99 hard-soft ratings into a sum between
clusters and a sum within clusters. An F
ratio based on this partitioning was computed
for different levels of clustering starting with
.359. A plot of the exact p value associated
with each F versus the clustering level is
shown in Figure 2 for hard-soft (bottom right
curve) and seven other properties. The eight
properties in Figure 2 were selected to be
representative of the factor structure (Table 2)
of the properties.
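
The partitioning just described can be sketched as a one-way analysis of variance. The text does not spell out the exact form of the F ratio, so the sketch below assumes the conventional ratio of between-cluster to within-cluster mean squares; the function name, trait ratings, and clusters are hypothetical.

```python
def f_ratio(ratings, clusters):
    """F ratio for one rating scale at one clustering level.

    ratings maps each trait to its rating on the scale; clusters is a list
    of lists of trait names. Assumes at least two clusters and more traits
    than clusters, so that both degrees of freedom are positive.
    """
    n = sum(len(c) for c in clusters)
    k = len(clusters)
    grand_mean = sum(ratings[t] for c in clusters for t in c) / n
    ss_between = 0.0
    ss_within = 0.0
    for c in clusters:
        mean = sum(ratings[t] for t in c) / len(c)
        ss_between += len(c) * (mean - grand_mean) ** 2
        ss_within += sum((ratings[t] - mean) ** 2 for t in c)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


# Hypothetical hard-soft ratings on a 1-7 scale (not the study's values).
ratings = {"cold": 1.0, "vigorous": 2.0, "kind": 6.0,
           "good": 5.0, "pale": 4.0, "thin": 3.0}
clusters = [["cold", "vigorous"], ["kind", "good"], ["pale", "thin"]]
f_value = f_ratio(ratings, clusters)
```

The p value plotted against clustering level would then be the upper-tail probability of this F with (k - 1, n - k) degrees of freedom, which requires an F distribution routine not included in this sketch.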

According to Figure 2, hard-soft, male-female,
and conforms-does not conform are
the only properties with at least one p level
less than .01 and, employing this level as a
cutoff, the only measured properties usable
for interpreting the clustering analysis. Also
according to Figure 2, the minimum p levels
for the three properties are at .756, .810, and
.826, respectively, and these are the clustering
levels diagrammed in Figure 1.

Figure 2 also shows that neither the evaluative
factor (good-bad, interesting-dull) nor
the activity factor (active-passive, introverted-extraverted)
is related to the clustering
analysis. The single potency scale
(hard-soft) does tap an underlying basis for
the clustering. However, this potency scale
also loads to some extent on Factor IV, the
“male-female factor.” Finally, young-old,
which failed to show significance, was analyzed
because it loaded rather weakly on the activity
factor and may have represented a new factor.

Fig. 2. Graphs for each of eight trait properties showing the probability level at
which a property distinguishes the trait clusters at different clustering levels.

It should be emphasized that the trait 
ratings were obtained from college students 
and not from Dreiser or Dreiser scholars. It 
may be the case, for example, that evaluation
would be a significant property if the good-bad
ratings were obtained from individuals
who know Dreiser well enough to judge his
perceptions of desirable and undesirable traits.
In fact, such a source of trait ratings may have
resulted in a different pattern of factor loadings
than that summarized in Table 2. There
is biographical evidence, for example, that
suggests that Dreiser disliked conformists
rather intensely. Trait ratings based on such
information about Dreiser may have increased
substantially the correlation between the conformity
and evaluation scales. In any case,
when a positive result obtains, as it did with
hard-soft and conforms-does not conform, it
would appear that a degree of communality
exists between Dreiser’s connotative meanings
of the various trait terms and those of the
college students.

The positive results with three of the properties
do not necessarily mean that a trait cluster
must consist exclusively of either hard or soft
traits, either male or female traits, and either
conforming or nonconforming traits. The average
values of the traits for the different clusters
are distributed over a wide range. The clusters
with the smallest and largest values on each
of the three properties are listed in Table 3.
These clusters may represent extreme “personality
types” in Dreiser’s perception of
people. A discussion of their content, particularly
in connection with certain biographical
information about Dreiser, will be deferred
until after the scaling results are presented.

Finally, a parallel set of analyses was also
performed with the δ measure as input for the
clustering program. The pattern of results
was generally similar to that described with
the s measure. The same three properties were
TABLE 3

TRAIT CLUSTERS WITH THE SMALLEST AND LARGEST
AVERAGE VALUES ON THE THREE PROPERTIES:
HARD-SOFT, MALE-FEMALE, CONFORMS-
DOES NOT CONFORM

Hard                                      Soft

Cold, vigorous (1.5)                      Emotional, playful, sympathetic,
                                            varietistic (5.3)
Fighter, forceful, pagan (1.7)            Good, kind, not strong (5.7)
Indifferent, hard, restless (2.0)         Painter, sensual (6.0)
Aspirant, critical (2.5)

Male                                      Female

Man, tall (18)                            Attractive, charming, girl, cold,
                                            vigorous (80)
Good-looking, careful, handsome,          Defiant, lovely, reserved,
  manager (24)                              understanding (81)
Poetic, intellectual, writer (30)         Pale, thin (81)
Crazy, great, erratic, sincere (32)       Painter, sensual (86)

Conforms                                  Does not conform

Shrewd, skill (2.0)                       Crazy, great, erratic, sincere (5.0)
Aspirant, critical (2.0)                  Genius, old, poor (5.0)
Able, ambitious, good-looking,            Communist, radical, troubled,
  careful, handsome, manager (2.1)          worker (5.0)
American, nice (2.5)                      Dreamer, free, lonely, sad (5.0)
Serious, sophisticated (2.5)              Fighter, forceful, pagan (5.7)

Note.—Number in parentheses is the average value of the
trait cluster on the property. Hard-soft and conforms-does not
conform can range from 1 to 7, and male-female can range
from 0 to 100.


found to be significant, although the minimum
p levels were generally not as small as those
found with the s measure. Young-old also
reached significance at the .01 level. The
remaining properties were not significantly
related to the clusterings based on δ.


Multidimensional Scaling and Interpretation 


The multidimensional scaling was accomp- 
lished by a computer program (Version 3, 
May 1967) developed by Kruskal (1964a, 
1964b), based on his ideas and those of Shepard 
(1962a, 1962b). Multidimensional scaling pro- 
vides a spatial representation of the traits so 
that the distances between the traits in this 
space correspond to the measure of psy- 
chological distance used as input data. The 
scaling program can produce a solution of one 
dimension, in which case the traits are arranged
on a straight line, a solution of two
dimensions, in which case the traits are arranged
on a plane, a solution of three dimensions,
etc.

Two separate sets of solutions were obtained
for the 99 most frequently occurring traits:
one set was computed with the s measure as
input data and the second set with the δ measure
as input data. Each set consisted of separate
solutions for one-, two-, three-, four-, and five-dimensional
spaces.

The program also computes a measure of
the goodness of fit, termed stress, for each
solution. Figure 3 is a plot of the stress of a
solution versus the number of dimensions in
a solution. The stress values for the s measure
form one curve and those for the δ measure
form the second curve.

Stress decreases in value as the interpoint
distances in a multidimensional solution conform
more closely to the input distances,
with 0% indicating that a perfect monotone
relationship exists between input and output
distances. The fact that stress decreases with
increasing dimensionality is to be expected,
of course, since the goodness of fit must necessarily
improve (or remain unchanged) when
more dimensions are used to represent the
input data.

Fig. 3. Goodness of fit of the multidimensional configurations
of the 99 traits used most frequently in A
Gallery of Women. One curve is based on the s measure
as input for the scaling, and the other curve is based on
the δ measure.

The rate at which stress decreases with increasing
dimensionality is often useful, however,
in determining the optimal dimensionality of a
set of objects. For example, even though the program
will arrange the traits on a plane, a three-dimensional
space may be required to capture
adequately the distance relations in the input
data. The stress measure would usually reflect
this fact by decreasing sharply in value between
the two- and three-dimensional solutions.
If additional dimensions are superfluous
(for example, as in a four-dimensional solution
when a three-dimensional solution is
adequate), little drop occurs in the stress
measure. For the sake of parsimony, the
dimensionality after which there is little improvement
in fit is chosen as the optimal
dimensionality, that is, when a definite “elbow”
appears in the type of curve shown in Figure 3.
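
As a rough illustration of what a stress value measures, the sketch below computes Kruskal's (1964a) stress formula 1 for paired lists of input distances and configuration distances, using a pool-adjacent-violators monotone fit. It assumes this formula is the one reported by the program, handles tied input distances in arbitrary order, and uses hypothetical function names and toy numbers; it is not the Kruskal program itself.

```python
import math


def monotone_fit(values):
    """Least-squares nondecreasing fit to a sequence (pool adjacent violators)."""
    blocks = []  # each block is [sum of values, number of values]
    for v in values:
        blocks.append([v, 1])
        # Pool the last two blocks while their means are out of order.
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    fitted = []
    for total, count in blocks:
        fitted.extend([total / count] * count)
    return fitted


def stress(input_dist, config_dist):
    """Stress (formula 1) between input distances and configuration distances."""
    order = sorted(range(len(input_dist)), key=lambda p: input_dist[p])
    d = [config_dist[p] for p in order]   # configuration distances in input order
    d_hat = monotone_fit(d)               # best monotone approximation to them
    numerator = sum((a - b) ** 2 for a, b in zip(d, d_hat))
    denominator = sum(a ** 2 for a in d)
    return math.sqrt(numerator / denominator)


# Toy example with three pairs of objects (illustrative values only).
perfect = stress([1.0, 2.0, 3.0], [10.0, 20.0, 40.0])  # rank order preserved
imperfect = stress([1.0, 2.0, 3.0], [1.0, 3.0, 2.0])   # one rank reversal
```

Multiplying the result by 100 gives the percentage form used in the text. Computing this value for solutions of one through five dimensions and plotting it against dimensionality produces the kind of curve in which an elbow is sought.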

This method is somewhat equivocal with
the present data. There is some suggestion of
an elbow at three dimensions, but the stress
value at that point is still rather high. Actually,
it is high when compared with studies in which
the trait categories were selected by the
investigators (e.g., Friendly & Glucksberg,
1970; Rosenberg, Nelson, & Vivekananthan,
1968; Rosenberg & Olshan, 1970) but not
when compared with studies using a free
response approach (Rosenberg & Sedlak, 1972).

Another basis for judging dimensionality is 
the psychological interpretability of the space. 
As we shall see below, interpretability based 
on the empirical fitting of the 14 trait proper- 
ties decreases after the four-dimensional solu- 
tions, suggesting that this is possibly the ap- 
propriate level of dimensionality for these 
data. 

The results in Figure 3 suggest that the δ
measure behaves more like a distance measure
than the s measure does in that the δ measure
yields configurations with less stress for all
dimensionalities, a finding which corroborates
previously reported comparisons between the
two measures (Olshan, 1970; Rosenberg et al.,
1968). This finding, together with the fact
that the correlation between the s and δ measures
is only .489 (N = 99 × 98 ÷ 2 = 4,851),
suggests that while the two measures are related,
the δ measure does contain new and useful
information about the traits. One probable
source of this information is the greater differentiation
by the δ measure as compared with
the s measure of the highly dissimilar traits.
Also, the spaces based on δ were found to be
more interpretable than those based on s, that is,
interpretable in terms of the statistical fitting
of the various trait properties. For these
reasons and to simplify the presentation, the
discussion that follows is based exclusively on
scaling results obtained from the δ measure.

Fig. 4. Two-dimensional configuration of the 99 traits used most frequently in A Gallery of Women.
(Also shown is the best-fitting axis for each of three trait properties. Each number in parentheses is the
multiple correlation between projections on the axis and the property values.)
Figure 4 shows the two-dimensional configuration
obtained from Kruskal’s program.
This solution, with its relatively high stress
of approximately 28%, is likely to juxtapose a number of
traits that are relatively further apart in the
input data and that undoubtedly do move
away from one another in spaces of higher
dimensionality. Nevertheless, the general configuration
in Figure 4 is quite interpretable.
In fact, the same properties that proved significant
in distinguishing the trait clusters are
the most significant properties in the two-dimensional
space. Moreover, the two-dimensional
solution is the most convenient one for
visually illustrating how a combination of external
properties is used to interpret a multidimensional
configuration.

First, a linear multiple correlation was
calculated for each property to determine
whether the property was relevant to interpreting
the space. The multiple correlation was
calculated by making the property value the 
dependent variable and orthogonal coordi- 
nates of the traits in the configuration the 
independent variables. For the two-dimensional 
solution, for example, the independent varia- 
bles are the arbitrary x and y coordinates by 
which the program specifies the trait con- 
figuration. Solutions of higher dimensionality 
are also each represented by the program with 
a set of arbitrary orthogonal coordinates, and 
these coordinates serve as the independent 
variables in the calculation of multiple corre- 
lations for each of these spaces. 

The axes in Figure 4 were plotted by using
the ratio of the two regression weights obtained
from the corresponding multiple regression
as the slope of a line through the origin.
Each axis is optimally located in terms of
maximizing the linear multiple correlation between
the values of the traits on a given property,
for example, male-female, and the locations
of the traits on the corresponding axis.
(The location of a trait on an axis is the
intersection of the axis with a perpendicular
extended from the trait.) Any rotation of an
axis would decrease the linear multiple correlation,
the value of which is shown in parentheses
in the figure.
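
The axis-fitting procedure for a two-dimensional configuration can be sketched as an ordinary least-squares regression of the property values on the two coordinates. The sketch below is an illustration under that assumption, with a hypothetical function name and made-up coordinates; it requires that the two coordinates not be perfectly collinear.

```python
import math


def fit_property_axis(xs, ys, values):
    """Regress property values on x and y coordinates (with an intercept).

    Returns the two regression weights and the multiple correlation R.
    The fitted axis is the line through the origin with slope by / bx.
    """
    n = len(values)
    mx, my, mv = sum(xs) / n, sum(ys) / n, sum(values) / n
    x = [a - mx for a in xs]
    y = [a - my for a in ys]
    v = [a - mv for a in values]
    sxx = sum(a * a for a in x)
    syy = sum(a * a for a in y)
    sxy = sum(a * b for a, b in zip(x, y))
    sxv = sum(a * b for a, b in zip(x, v))
    syv = sum(a * b for a, b in zip(y, v))
    det = sxx * syy - sxy * sxy           # zero only if x and y are collinear
    bx = (syy * sxv - sxy * syv) / det    # weight for the x coordinate
    by = (sxx * syv - sxy * sxv) / det    # weight for the y coordinate
    predicted = [bx * a + by * b for a, b in zip(x, y)]
    r = math.sqrt(sum(p * p for p in predicted) / sum(c * c for c in v))
    return bx, by, r


# Hypothetical coordinates of four traits and their values on one property.
xs = [0.0, 1.0, 0.0, 1.0]
ys = [0.0, 0.0, 1.0, 1.0]
values = [3.0, 5.0, 4.0, 6.0]   # constructed as 2x + y + 3, so the fit is exact
bx, by, r = fit_property_axis(xs, ys, values)
slope = by / bx
```

In this constructed example the property is an exact linear function of the coordinates, so R equals 1 and the axis has slope 0.5; with real ratings R falls below 1, and its size indicates how well the property is represented in the space.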

Table 4 is a summary of all the multiple 
correlation values for all the properties and 
trait configurations. Considering only the num- 
ber of multiple correlation values that reached 
the .01 level (the criterion also used with the 
clustering analysis), there is a sharp drop in 
the interpretability of the five-dimensional 
solution. This is true in spite of the fact that 
this particular solution was obtained with a 
special run of the program in which the starting 
configuration was the more interpretable four- 
dimensional solution (with random numbers 
for the initial values of the fifth dimension). 
Apparently, a loss in interpretable information 
during the iteration from this initial configura- 
tion can occur in nonmetric scaling. That is, 
information present in a solution of lower 
dimensionality is masked somewhat because 
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TABLE 4 


MULTIPLE CORRELATIONS BETWEEN TRAIT PROPERTIES AND EACH OF THE 


be He EDA. 


ESD 


the solution of higher dimensionality is fitting 
more of the error component in the input 
measure, For this reason, the five-dimens:onal 
solution was dropped from further analysis. 
Table 5 presents the angles betweew the 
fitted axes for properties significant beyond 
the .01 level in the two-, three-, and ëu 
dimensional spaces. Several general impres 
emerge from Tables 4 and 5: (a) Sex and : 
formity are highly significant and mor 
less orthogonal in the various spaces. (b) T} 
angle between hard-soft and male-fen 
increases considerably from the two- to ‘he 
four-dimensional solution, that is, from 10 
degrees to 67 degrees. Dreiser apparently 
thought hardness was related to sex but no 
redundant with it. Thus, the impression from 
Figure 4 alone is somewhat misleading. (¢ 
Similarly, the angle between frequency and 
male-female increases from 3 degrees ‘o 
60 degrees with increasing dimensionality. 
The fact that the frequency with which a 
trait is used is related to (but not isomorphic 
with) the sex of the persons characterized by 
the trait is not surprising since Gallery is more 
about women than men. (d) An evaluative 
cluster emerges in the four-dimensional solu- 
tion—good-bad, interesting-dull, social good- 


TABLE 4
[title partly illegible] . . . THE FIVE MULTIDIMENSIONAL CONFIGURATIONS

                                           Dimensionality
Property                        1          2          3          4          5
Good-Bad                      .025       .150       .320       .363*      .349
Interesting-Dull              .200       .216    [illegible]   .392*      .370
Social Good-Bad               .080    [illegible]   .303       .367*      .384*
Intellectual Good-Bad         .010       .024       .198       .242       .220
Hard-Soft                     .285*      .242       .271       .390*      .360
Active-Passive                .156       .133       .271       .321       .357
Introverted-Extraverted       .239       .231       .312       .363*      .381
Impulsive-Inhibited           .172       .143       .293       .342       .359
Dominant-Submissive           .241       .232       .290       .340       .352
Young-Old                     .064       .125       .280       .315       .326
Conforms-Does Not Conform     .081       .372*      .376*      .385*   [illegible]
Decided-Undecided             .200       .234       .222       .308       .341
Male-Female                   .524***    .581***    .617**     .632***  [illegible]
Frequency of occurrence       .384[?]    .405**     .467***    .476***    .489***

Note. Properties are grouped into sets based on the factor analysis in Table 2.
* p < .01.
** p < .001.
*** p < .0001.
TABLE 5
ANGLES (IN DEGREES) BETWEEN FITTED AXES FOR
PROPERTIES WHOSE MULTIPLE CORRELATION IS
SIGNIFICANT BEYOND .01 LEVEL

Two-dimensional configuration

Property               1     2     3
1. Conforms            —
2. Male               94     —
3. Frequency          91     3     —

Three-dimensional configuration

Property               1     2     3     4
1. Interesting         —
2. Conforms          109     —
3. Male               93    98     —
4. Frequency         148    89    58     —

Four-dimensional configuration

Property               1     2     3     4     5     6     7     8
1. Good-Bad            —
2. Interesting        47     —
3. Social Good-Bad    23   [?]     —
4. Hard-Soft         107   129    18     —
5. Introvert         149   112   130   102     —
6. Conforms          [row illegible]
7. Male               63    90    83    67   140    94     —
8. Frequency         113   149   120    48    90    89    60     —


bad. The angles between any three of these 
properties and sex or conformity vary consider- 
ably: 55 degrees-99 degrees. Nevertheless, the 
evaluative cluster does help interpret the four- 
dimensional space. 

It should be emphasized again, as it was in 
the interpretation of the clustering analysis, 
that the trait ratings were not obtained from 
Dreiser or Dreiser scholars. Positive multiple 
correlations indicate a degree of communality 
between Dreiser’s meanings of the trait terms 
and those of the raters, but the angles between 
properties may reflect as much the implicit 
beliefs held by the raters as by Dreiser. 


DISCUSSION 


Earlier we raised the question of whether 
Dreiser’s trait space was interpretable essen- 
tially in terms of the semantic differential 
factors of evaluation, activity, and potency. 
The answer, based on the data summarized 
in Table 4, is a qualified “no.” First, an eval- 
uative factor is present in the trait space, 
although it emerges clearly only in the four-dimensional
solution. Even here, the evaluative
dimension is considerably weaker than
that usually found in studies of personality 
perception. Second, with the exception of 
introverted-extraverted in the four-dimen- 
sional solution, the multiple correlation values 
for all the activity properties in all the configurations
fall short of the .01 level. Third,
the potency factor as represented by hard- 
soft does appear to be present in the trait 
space, although it is not clearly separable from 
properties not usually identified as potency 
scales. Finally, sex and conformity, which 
emerge as highly significant properties in the 
trait space (and clustering, as well) are not 
strongly related to any of the evaluation, 
activity, or potency properties. Even if sex 
and conformity could be recast as properties 
representing one or more of the semantic 
differential factors, Dreiser’s life and personality
seem to provide a more promising and
interesting basis than the semantic differential
framework for understanding why certain
properties appear in Dreiser’s trait space. 

There is another reason for turning to facts 
about Dreiser in an attempt to understand 
the findings of the present study. Given the 
fact that the approach used in this study for 
systematically extracting an individual’s im- 
plicit theory from his free response descrip- 
tions is relatively untried, the value of this 
approach would be enhanced if the major 
findings about the person’s theory were con- 
sistent with or explicable in terms of evidence 
from other, independent sources. It is fortunate 
that Dreiser’s life has been extremely well 
documented by a number of biographers and 
commentators. The discussion of Dreiser’s
life is necessarily brief here and rests primarily 
on a recent and rather comprehensive bio- 
graphy by Swanberg (1965). 

To begin with, the presence of sex as a 
highly significant property both in the cluster- 
ing and scaling solutions appears to be explica- 
ble in terms of one of the most salient features 
of Dreiser’s life—that of his deep involvement 
with women. The traits typically ascribed to 
men are distinctly different from those as- 
cribed to women. There are at least two 
distinguishable aspects in his relationships 
with women, one having to do with women as 
sexual partners and one having to do with
women as intellectual companions. 

Dreiser’s sexual involvement with women
was often scandalous. From his teens to his
seventies and through two marriages, he was
“indefatigably, compulsively promiscuous”
(Swanberg, 1965). He often carried on affairs 
with several women at a time. One such affair, 
with a 17-year-old girl, was instrumental in 
his loss of the lucrative editorship of Butterick 
Publications. Characteristically, Dreiser incorporated
this affair into his writings. This
17-year-old girl appears as “Flowerface” in The
“Genius,” and the novel itself is a fictionalized 
portrayal of Dreiser as an artist who goes in 
for the pursuit of women. According to W. C.
Lengel, Dreiser’s one-time secretary and long-time
friend, Dreiser was “a hedonist, a voluptuary,
and a varietist. . . . There were always
women. [He] . . . mystified them, charmed
them, captivated them [Lengel, 1962, vii-viii].”

His sexual involvement with women is reflected
at least partly in the “male chauvinist”
traits that show up in his writing. Of the 99 
traits used most frequently in A Gallery of 
Women, the ones associated primarily with 
women, that is, with high values on the male- 
female property, include attractive, beautiful, 
graceful, lovely, physically alluring, and sensual.
(A number of parallel terms were associated
with men: good-looking and handsome.) Dreiser
also deviated noticeably from the contem- 
porary stereotype of the male chauvinist. He 
did not characterize women as more conform- 
ing than men; for example, sex and conformity 
are almost orthogonal in the scaling solutions.
Dreiser himself was drawn to “attractive, 
unconventional women” (his own words). 

He also liked his attractive, unconventional 
women to be “clever,” and he depended on 
women with business acumen and intellectual 
strength. He employed women as his literary
agents and often entrusted the extensive 
editing of the first drafts of his verbose manu- 
scripts to his female companions. A Miss Kusell 
and a Mrs. Campbell chopped off about 15%
of his original version of An American Tragedy 
—a cut at which he “wailed” but to which he 
acquiesced. In our data, this aspect of his 
relation to women shows up in such female 
traits as reads, defiant, intelligent, cold, and 
clever. He reserved for men the traits great,
poetic, sincere, and genius.

The importance of sex in Dreiser’s perception
of people invites speculation about the
pervasiveness of this dimension in person
perception, even if the sexual involvements of
most people do not match those of Dreiser.
Although the sex of the perceiver has been
extensively studied as a variable influencing
perception (see reviews by Tagiuri, 1969; and
Warr & Knapper, 1968), very little work has
been concerned with the influence on perception
of the sex of the person perceived—or, a
variable of possibly greater cogency, the sex
combinations involved in considering both
perceiver and perceived.

The significance of conformity in the clustering
and scaling appears to be related to a
second major theme in Dreiser’s life. From
the very beginning of his literary career,
Dreiser fought with the forces of convention
in society. His first novel, Sister Carrie, was
accepted for publication by Frank Doubleday,
who then tried to cancel the agreement because
Mrs. Doubleday found the manuscript offensive.
The troubles with prudery and convention
continued throughout Dreiser’s life. In
1916, the New York Society for the Suppression
of Vice succeeded in having The “Genius,”
which Dreiser had just completed, banned in
New York City. He came to regard himself as
the force for realism in American letters and
sought out those who shared his nonconformist
views. Much of the time he spent in
New York, he lived in Greenwich Village—a
haven for nonconformists even in Dreiser’s day.
He was attracted by nonconformity in all
forms; it was no accident that the iconoclast
H. L. Mencken was such a strong and persistent
influence in Dreiser’s career.

Swanberg (1965) characterized the salience
of conformity as a dimension in Dreiser’s
conception of people in his formulation of
“Dreiser’s Law.”


Always his friends had been nonconformists (his
mother, . . . , Mencken) and his adversaries the bourgeois
(his father, his wife, the Doubledays, . . .). In
one side of his dual nature his beliefs had crystallized
into a formula that might be called Dreiser’s Law, and
which could damage him as an artist if it took him over
completely. Stripped to its fundamentals, Dreiser’s
Law read:

Beliefs held by the multitude, the bourgeois and their
leaders, are likely to be wrong per se.

An important corollary of the Law read:

Beliefs held by unconventionalists which fly in the face
of orthodoxy are in all probability right.

As extensions of these principles, he viewed the
philistine as a blockhead who accepted his whole pattern
of beliefs as handed down to him by elders, and
who therefore did not think; and, conversely, saw the
rebel as the splendid individualist, the independent
thinker, the only truly creative person [Swanberg,
1965, p. 181].

It is interesting that the scaling also captured
what Dreiser probably saw as frequent accompaniments
to nonconformity: sad, suffering,
lonely, troubled. There is little doubt that he
often experienced these negative feelings as a
result of his own nonconformity.
Dreiser also longed for fame and fortune—
the fruits of conformity—which he finally
achieved with An American Tragedy. It may
be the case that basic conflicts—for Dreiser,
conformity—in the individual’s personality are
projected as preoccupations with traits that
represent both sides of the conflict, and hence
appear as trait dimensions.

The relative weakness of the evaluative 
dimension in this study may reflect more about 
Dreiser’s novelistic style than about his private 
views of people. Dreiser was certainly evalua- 
tive and extremely opinionated in his personal 
life, and yet his writings do not describe 
polarized characters. As Granville Hicks (1965) 
pointed out: “There are no contemptible persons
in Dreiser’s novels. Although at times he
professed a Nietzschean scorn of the masses
. . . he instinctively made the best case possible
for any person he wrote about [p. 229].”
It may be interesting, in this connection, to
analyze the descriptions of people in Dreiser’s
personal writings, for example, a sample
from the large mass of available Dreiser
correspondence.

While the analyses appear to yield results 
interpretable in terms of what is known about 
Dreiser’s orientation to others, it should be
obvious that the clustering and scaling that
have been carried out do not provide a comprehensive
picture of Dreiser’s implicit theory
of personality. The analyses presented here
were based only on the 99 most frequently
occurring trait categories in A Gallery of
Women. It is interesting, in this connection,
to compare the two types of structures, the
typological versus the multidimensional, in 
terms of representing the information about 
these 99 traits. Both structures were inter- 
pretable with the sex and conformity proper- 
ties, the importance of which has just been 
discussed. However, a number of properties 
that were found to be relevant to the interpre- 
tation of the four-dimensional solution were 
not picked up at all as significant for the 
clustering solutions. By this criterion, at least, 
the multidimensional structure is a somewhat 
more sensitive and adequate technique for 
representing (or displaying) Dreiser’s view of 
people than the typological structure. 

The finding that a multidimensional struc- 
ture is more adequate than a tree structure in 
the present instance does not, of course, imply 
that it will be so in general. In fact, in a related 
study⁵ using a sample of college students, the
tree structure appears to do a better job than 
the multidimensional structure in representing 
their perceived relations among traits. 

In summary, the study of Dreiser has yielded 
a number of methodological and substantive 
gains. 

On the methodological side, the study 
demonstrates that feasible techniques now 
exist for obtaining a parsimonious description 
of an individual’s view of people from his free 
response protocols. Developments in the area 
of computer-assisted content analysis should 
facilitate the study of other individuals by 
reducing the labor now involved in extracting 
character descriptions. 

On the substantive side, the study of 
Dreiser indicates that perceived trait dimen- 
sions that are highly related to an individual’s 
life and personality can be uncovered from his 
free response descriptions. In the case of 
Dreiser, these trait dimensions could not be 
readily recast into the semantic differential 
structure that typically emerges from aggre- 
gated group data. Even if an individual’s per- 
ceived dimensions can be cast into such a 
structure, many important details about his 
categories and beliefs are probably best ex- 
plicated in terms of the individual’s particular 
life and personality. 


⁵ R. A. Jones and S. Rosenberg. Structural representations
of naturalistic descriptions of personality.
Unpublished manuscript, 1971.
General dimensions of person perception are
nevertheless necessary in the long-run develop- 
ment of theory, and an affective dimension 
such as evaluation is likely to be part of such a 
theory. The study of Dreiser suggests that sex 
of the perceived may be another general 
dimension of person perception. However, the 
study also indicates that research designs will 
have to be sensitive to what constitutes good 
and bad traits, male and female traits, etc., for 
different individuals. 
REACTIONS TO UNCONDITIONAL COOPERATION:
A FIELD STUDY EMPHASIZING VARIABLES NEGLECTED IN
LABORATORY RESEARCH


J. WILLIAM DORRIS * 


University of California, Los Angeles 


A review of research on “unconditional cooperation” suggests that design
features common to laboratory bargaining experiments mitigate the effectiveness
of this strategy in eliciting cooperation. A field study was designed to
remove these factors. The subjects were rare coin dealers who were offered a
chance to buy a small collection of rare coins from a naive, “unconditionally
cooperative” seller. Two theoretical variables were manipulated by the seller's
explanation of why he was selling the coins: (a) whether or not a “moral”
appeal was made and (b) whether or not information given to the dealer indicated
that a prior dealer had been exploitative or fair toward the seller. The
size of the coin shop, either large or small, was considered as a third independent
variable. The data analyzed included price offers, decision time, eye
contact and distance at time of first offer, and content ratings of comments
made by the dealer. The major findings indicated that the “moral” appeal
elicited a higher price offer for the coins than did the “neutral” appeal, and
that this difference was attributable to considerable helping behavior by the
moral-appeal dealers combined with some exploitative attempts by the
neutral-appeal dealers. These findings are discussed in relation to research on helping
behavior, to pacifist bargaining tactics, and to the findings of the current study
due to information given about a prior dealer’s behavior. The results of the
current study were compared with those of laboratory research on “unconditional
cooperation” in light of differences in experimental settings.


Laboratory bargaining experiments have 
consistently found that high levels of “uncon- 
ditional cooperation” by one player elicit 
considerably more exploitation than reciprocal 
cooperation (cf. Bixenstine, Potash, & Wilson, 
1963; Lave, 1965; Shure & Meeker, 1967; 
Solomon, 1960; Swingle & Gillis, 1968; 
Whitworth & Lucker, 1969). A review of these
experiments suggests that a number of variables
associated with the laboratory setting
in which the “unconditional cooperation” 


1 This study is based on a PhD dissertation in 
social psychology completed at the University of 
California, Los Angeles, in 1970, and financed by 
National Science Foundation Grant GS 1121X2 to 
Harold H. Kelley. The author is particularly indebted
to Harold H. Kelley for his help throughout all
phases of the research. Peter M. Bentler, Phillip
Bonacich, Harold H. Kassarjian, and Gerald H. Shure 
made valuable comments regarding experimental 
design and theoretical interpretations. 

Computing assistance was obtained from the Health 
Sciences Computing Facility, University of California, 
Los Angeles, sponsored by National Institutes of 
Health Special Resources Grant RR-3. 

* Requests for reprints should be sent to J. Wil- 
liam Dorris, Department of Psychology, University 
of Massachusetts, Amherst, Massachusetts 01002. 


occurs mitigate its effectiveness in eliciting 
reciprocal cooperation (cf. Nemeth, 1970). 
These variables include (a) instructions which 
either tacitly or explicitly encourage exploita- 
tion, suspend reciprocity norms, and make 
“unconditionally cooperative” behavior incomprehensible
and (b) experimental conditions
that conceal the players’ identities
and severely restrict normal channels of
communication and social influence.
In the present research, a field experiment 
was designed to overcome these features of 
the laboratory approach. In contrast to the 
laboratory bargaining studies, the subjects in 
the present study were unaware of their par- 
ticipation in an experiment and hence were 
not uncertain about the relevance of normal 
ethical considerations. Nor was there an 
“experimenter” to provide them, tacitly or 
explicitly, with approved means of exploita- 
tion. Nor was the subject anonymous or im- 
mune to social influence. He was interacting 
face-to-face with a person who had chosen 
to trust him specifically, and who clearly 
explained the reasons for his “unconditionally 
cooperative” behavior. Within these experimental
conditions, the effects of varying two
aspects of the “unconditionally cooperative”
behavior were considered: (a) whether or not
the “cooperator” made a moral appeal to the 
subject and (b) whether or not the informa- 
tion given to the subject suggested that a 
previous subject had been exploitative. 

In order to insure its impact, the moral 
appeal was based on three factors: (a) em- 
phasizing the relevance of moral standards in 
evaluating the subject’s behavior, (b) making 
the subject fully responsible for the outcome 
of the interaction, and (c) appealing for help 
on the basis of a legitimate need. Earlier lab- 
oratory research has shown that when the first 
two of these factors are combined with “un- 
conditional cooperation,” higher levels of re- 
ciprocal cooperation are elicited than when 
they are omitted (cf. Meeker & Shure, 1969). 
The moral appeal also included a request for 
help. To the extent that the basis of this 
request was perceived to be legitimate by the 
subject, it would be expected to evoke a 
“norm of social responsibility” which pre- 
scribes that one help those in need (cf. 
Berkowitz & Daniels, 1963). In general, 
greater effectiveness for the unconditional 
cooperator was predicted when he is allowed 
to make a clear moral appeal. 

The second experimental variable involved 
providing the subject with the information 
that an earlier person had acted in either 
an exploitative or nonexploitative fashion 
when confronted with the situation he now 
faced. Providing the subject with this information
is seen as giving him a standard with
which he can compare his alternative behaviors
in order to determine their appropriateness.
Research on social comparison processes
(cf. Festinger, 1950, p. 272) and modeling of
nonnormative behavior (cf. Freed, Chandler,
Mouton, & Blake, 1955; Lefkowitz, Blake, &
Mouton, 1955) suggests that while he is offered
a morally questionable means of making
a sizable monetary gain, the subject may
compare himself with the earlier subject whose
behavior may either increase or diminish his
ethical qualms. Thus, one might expect the
subject’s behavior to reflect that attributed to
the earlier subject. If he had attempted
exploitation, so will the subject; if he had
been honest, the subject will be honest also.

Although the research previously cited suggests
that subjects’ behavior will reflect that
attributed to earlier subjects, there are reasons
to expect otherwise. A contrary result may be
predicted from earlier research in social perception
by Hovland, Harvey, and Sherif
(1957). They found that if a person holds an
attitude on a particular issue, statements that
reflect an attitude similar to his own will be
acceptable to the person, and he will judge
them to be closer to his own attitude than
they actually are (“assimilation”). Conversely,
statements indicative of an opposing
attitude will be rejected and perceived as
being more different from his attitude than
they actually are (“contrast”). These authors
also present evidence that the more important
the issue is to the person, the smaller will be
the range of discrepant statements that he
finds acceptable. Applying [these findings to]
the present research, the su[bject's concern]
over the ethics of his behavior [may be viewed]
as being the issue. Thus, it may be argued
that the importance of this issue will be
greater when a clearly moral appeal is made
than when one is not made by the person
whom the subject is tempted to exploit.
Hence, we might expect an interaction between
our two independent variables, with
the subject rejecting the exploitative behavior
of the earlier subject as a model when the
issue of ethics is emphasized (“contrast”)
and finding it acceptable when ethical issues
are not made salient (“assimilation”).

The research that has been cited suggests 
that the behavior of the earlier subject may 
have either of two effects. It may serve as a 
model for the subject in all cases, or it may 
be differentially accepted or rejected as a 
model depending on the salience of moral 
issues. Rather than choosing between these 
two a priori predictions, it was decided to 
examine the result for evidence of both effects. 


METHOD 
Subjects and Task 


Approximately 150 shops that buy and sell rare
coins were located in the yellow pages of the telephone
directories of several metropolitan areas. Of
these shops, 4 were used as sources of information
and coins, 18 were used in the pilot study, 31 were
closed or had disconnected phones, and 71 were used
in the study. The remaining shops were not used
in any way. Of the 71 subjects, 6 were eliminated
from data analysis for reasons such as one
dealer being given the experimental treatment while
another man in the same store eventually gave the
price offer for the coins. These 6 subjects were
virtually equally distributed across conditions.

Assignment of subjects to experimental conditions
was done on the basis of several theoretical and
practical guidelines. Because the coin market is
known to fluctuate, it was decided to attempt to run
one subject in each of the four major conditions
(Type of Appeal × Prior Dealer’s Behavior) of the
study each day that data were collected. This distribution
rule also served to minimize any effects that
might have been due to shop location. Thus, for
instance, differences between two experimental conditions
could not have been due to the subjects in one
condition being located in a wealthy business district
while those in the second condition came from a
[illegible]. The order in which dealers were used
[on a given] day was determined by convenience,
considering their various locations and business hours.
Finally, because of the nature of the moral appeal,
a shop had to do business in both coins and stamps
to qualify for this condition.
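
The distribution rule described above (one shop per major condition per day, with the moral appeal restricted to shops that also deal in stamps) can be sketched as follows. This is only an illustration under stated assumptions: the shop names are hypothetical, and shops are drawn at random here purely for convenience of the example, whereas the study ordered dealers by location and business hours.

```python
import random
from itertools import product

APPEALS = ("moral", "neutral")
PRIOR_DEALER = ("fair", "exploitative")
CONDITIONS = list(product(APPEALS, PRIOR_DEALER))  # the four major conditions


def schedule_day(shops, rng):
    """Assign one shop to each of the four major conditions for one day.

    `shops` maps a (hypothetical) shop name to True if the shop deals in
    both coins and stamps, which the moral appeal requires.
    """
    stamp_shops = [name for name, has_stamps in shops.items() if has_stamps]
    if len(shops) < 4 or len(stamp_shops) < 2:
        raise ValueError("need four shops, at least two dealing in stamps")
    # Random choice is an assumption of this sketch, not the study's rule.
    moral_shops = rng.sample(stamp_shops, 2)
    remaining = [name for name in shops if name not in moral_shops]
    neutral_shops = rng.sample(remaining, 2)
    assignment = {}
    for shop, prior in zip(moral_shops, PRIOR_DEALER):
        assignment[shop] = ("moral", prior)
    for shop, prior in zip(neutral_shops, PRIOR_DEALER):
        assignment[shop] = ("neutral", prior)
    return assignment


# Hypothetical shops available on one day of data collection.
todays_shops = {"A": True, "B": True, "C": False, "D": False, "E": True}
print(schedule_day(todays_shops, random.Random(0)))
```

Because every day contributes one subject to each major condition, day-to-day changes in the coin market affect all four conditions alike.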

The subject’s task was one which regularly occurs
in the course of his daily business. He was offered
the opportunity to buy a small collection of coins
by either the author or his assistant,* who enacted
the role of a male college student wanting to sell
the coins. Eight coins were offered for sale. These
coins were kept in separate clear plastic envelopes,
each of which had a label stapled onto it indicating
the type of coin and an abbreviation of its condition.
All of the coins were Lincoln pennies with a
total retail selling price of about $45.

An empty envelope was used to vary the factor
of the prior dealer’s behavior. Since all nine envelopes
were given to the dealer in a “Sucrets” box, he was
forced to notice the empty one first, since it was
always on top. The coin seller then explained that
he had sold the missing coin to another dealer, and
by reading the label on the empty envelope, the
subject learned that the prior dealer had been either
fair or exploitative (see the Design section below).
Comments made by the coin seller before giving
the coins to the dealer determined whether the subject
received a “moral” or “neutral” appeal. Since
the coin seller made it quite clear that he had no
idea of what the coins might be worth, the dealer
was led to believe that he had the opportunity to
buy the coins as cheaply as he desired. The coins,
of course, were never sold to the dealer.


*Grover Gentry, an undergraduate psychology
student at University of California, Los Angeles,
was the second experimenter in this study. The author
is greatly indebted to him not only for his
technical help but also especially for his encouragement
and motivating enthusiasm throughout all
phases of the study.
Design

Four independent variables, each with two levels,
were included in the experimental design. Two of
these variables, type of appeal and prior dealer's
behavior, were of theoretical interest. The other two
variables, experimenter and shop size, were considered
to be of possible importance and, therefore,
were included in the design so that their effects
would be equally distributed across the two main
variables.
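
Four two-level variables define a fully crossed design of sixteen cells. The short sketch below simply enumerates those cells; the level labels are shorthand chosen for this illustration and are not taken from the article's tables.

```python
from itertools import product

# Level labels are illustrative shorthand for the four two-level variables.
FACTORS = {
    "type_of_appeal": ("moral", "neutral"),
    "prior_dealer": ("fair", "exploitative"),
    "experimenter": ("author", "assistant"),
    "shop_size": ("large", "small"),
}

# Each cell is one combination of levels, e.g. moral / fair / author / large.
cells = [dict(zip(FACTORS, levels)) for levels in product(*FACTORS.values())]

print(len(cells))  # number of cells in the fully crossed design
for cell in cells[:2]:
    print(cell)
```

Crossing the two control variables with the two theoretical variables is what lets their effects be spread evenly over the conditions of main interest.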

Type of appeal. The coin seller gave one of two
explanations as to why he was selling the coins. In
the moral appeal, the seller explained that he needed
to sell the coins to get some money to help pay for
some school books and that he was going to sell
them to this dealer on the basis of the recommendation
of another customer, an acquaintance of the seller.
In the neutral appeal, the seller explained that he
simply had some coins that he wanted to sell because
he was not interested in becoming a collector. In
both appeals, the seller’s ignorance of the coins’ value
and his desire to sell them to this dealer were clearly
indicated.

In the moral appeal, the coin seller made the 
following comments: 


Hi, do you buy coins? . . . My name's Jim
Dorris [Gentry], and I have some coins that I
need to sell. [If the dealer interrupts here or later,
the seller replies: “If it’s OK, I’d like to take a minute
to explain why I am selling the coins.”] The
coins belonged to my grandfather, and I inherited them.
Since I don’t know anything about these coins
. . . or coin collecting at all for that matter . . .
I’ve been afraid that I might get exploited selling
the coins.

Because of this I showed the coins to a stamp
collector who lives in my apartment house. [If the
dealer asks for name, seller knows only the first
name.] He didn’t know what the coins were worth,
but he said he’s done quite a bit of business with
you and that I could trust you to give me a fair
price for them.

Normally I wouldn't sell the coins, but I’m in
kind of a jam. I need to buy some textbooks for
my summer school classes to get ready for exams,
and for some reason my check got held up at work.
Since I need the books right now, I’m really in a
bind. So whatever you can give me for the coins
would really help.

[Seller takes “Sucrets” box from his shirt pocket
and hands it to dealer, saying:] Here are the coins.
[When the dealer notices that the coin is missing
from the top plastic envelope which is labeled
either 1909S XF, or 1914D XF, the seller comments:]
Oh, yeah! I was going to sell the coins to
another dealer yesterday . . . but I got interrupted
while I was putting the coins in the box. After
I got off the phone, I just picked up the Sucrets
box and went down to this other dealer's shop.
I told him the same thing I told you about getting
the coins from my grandfather, that I didn’t know
anything about them and why I was selling them.
When we opened the box I discovered I had forgotten
all of the coins except that one [seller
points to empty envelope]. Anyhow, he looked at
that coin and immediately offered me $15 for it.
So I sold him that one for $15.

After I left the shop, I started wondering about
getting exploited since I didn’t know anything
about the coins. That’s when I decided to talk to
the stamp collector, and he recommended that
I sell the rest of the coins to you.

[After the dealer makes initial price offer,]
_____ dollars. Gee, I was hoping they would be
worth more, since those textbooks are expensive.

[After dealer responds to seller's comment,]
_____ dollars would help pay some of the expense,
but it’s still not enough. . . . I think I’ll wait
one day to see if my check comes in the mail.
If it does, I'll be all set; if not, I'll come by
tomorrow.


The coin seller’s comments in the neutral appeal 
were as follows: 


Hi, do you buy coins? . . . I have some coins
that I want to sell. They belonged to my grandfather,
and I inherited them. . . . I don’t know
anything about these coins, or coin collecting at
all for that matter, and I don’t really want to
become a collector. So I want to sell the coins.

I don’t want to take time to get a lot of appraisals
from different people. So I thought I would
sell them to you today for whatever you can give
me for them. Here they are [hands dealer the
Sucrets box].

[When the dealer discovers missing coin, the
seller's comments are the same as in the moral
appeal, except the last paragraph involving wondering
about being exploited and talking to the stamp
collector is omitted.]

[After initial offer,] _____ dollars. Gee, I was
hoping they would be worth more.

[After response to this comment,] _____ dollars
is really not that much. I think I’ll just hold onto
them for a while and see if the price goes up.
Thanks anyhow.


Prior dealer's behavior. In all cases, the dealer 
“discovered” the empty envelope and then learned 
that the missing coin had been sold to another dealer 
the previous day for $15. When the prior dealer was 
fair, the envelope was labeled “1909S XF,” indicating 
a Lincoln penny with a listed retail price of $30 
(according to Coin World newspaper, 7/8/70). When 
the prior dealer was exploitative, our empty enve- 
lope was labeled “1914D XF,” indicating another 
Lincoln penny listed at a retail value of $140. In 
either case, the missing coin was reportedly sold for 
$15, and the seller made it obvious that he had no 
idea of whether or not this was a fair price. The 
dealer, on the other hand, being well aware of the 
values of the 1909S and the 1914D pennies in XF 
(extra fine) condition, knew immediately that the 
seller had received a fair price for the 1909S and had 
been exploited in selling the 1914D. 
Experimenter. The author (B) and his assistant
(G) each served an equal number of times as the
coin seller in each experimental condition. Two
experimenters were used in order to minimize the
possibility of “experimenter effects” (Rosenthal,
1966). Experimenters were alternated between the
coin seller and data collector roles (described
below), and they made independent estimates of most
observations.

Shop size. Preexperimental discussions with both
coin dealers and collectors suggested that large shops
might be able to pay more money for the coins since
they have more capital to invest and a greater volume
of business than the small shops. These discussions
also suggested that the small shop dealer might
be less likely to exploit a naive seller. To check such
possibilities and to control their influence on the
theoretical variables, the shops were divided into
two categories, large and small, on the basis of their
advertisements in the yellow pages of the telephone
book. Ad sizes were rated from “1” (1-3 lines) to
“9” (% of a page). All “1” and “2” shops were
classified as small, and anything over “5” was large.
Sizes “3” and “4” were classified either large or
small depending on a number of other factors such
as physical size, presence of large safes and teletype
machines, volume of coins on display, location, and
decor.
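The classification rule just described can be written out as a short sketch. This is only an illustration of the stated rule, not the authors' procedure: the function name `classify_shop` and the `other_factors_large` flag (standing in for the experimenters' judgment of physical size, safes, teletype machines, display volume, location, and decor) are hypothetical, and the text does not say how an ad of size 5 was handled, so that case is returned as "unspecified" rather than guessed.

```python
from typing import Optional


def classify_shop(ad_size: int, other_factors_large: Optional[bool] = None) -> str:
    """Classify a coin shop as 'small' or 'large' from its yellow-pages ad size.

    ad_size: rating from 1 (1-3 lines) to 9, as described in the text.
    other_factors_large: hypothetical flag for the judgment made from other
        factors; only consulted for ad sizes 3 and 4.
    """
    if not 1 <= ad_size <= 9:
        raise ValueError("ad_size must be between 1 and 9")
    if ad_size <= 2:
        # All "1" and "2" shops were classified as small.
        return "small"
    if ad_size > 5:
        # Anything over "5" was classified as large.
        return "large"
    if ad_size in (3, 4):
        # Sizes "3" and "4" depended on other factors; a judgment is required.
        if other_factors_large is None:
            raise ValueError("sizes 3 and 4 need a judgment from other factors")
        return "large" if other_factors_large else "small"
    # ad_size == 5: the stated rule does not cover this case.
    return "unspecified"


if __name__ == "__main__":
    for size, judged_large in [(1, None), (3, True), (4, False), (5, None), (9, None)]:
        print(size, classify_shop(size, judged_large))
```

Keeping the judgment call as an explicit argument makes it visible that only the middle ad sizes depended on anything other than the advertisement itself.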

The accuracy of the size of advertisement as an
index of shop size is attested by two independent
checks. First, of the 31 shops that were closed or had
disconnected telephones, 23 had ads of size 1, and 3
were size 2, while only 1 shop had an ad of size 5 or
greater. Second, a postexperimental analysis of variance
revealed that shops that we had classified as
“large” on the basis of the telephone ads had significantly
more employees (X = 2.2) than did the
“small” shops (X = 1.5; F = 8.0, df = 1/48, p <
.007).




Procedure 


The two experimenters always entered the coin
shop together. Both were dressed similarly in levis
and short-sleeved sports shirts with the tail hanging
out (to conceal the tape recorder in the coin seller’s
back pocket).* The experimenter posing as the coin
seller waited until the dealer was not engaged with
another customer (never more than a few minutes).
He then approached the dealer with the opening
question, “Hi, do you buy coins?” Upon receiving an
affirmative answer or being directed to the person
who bought coins, the experimenter explained why
he wanted to sell the coins, gave the coins to the
dealer, and then explained what had happened to
the missing coin. During this period, the coin seller
was responsible for recording the conversation on his
portable tape recorder, timing interruptions due to
such irrelevant events as speaking with other customers,
and responding to the dealer in ways that
were consistent with his initial story (such as being
ignorant of the coins’ value or having chosen this
dealer on the basis of a customer’s recommendation).
The coin seller also noted any sources such as the
Red Book or Coin World newspaper that the dealer
checked in making his appraisal of the coins’ value.

* The tape recorder used in this study was a FiCord
300 model. This recorder fit snugly into the back
pocket of the coin seller’s jeans. The recorder has an
external microphone that may be connected and
concealed, as in this study, beneath the experimenter’s
clothing.

While the seller was engaged with the dealer, the
second experimenter was busy collecting information
about the coin shop. Unobtrusively, under the guise
of casually browsing around the shop, the second
experimenter measured the dimensions of the shop
and the counter and bid-board displays of both coins
and stamps. These measurements were made by
covertly pacing off the area accessible to customers
while ostensibly looking at the display cases. The
size of the bid boards and display cases was obtained
simultaneously. The second experimenter also
noted the number of customers and employees in the
shop; the relative volume of coins and stamps on
display; the percentage of coins priced at $10 or
greater; and the existence of expensive furnishings,
safes, or teletype machines.

By the time the dealer was ready to make his
initial price offer, the second experimenter usually
had returned to the vicinity of the seller and the
dealer. Thus, frequently, both experimenters were
able to estimate the distance between the dealer and
seller, as well as eye contact, at the time of the initial
price offer.

After the dealer’s final offer was declined, the
seller thanked the dealer for his time and information
(if any), and both experimenters left the shop
together. They immediately returned to their car,
where the information that was not on the tape
recorder was recorded on a record sheet for that
particular dealer.

RESULTS

Checks on Experimental Manipulations

When tape recordings of the interactions
were transcribed, a check was made to see if
the dealer indicated his awareness of the
appeal. Every dealer made at least one comment
in response to our appeal which indicated
that he had heard it. Twelve of the
dealers made statements or asked questions
which suggested that they had not understood
our story. Typical among these was the
question as to why we did not sell all of the
coins to the dealer who bought the most valuable
one. Whenever such a question was
raised, the seller always reiterated that portion
of the appeal that was not clear until the
dealer indicated that he understood. An analysis
of variance showed that those dealers hearing
the moral appeal made more moral self-references
(X = 1.4) than did those dealers
hearing the neutral appeal (X = .6; F = 6.7,
df = 1/47, p < .01). Finally, it should be
noted that every dealer was willing to listen
to our entire appeal; none gave any evidence
of doubting its veracity, and several indicated
that other people had sold them coins for
similar reasons (inherited them and did not
want to start collecting, or to obtain money
for some other purpose).


Effects of Type of Appeal


Price offers. When the prices that the dealers
offered to pay for the coins were compared
by analysis of variance, the moral appeal was
found to elicit higher bids than did the neutral
one (X = $13.63 for moral, $8.72 for neutral;
F = 20.6, df = 1/49, p < .0001). This difference
involved only the initial bid. When the
seller indicated his disappointment with
the bid, 10 of the 65 subjects changed their
bids at least once. Four of these dealers were
in the moral-appeal condition, and six were in
the neutral condition. The neutral-appeal dealers
were higher than the moral-appeal dealers
on the average number of price changes (2 to
1), the average amount of price increase
($2 vs. $.40), and the average percentage of
increase from the initial offer (45% vs. [illegible]).
The relationship between these bid increases
and the initial price differences is considered
in the Discussion section.


When the final offer for each of the dealers 
is considered, there is still a significant differ- 
ence between the moral-appeal and neutral- 
appeal dealers, despite the fact that the neu- 
tral-appeal dealers made greater increases in 
their price offers than did the moral-appeal 
dealers. 

Nonverbal measures. Both of the nonverbal 
measures of closeness between the dealer and 
the experimenter showed a main effect for 
nature of appeal. Eye contact was rated on a 
3-point scale, with “1” signifying that the 
dealer looked into the seller’s eyes at the time 
of making his initial offer; “2,” that he was 
looking in the seller’s direction (but not in his 
eyes) and established eye contact during the 
comments he made immediately after the 
bid; and “3,” that the dealer did not estab- 
lish eye contact with the seller either during 
or immediately after the initial price offer. 
Eye contact at the time of the first bid was
greater with the moral appeal (X = 1.2) than 
with the neutral appeal (X = 1.8; F = 13.9, 
df = 1/49, p < .0005). 

Since the seller maintained a stable posi- 
tion at the counter while the dealer was ap- 
praising his coins, whatever distance there 
was between them at the time of the initial 
price offer was determined by the dealer’s 
movement. This distance was estimated by
the seller in terms of half-feet. An analysis of 
variance indicated that the dealer was closer 
to the seller while making his initial offer 
following a moral appeal (X = 2.5 feet) than 
after a neutral appeal (X = 3.1 feet; F = 5.7, 
df = 1/49, p < .02). 

Interaction measures. The tape recordings 
of the interactions were used to obtain two 
time measures and six ratings of the contents 
of the dealer’s comments.’ 

The time lapse from the point at which the 
dealer was given the coins until he made his 
first price offer (bid time) was longer follow- 
ing the moral appeal (X = 392 seconds) than 
after the neutral appeal (X = 238 seconds; 
F= 6.1, df = 1/47, p < .02). This difference 
was even greater when the entire time of the 
interaction, including the dealer’s comments 
after his initial offer, is considered (moral = 
489 seconds; neutral = 310; F= 8.1, df = 
1/47, p < .006). 

In four of the six content categories that
were rated, moral-appeal dealers were significantly
more talkative than the neutral-appeal
dealers. As already reported, the moral-appeal
dealers made more moral self-references than
did the neutral-appeal dealers (p < .01).
Moral-appeal dealers also made more comments
giving information about the coin sold
to the prior dealer (X = 2.5 comments) than
did the neutral-appeal dealers (X = .9; F =
20.5, df = 1/47, p < .0001). A similar difference
was found regarding the amount of
information given about the coins being offered
for sale. The moral-appeal dealers made
a greater number of informative comments
(X = 5.0) than did the neutral-appeal dealers
(X = 1.4; F = 10.6, df = 1/47, p <
.002).

Specific criteria for rating the content of the
dealers’ comments were used in determining whether
a comment fit into one or more of the six categories.
For instance, a statement was classified as a moral
self-reference if it either (a) made a direct reference
to the dealer’s ethical business standards (e.g., “I'll
give you a fair price”) or (b) clearly implied the
dealer was trustworthy (e.g., “I keep a price list
right here on the counter. Most dealers wouldn’t
show a list like that.”). Transcriptions of each
dealer’s comments were rated independently by both
of the experimenters without knowledge of the experimental
conditions. A random sampling of about
one-fourth of the subjects (16 of 65) indicated that
the two experimenters rated the comments of these
subjects into the same content categories for 89% of
the comments rated. The few cases of disagreement
were usually settled by mutual discussion of the
response being considered and, in a few instances, by
listening to the tape recording from which the
transcription was made.

The fourth category which yielded a significant
main effect for type of appeal involved
statements suggesting profitable alternatives
to accepting the dealer’s highest
price offer. The moral-appeal dealers suggested
more profitable alternatives (X = 1.6) than
did the neutral-appeal dealers (X = [illegible]; F =
6.1, df = 1/47, p < .02).

The two content categories which did not
yield significant effects for type of appeal
involved statements by the dealer that either
substantiated or degraded the value of the
coins being offered for sale.


Effects of Prior Dealer’s Behavior 


There were no significant main effects of
the prior dealer’s behavior for any of the price
offers or nonverbal measures. There were two
interactions with the variable of shop size on
the nonverbal measures that are cited below
(see Tables 3 and 4).

The content measures yielded differences
for two of the six categories rated. Following
exploitation by the prior dealer, the dealers
gave more information about the coin that
had been sold to the prior dealer (X = 2.5)
than they did if the seller had obtained a fair
deal (X = .9; F = 20.5, df = 1/47, p <
.0001). When talking to a seller who had been
exploited, dealers made fewer comments which
degraded the value of his remaining coins (X
= 3.4) than they did to sellers who had previously
received a fair price for the one coin (X
= 9.5; F = 3.7, df = 1/47, p < .06).

There was one interaction between type of
appeal and prior dealer’s behavior. This interaction
concerns the amount of information
given about the coin that had been sold to the
prior dealer. As Table 1 shows, the typical
dealer gave by far the most information when
he heard a moral appeal from a seller who
had been exploited.

TABLE 1

MEAN NUMBER OF COMMENTS GIVING INFORMATION ABOUT
THE COIN SOLD TO THE PRIOR DEALER AS A FUNCTION OF
TYPE OF APPEAL AND PRIOR DEALER’S BEHAVIOR

                        Type of appeal
Prior dealer’s
behavior            Moral      Neutral
Fair                1.25         .50
Exploitative        3.75        1.25

Note.—For the prior dealer’s behavior, F = 20.5, df = 1/47,
p < .0001; for the appeal, F = 20.5, df = 1/47, p < .0001;
for the Appeal X Prior Dealer’s Behavior interaction,
F = 12.0, df = 1/47, p < .02.
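As a rough check on how the cell means in Table 1 relate to the main-effect means quoted in the text (2.5 and .9 for both type of appeal and prior dealer's behavior), the sketch below averages the four cells. It assumes equal cell sizes, which the source does not report, so the figures are unweighted marginal means and only approximate whatever the original analysis used. The names `cell_means` and `marginal_mean` are illustrative.

```python
# Cell means from Table 1: (type of appeal, prior dealer's behavior) -> mean comments.
cell_means = {
    ("moral", "fair"): 1.25,
    ("moral", "exploitative"): 3.75,
    ("neutral", "fair"): 0.50,
    ("neutral", "exploitative"): 1.25,
}


def marginal_mean(cells, factor_index, level):
    """Unweighted mean of all cells at one level of one factor.

    factor_index 0 selects type of appeal, 1 selects prior dealer's behavior.
    Equal cell sizes are an assumption; the paper does not give them.
    """
    values = [mean for key, mean in cells.items() if key[factor_index] == level]
    return sum(values) / len(values)


moral = marginal_mean(cell_means, 0, "moral")
neutral = marginal_mean(cell_means, 0, "neutral")
exploited = marginal_mean(cell_means, 1, "exploitative")
fair = marginal_mean(cell_means, 1, "fair")

# Interaction contrast: effect of exploitation under the moral appeal
# minus its effect under the neutral appeal.
interaction = (cell_means[("moral", "exploitative")] - cell_means[("moral", "fair")]) - (
    cell_means[("neutral", "exploitative")] - cell_means[("neutral", "fair")]
)

if __name__ == "__main__":
    print(f"moral {moral:.3f}, neutral {neutral:.3f}")
    print(f"exploitative {exploited:.3f}, fair {fair:.3f}")
    print(f"interaction contrast {interaction:.2f}")
```

Because the table is symmetric (the moral-fair and neutral-exploitative cells are both 1.25), the two factors have identical marginal means under this assumption, which is consistent with the text reporting the same means for both main effects.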
Experimenter Effects


The dependent variables were analyzed
to check for differences between the two
experimenters who enacted the coin-seller role,
the author (B) and his assistant (G). Only
one effect was found. Although the two experimenters
served as sellers for equal numbers
of small and large coin shops, there were
more employees in G’s shops (X = 2.2) than
in B’s shops (X = 1.5; F = 11.1, df = 1/48,
p < [illegible]).


Shop Size 

Considered as a group, the results attribu- 
table to differences in shop size suggest that 
small shop dealers were more responsive to 
our experimental manipulations than were 
large shop dealers. Small shop dealers took a 
longer time evaluating the coins and making 
comments prior to making their first bid (373 
seconds) than did the large shop dealers (258 
seconds; F = 3.4, df = 1/47, p < .07). Fol- 
lowing the moral appeal, by far the greatest 
number of moral self-references were made by 
small shop dealers (see Table 2). Although 
large shop dealers were known to have more 
money available to buy coins, their average 
offer ($10.84) was not significantly different 
from that of the small shop dealers ($11.51). 
In fact, following a moral appeal, the small 
shop dealers offered an average of nearly a 
dollar more than did the large shop dealers. 
Again this difference is not significant, but 
given our knowledge that the large shop deal- 
ers could pay more, these two results suggest 
that the small shop dealers were most responsive
to the moral appeal.

Tables 3 and 4 show two interactions be- 
tween shop size and the second theoretical 
variable, prior dealer’s behavior. These results 
also suggest that small shop dealers were 
more responsive to the experimental manipula- 
tions than were the large shop dealers. Table 
3 shows that the greatest amount of eye con- 
tact came from small shop dealers following 
exploitation by the prior dealer and that the 
least eye contact came from these dealers 
following a fair price offer from the prior 
dealer. Similarly, as Table 4 shows, small shop 
dealers stood closer to the seller after he had
been exploited than after he received a fair 
deal. 

TABLE 2

MEAN NUMBER OF MORAL SELF-REFERENCES AS A
FUNCTION OF TYPE OF APPEAL AND SHOP SIZE

                          Shop size
Type of appeal      Large      Small
Moral                .83       1.88
Neutral              .69        .50

Note.—For type of appeal, F = 6.7, df = 1/47, p < .01;
and for the Type of Appeal X Shop Size interaction, F = 4.4,
df = 1/47, p < .04.


TABLE 3

AVERAGE AMOUNT OF EYE CONTACT AT TIME OF
INITIAL BID AS A FUNCTION OF PRIOR DEALER’S
BEHAVIOR AND SHOP SIZE

                          Shop size
Prior dealer's
behavior            Large      Small
Fair                1.50       1.93
Exploitative        1.47       1.25

Note.—The smaller the number, the greater the eye contact;
scale: 1-3; for the prior dealer's behavior, F = 4.8, df = 1/49,
p < .03; for the Prior Dealer X Shop Size interaction, F =
4.2, df = 1/49, p < .05.


TABLE 4

MEAN DISTANCE (IN FEET) BETWEEN DEALER AND
SELLER AT INITIAL BID AS A FUNCTION OF PRIOR
DEALER'S BEHAVIOR AND SHOP SIZE

                          Shop size
Prior dealer's
behavior            Large      Small
Fair                 2.4       [illegible]
Exploitative         3.1       2.6

Note.—Prior Dealer's Behavior X Shop Size interaction,
F = 6.0, df = 1/49, p < .02.
Trend Effects


The subjects were divided into three groups
in terms of the sequence in which they were
contacted: the first 22 subjects, the second 21,
and the final 22. Within each of these three
groups, the distribution of subjects according
to the four independent variables was virtually
equal. There were two effects due to
trend. For the first 22 subjects, the total talking
time was 310 seconds; for the second 21
subjects, it was 421 seconds; and for the
final 22, it was 484 seconds (F = 2.8, df =
2/54, p < .07). The mean number of comments
giving information about the coins being
offered for sale increased from 2.0 for the
first third of the subjects to 2.2 for the second
third, and finally jumped to 5.1 for the last
third of the subjects (F = 3.6, df = 2/51,
p < .04).

DISCUSSION 


This discussion of the results focuses pri- 
marily on the major finding of the study, the 
higher price offer elicited by the moral appeal 
relative to the neutral appeal. First, we will 
attempt to understand this price difference by 
reference to the other findings attributable to 
type of appeal. Next, an attempt will be made 
further to refine our understanding in light of 
research on helping behavior and pacifist bargaining
tactics, and the present study’s findings
in regard to prior dealer’s behavior. A
third section will compare the conditions of 
this experiment with those common to labora- 
tory studies of “unconditional cooperation.” 
Finally, a prominent potential alternative 
interpretation of the data will be considered.


Relationship of Price Offers to Other Appeal- 
Type Findings 

The major prediction involving appeal type 
was that the moral appeal would elicit a 
higher price offer than would the neutral appeal.
This prediction was strongly confirmed.
When confronted with a naive, readily ex- 
ploitable, and totally cooperative seller, the 
dealers offered significantly more money when 
a moral appeal was made ($13.63) than when 
one was not ($8.72, p < .0001). A considera- 
tion of the other findings involving appeal 
type suggests that the moral appeal elicited a 
concerned effort to help the seller, while the 
neutral appeal resulted in a combination of
businesslike responses and exploitative attempts.

The analyses of the verbal interactions provide
the strongest set of data supporting this
explanation for the difference in the price
offers of the moral- and neutral-appeal dealers.
These analyses showed that after hearing
the moral appeal, dealers took time to give
the coin seller more information about his
coins and the coin market that would be helpful
to him in obtaining the highest possible
price for the coins. In a similar vein, the
moral-appeal dealers made more specific suggestions
concerning ways in which the seller
might compare prices and elicit higher offers.

The nonverbal measures indicated that
while making their initial offer, moral-appeal
dealers stood closer to and had more eye
contact with the seller than did the neutral-appeal
dealers. Mehrabian’s (1968) discussion
of research relating nonverbal behavior to
interpersonal feelings indicates that (a)
a smaller distance between communicants and
(b) more eye contact between them communicate
a positive attitude. Particularly relevant
is Machotka’s (1965) finding (cited in
Mehrabian, 1968) that maintenance of eye
contact by the communicator indicates concern
for the addressee. Applied to the results,
this earlier finding suggests a greater concern
for the seller on the part of our moral-appeal
dealers.

Dealers in this study offered between $2.50
and $20 for the coins. Discussions with various
informants indicated that offers of $12
and greater should be considered fair and that
a dealer might legitimately offer considerably
less if he had no particular need for the coins
or honestly felt they were overgraded. While
such factors no doubt operated in some instances,
they do not account for the distribution
of the lowest price offers. There were 14
final price offers of less than $8, and 12 of
these came from neutral-appeal dealers. Of
these 14 low-offer dealers, 8 gave virtually no
explanation for their low offers, while the others
made considerable reference to legitimate
factors such as those considered above. The 8
dealers who did not explain their offers all
received the neutral appeal. Thus, while there
is no conclusive evidence of attempted exploitation,
the data strongly suggest such efforts
by at least 8 out of a total of 32 neutral-appeal
dealers.
Finally, a consideration of the 10 dealers
who increased their initial price offers is likewise
consistent with the explanation that the
neutral-appeal dealers were acting as good
businessmen, while the moral-appeal dealers
were more concerned with helping the seller.
About equal numbers of neutral-appeal and
moral-appeal dealers increased their initial
price offers. However, the neutral-appeal dealers
tended to start with a lower initial offer,
which allowed them to make larger bid increases
($2 vs. $.40 for moral-appeal subjects)
and still end up with a lower final offer than
the moral-appeal dealers.


Relationship of Other Research Findings to 
Effects of Appeal Type 


In attempting to understand why the moral
appeal elicited helping behavior, it is important
to remember that the appeal was made on
several bases. In the moral-appeal condition,
the coin seller presented himself as a person
who needed money for an admirable purpose
(to buy textbooks to study for exams) and
was in a financial bind through no fault of his
own. Furthermore, he was ignorant of the
coins’ value and had been sent to this dealer
specifically by another customer who attested
to the dealer’s honesty and trustworthiness.
Thus, in the moral-appeal condition, the coin
seller was made dependent on coin dealers for
help by “external” circumstances that he
could not control, and he chose to depend on
a particular dealer on the basis of his ethical
reputation.

This situation is similar to that involved in 
laboratory research of “helping behavior,” in 
that the coin seller clearly evoked a “norm of 
social responsibility” in appealing to the 
dealer for help. As initially formulated by 
Berkowitz and Daniels (1963), the norm of
social responsibility prescribes that a person
help those in need of help. Further research
that is relevant to the present study indicates 
that a powerful person’s efforts to help a de- 
pendent person are greater when (a) the costs 
of helping are low (Schopler & Bateson, 1965) 
and (b) the dependence is perceived to be 
caused by “external” (environmental) rather 
than “internal” (personal) factors (Schopler 
& Matthews, 1965). 
Both of these factors appear to have worked
to the coin seller’s advantage in our study. 
The financial costs of helping the dependent 
seller were probably low relative to the psy- 
chological rewards inherent in helping him or 
the psychological costs of exploiting him. The 
cost of a few dollars risked in a high price 
offer was probably minimized by the satisfac- 
tion of helping an unfortunate victim of cir- 
cumstances who recognized the dealer’s mag- 
nanimity. To exploit the helpless seller would 
require violating both the dictates of the 
“social responsibility norm” and the trust of 
a regular customer. These psychological costs 
were undoubtedly high relative to the poten- 
tial monetary reward for exploitation. 

Considering the second finding involving the
perceived cause of the dependence, the coin
seller was clearly portrayed as being made
dependent by “external” factors. Thus, we
may apply the conclusion of the Schopler and
Matthews study (1965) to the relationship
existing between the coin seller and moral-appeal
dealer:
If the partner [coin seller] is seen as a victim of 
circumstances he will be helped more than if he is 
seen as being dependent of his own volition because 
voluntary dependence lessens the salience of the 
social responsibility norm [pp. 611-612]. 

In addition to evoking social responsibility 
pressures, the moral appeal also emphasized 
the relevance of moral standards regarding 
fairness or honesty to the evaluation of the 
dealer’s behavior, a factor found to lessen 
exploitation in at least one laboratory study 
(Meeker & Shure, 1969). Although we can- 
not isolate the effects of the second component 
from those of the social responsibility norm in 
the moral appeal, the data from the prior 
dealer’s behavior variable strongly suggest 
that social responsibility pressures were re- 
sponsible for the helping behavior. 

The few results attributable to the prior 
dealer’s behavior consistently suggest that 
when the earlier dealer purchased the valuable 
penny for only $15, he was rejected as a be- 
havioral model, and his behavior made moral 
issues salient without the seller ever making 
an independent moral appeal. The reactions 
of the dealers as they gave information about 
the valuable coin strongly suggest that this 
was the case. “My God! That’s terrible!” and
other similar exclamations, followed by a hurried
leafing through a coin price guide with
additional informative comments about how 
much they would have paid and how to get 
the coin back, all clearly indicated that they 
rejected his behavior as unethical and dissoci- 
ated themselves from it. The findings of 
greater eye contact (by small shop dealers, 
especially) and less degrading of the coins’ 
value following prior dealer exploitation could 
also be viewed within this context as at- 
tempts to further dissociate oneself from the 
exploitative prior dealer by showing some 
concern for the seller.

Assuming that the exploitativeness of a 
prior dealer aroused the subject’s awareness 
of the ethical issues involved in his transac- 
tion with the coin seller, it is interesting to 
note that the prior dealer’s behavior had no 
effect on price offers. This suggests that the 
variations in price offers due to the type of 
appeal variable resulted from the dealers’ 
responding to the seller’s need of financial 
help (and the social responsibility norm 
thereby elicited) rather than to his efforts to 
emphasize norms of honesty and trustworthi- 
ness. 

Finally it should be noted that in present- 
ing himself as being ignorant of the coin’s 
value, the seller made it clear that only the 
dealer would know if he offered a fair price. 
This situation is quite different from the paci- 
fist appeal cited above (Meeker & Shure, 
1969) in which both the potential exploiter 
and the pacifist are able to evaluate the fair- 
ness of the behavior. 


Comparison with Laboratory Studies of 
“Unconditional Cooperation” 

In introducing the present research, a num- 
ber of studies were cited which investigated 
responses to an “unconditionally cooperative” 
person within the context of a mixed-motive 
bargaining situation. All of these studies were 
laboratory experiments in which “uncondi- 
tional cooperation” elicited predominantly ex- 
ploitative behavior. Our comments on these 
studies suggested that several factors com- 
mon to the laboratory experiments might be 
responsible for this result. The purpose of the 
present study was to eliminate these factors 
while retaining the central object of study, 
the unconditionally cooperative person. The 
interpretation of the results discussed thus 
far indicates that the difference in the prices 
that were offered could be attributed to considerable
helping behavior aroused by the
statement of need, combined with some exploitation
elicited by the neutral appeal.

The conditions of the present study may be
contrasted with the typical laboratory situation
(e.g., Bixenstine et al., 1963; Shure,
Meeker, & Hansford, 1965), which evokes
considerable exploitation. In contrast to the
typical laboratory “cooperator,” the coin seller’s
reasons for his “unconditional cooperation”
were readily comprehensible. Thus, even
when a neutral appeal was given, the dealer
could not justify exploitative behavior by convincing
himself that the other person did not
care about the outcome (as in Solomon, 19/0,
for instance).

Second, the task involved in the present
study is hardly novel for the subject—appraising
and buying coins being his daily business—and
he did not rely on the experimenter
or anyone else to guide his behavior. Here
again, the present conditions contrast with
laboratory studies in which the experimenter
may either explicitly (cf. Swingle, 1965;
Swingle & Gillis, 1968) or implicitly (cf. Bixenstine
et al., 1963) encourage exploitative
behavior, or the game payoff may convert a
supposedly individualistic game into a competitive
one (cf. Gallo, 1966; McClintock &
McNeel, 1966).

Finally, in the present conditions, there was
full visual and verbal interaction between the
dealer and seller. This situation removed the
anonymity that has been given credit for
allowing laboratory subjects to exploit cooperators
(cf. Shure & Meeker, 1967). It may
be argued that removing anonymity increased
the subjects’ concern over their “public image,”
thus inhibiting exploitation (cf. Marlowe,
Gergen, & Doob, 1966). Similarly, increasing
interaction in both visual (cf. Dorris,
Gentry, & Kelley*) and verbal (cf. Wichman,
1970) modes has been shown to increase
levels of cooperation in mixed-motive laboratory
bargaining situations, and in the former
study to the benefit of cooperators bargaining
with competitors.


* J. W. Dorris, G. Gentry, and H. H. Kelley. Effects
upon bargaining of modality of interaction, initial
degree of conflict, and initial orientations. Manuscript
submitted for publication in the Journal of
Personality and Social Psychology, 1972.


Given these changes from the typical laboratory
experimental conditions, it is not
surprising that relatively little evidence of
exploitation was found. One would expect exploitation
only to the extent that the individual
dealer relied on such tactics in his
daily business or perhaps if the monetary
temptation were much greater than the $20 or
so that he could gain.


An Alternative Interpretation


A conceivable alternative interpretation of
the present findings might be that the dealers
hearing the moral appeal became concerned
about maintenance of their reputation with the
stamp collector who recommended them. Thus,
to enhance their reputation, the moral-appeal
dealers were helpful to the seller. Although
this explanation cannot be entirely ruled out,
it is weakened by several factors, one of which is
the spontaneity of the dealers’ responses. They
did not appear to take time to consider their
responses before giving them. Only six of the
dealers asked about the stamp collector, and
the average price offer of these dealers
($12.57) was, in fact, less than the overall
moral-appeal offer ($13.63). Even if the dealers
were in fact concerned about maintaining
their reputation, they would not have had to
offer more than the neutral-appeal dealers
since the naive seller would have had no way
of judging the fairness of their offer. They
could easily have convinced him of their fairness
while making a lower offer.

THE SIGNIFICANCE OF HEART RATE FOR BEHAVIOR:
A CRITIQUE OF LACEY’S HYPOTHESIS 1


ROGERS ELLIOTT 2


Dartmouth College 


Lacey’s hypothesis concerning cardiovascular feedback effects on attention is 
presented. Relatively direct tests of the hypothesis seem not to support it well. 
Its terms appear to lack sufficient independent definition to permit discriminat- 
ing interpretive use. An alternative is to view heart rate change, like change in 
motor behavior, as a response, and one hypothesis that does that, chiefly 
identified with the work of Obrist, is presented for illustration and contrast 
with respect to measurability, definition, and parsimony. 


It is the purpose of this paper to examine 
critically Lacey’s (1959, 1967; Lacey, Kagan, 
Lacey, & Moss, 1963) recent hypothesis 
about the relation between cardiovascular 
responses—especially heart rate—and per- 
formance on sundry tasks. This examination 
consists of a review of relatively direct assess- 
ments of the validity of the hypothesis, of the 
status of the “intake-rejection” dimension 
and difficulties in its application, and of the 
relative merits of viewing cardiovascular
events as causes rather than effects in psycho- 
logically relevant tasks. 


WHAT THE HYPOTHESIS STATES 


Lacey’s hypothesis has been elaborated at
various symposia for more than a decade.
For example, in 1959, he reported data show- 
ing that some stimulus situations resulted in 
heart rate decreases with concomitant skin 
conductance increases, while others caused the 
more frequently reported result of increases 
in both measures. In the former situations, 
subjects had been attending to simple visual or
auditory inputs, and in the latter, they had
been solving mental arithmetic problems or
enduring cold-pressor stress. The early hy-
pothesis was that “cardiac deceleration ac-
companied and perhaps even facilitated ease
of ‘environmental intake,’ whereas cardiac ac-
celeration accompanied or facilitated ‘rejection’
of the environment [Lacey et al., 1963,
p. 165].” The “rejection” of the environ-
ment implies primarily “filtering out
irrelevant stimuli that have distraction value
for the organism [Lacey, 1967].” The
dimension of intake versus rejection of stimuli
was also said to be related to the pleasant-
unpleasant dimension, and Lacey (1967) re-
viewed some of the evidence showing that
commerce with pleasant stimuli was often
accompanied by cardiac deceleration.


1 The writing of this paper, and some of the re-
search reported in it, was supported by Grants
HD04047 and HD 12648 from the National Institute
of Child Health and Human Development. The
author thanks Bruce Flaherty, for collecting some
of the unpublished data reported, and Peter Bankart,
Robert Leaton, and R. B. Malmo, for their helpful
criticisms of the manuscript.

2 Requests for reprints should be sent to Rogers
Elliott, Department of Psychology, Dartmouth Col-
lege, Hanover, New Hampshire 03755.

The hypothesis was given further empirical 
definition with reference to the data reported 
in 1963. For tasks (e.g., mental arithmetic, 
reverse spelling) that required “internal ma- 
nipulation of symbols and retrieval of stored 
information [p. 165],” heart rate and skin 
conductance went up together, relative to rest-
ing levels or pretask alerting periods. For tasks
that did not require “cognitive elaboration,” 
but demanded only that the subject receive, by 
simply noting and detecting, incoming stimuli 
(e.g., looking at a flashing light), heart rate 
went down, and skin conductance went up. 
The time base from which measures of the 
different experimental periods were taken was 
in most instances a minute.

In 1967, Lacey elaborated on the neuro- 
physiological evidence for his hypothesis, 
mainly from acute physiological preparations 
in infrahuman mammals. The postulated 
mechanism involved cardiovascular afferent 
feedback from the receptors of the aortic 
arch (via the vagus) and the carotid sinus
(via the glossopharyngeal nerve), feedback
that has not only the well-established homeo-
static effects within the cardiovascular system
but also effects upon the central ner-
vous system. For example, evidence
was cited that cortical activation is prolonged 
when glossopharyngeal and vagal afferent in- 
put is prevented by cutting those nerves, and 
that stimulation of afferent fibers from presso- 
receptors resulted in inhibition of a wide vari- 
ety of forms, including the reduction of the 
duration of stimulus-evoked cortical, motor, 
and autonomic activity. Thus, decreases in 
heart rate and blood pressure would be dis- 
inhibiting and would produce “a net increase 
in [cortical] excitation: a lowering of thresh-
old, a prolongation of the impact of stimuli,
an increase in spontaneous activity, and the 
like [Lacey, 1967, p. 30].” Increases in heart 
rate and blood pressure, on the other hand, 
would result in raised thresholds for sensing 
stimuli, a less enduring impact of a stimulus, 
a decrease in spontaneous activity, and so on.
Although the data cited in 1963 were tonic
in the sense that the reference time base was
at least a minute for the base, alerting-for-
task, and task conditions, the data cited in
1967 were primarily phasic, concerning effects
occurring over brief durations. (The distinc-
tion has meaning, as we shall see, especially
when phasic effects in one direction may be
superimposed upon tonic effects in another.)
For instance, the study by Birren, Cardon, 
and Phillips (1963) was cited to show that 
reaction time changed within the period of a
single heart cycle as a function of the time 
at which the reaction signal was presented. 
Reaction times were fast when the signal was 
presented during the P wave, a cardiac phase 
occurring just before atria contract and in 
which aortic pressure is falling to its mini- 
mum; reaction times were slow when signals 
were presented during contraction, higher 
aortic pressure, and presumed central inhibi- 
tion. Moreover, when the reaction time task 
used had a fixed 4-second preparatory inter- 
val, Lacey found a marked deceleration 
occurring during the interval, interpretable as 
a cardiovascular response facilitating “intake” 
of the reaction signal by activating the cortex;
and the greater the deceleration, the quicker
the reaction. Lacey emphasized the fact that
the activation theory would not have pre- 
dicted such a result in which improved per- 
formance accompanies decreases in heart rate. 
Finally, Lacey cited data on infants gathered 
at the Fels Research Institute (Kagan & 
Lewis, 1965; Lewis, Kagan, Campbell, & 
Kalafat, 1966) showing that there was a 
heart rate deceleration accompanying atten-
tion in children and that the magnitude of
deceleration correlated with the duration of 
fixation, another case in which “taking in” 
was accompanied, and presumably facilitated, 
by heart rate deceleration. 

Lacey (1967) asserted the high degree of 
speculativeness associated with any hypothe- 
sis extrapolating from acute physiological 
preparations in animals to the sorts of tasks 
already mentioned in human experimenta-
tion and mentioned a variety of complicating
and possibly confounding factors. Further- 
more, it has long been well-known that there 
are many factors affecting normal heart rate 
change besides those under consideration— 
factors like sinus arrhythmia and the homeo- 
static control over blood pressure and heart 
rate already mentioned. However, these fac- 
tors, even when they occur in the tasks in 
question here, are either shown to be rela- 
tively ineffective or unsystematically related 
to major effects, as Lacey has done (1967) 
in the case of respiration and heart rate 
during reaction time tasks, or assumed to be 
so. Despite all these complications, the hy- 
pothesis was asserted, and it, not its qualifica- 
tions, needs primarily to be assessed. That 
is to say, within the range of the psycho- 
logical events described by Lacey, the idea 
is, and is taken by many to be, that lowered 
heart rate will facilitate a “taking in” atti- 
tude and greater sensitivity to stimuli, while 
a raised heart rate will facilitate a “rejection” 
attitude and cognitive manipulation and elab- 
oration, with a raised threshold for simple 
incoming stimuli.

There are various classes of data in terms 
of which one can evaluate the hypothesis. 
First, there are relatively direct tests of vari- 
ous implications of the theory, particularly 
with respect to sensitivity to external stimuli 
as measured both by threshold estimation and 
by reaction time. Second, there are studies
that point up the difficulties of independent 
definition of the critical terms, intake and 
rejection, of the hypothesis. Finally, there are 
alternative kinds of hypotheses that may 
also bear on the data and which may have 
advantages of definition, measurability, and 
parsimony. 


SOME RELATIVELY DIRECT TESTS


Various investigations have examined dif- 
ferent relatively specific aspects of the hy- 
pothesis with a fair degree of directness and 
with results that are mixed but generally un-
encouraging to the hypothesis. Edwards and
Alsip (1969a) determined auditory thresh- 
olds, presenting test tones under both high 
and low transient heart rate. Although the 
hypothesis seems to predict that there would 
be lower thresholds during low heart rate, 
no difference in threshold was observed, and 
the authors concluded that signal detection 
was not related to unprovoked changes in 
heart rate. 

A recent paper by Saxon and Dahle (1971) 
reported a within-subject design in which 
auditory sensitivity was better during lower 
heart rate than during higher heart rate, but 
the high heart rates were induced by exercise, 
and it cannot be determined whether the 
direct aftereffects of the exercise (heavy 
breathing, “body noise,” difficulty in settling 
down) might have affected sensitivity directly, 
since the postexercise period was brief. The 
method used also precluded any signal- 
detection analysis to determine whether ap- 
parent change in threshold was a change in 
response criterion or not. It might be possible
for an incentive manipulation to cause an 
improvement in apparent threshold by the 
method employed, for example, even though 
the typical result of adding incentives is 
higher heart rate (Elliott, 1969). 
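
The distinction between a change in sensitivity and a change in response criterion can be made concrete with the standard equal-variance signal-detection indices. What follows is only a minimal sketch, not the procedure of any study cited here: the function name, the counts in the usage lines, and the log-linear correction for extreme rates are all assumptions introduced for illustration.

```python
from statistics import NormalDist


def sdt_indices(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion) for a yes/no detection task.

    Assumes the equal-variance Gaussian model. The log-linear
    correction (add 0.5 to each count, 1 to each total) is an
    assumption of this sketch; it keeps rates away from 0 and 1.
    """
    n_signal = hits + misses
    n_noise = false_alarms + correct_rejections
    hit_rate = (hits + 0.5) / (n_signal + 1)
    fa_rate = (false_alarms + 0.5) / (n_noise + 1)
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    d_prime = z(hit_rate) - z(fa_rate)             # sensitivity
    criterion = -0.5 * (z(hit_rate) + z(fa_rate))  # response bias
    return d_prime, criterion


# Hypothetical counts for two conditions of one subject.
low_hr = sdt_indices(hits=60, misses=40, false_alarms=20, correct_rejections=80)
high_hr = sdt_indices(hits=50, misses=50, false_alarms=12, correct_rejections=88)
print(low_hr, high_hr)
```

A method that records only detections of the signal yields the hit rate alone, so a more liberal criterion (more "yes" responses overall) cannot be told apart from a real gain in sensitivity. Separating the two requires the false-alarm rate from nonsignal trials.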

Saxon (1970) measured auditory sensitivity 
for threshold-level signals in four subjects and 
found that sensitivity in the P wave of the 
cardiac cycle was, as Lacey would predict, 
much higher than sensitivity during the 
QRS complex. Elliott and Graf (1972) re-
peated Saxon’s design testing visual sensitiv-
ity in 25 subjects. There was no significant
variation in sensitivity across four phases of
the cardiac cycle. Though the hit rate for the
P wave was slightly higher than that for the
QRS complex (49% to 45%), the difference
was not significant (t < 1), with 13 subjects
showing greater sensitivity in the P phase,
and 12 showing less.
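
How evenly the 25 subjects divided can be checked with a two-sided sign test. This is a substituted and cruder technique than the t test reported in the text, offered only as a sketch; the function name is hypothetical.

```python
from math import comb


def sign_test_two_sided(n_positive, n_total):
    """Two-sided exact sign test under a 50/50 null hypothesis."""
    # Work with the larger of the two counts so the tail is the upper tail.
    k = max(n_positive, n_total - n_positive)
    upper_tail = sum(comb(n_total, i) for i in range(k, n_total + 1)) / 2 ** n_total
    return min(1.0, 2 * upper_tail)


# 13 of 25 subjects were more sensitive in the P phase.
print(sign_test_two_sided(13, 25))  # prints 1.0
```

With an odd number of subjects, a 13-to-12 split is the most even division possible, so the sign test gives no indication at all of a consistent direction of effect.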

Delfini and Campos (1971) reported a simi-
lar design with similarly negative results in
10 subjects, with each given 800 signal and
800 nonsignal trials, and they found that
phase of cardiac cycle was unrelated either
to sensitivity or to response bias. These
authors also reviewed neurophysiological evi-
dence that suggests that afferent baroreceptor
discharge correlated with phases of the cardiac
cycle is not prominent past the level of the
nucleus tractus solitarius, and may be “lost”
at the reticular cells.

Thompson and Botwinick (1970) presented
signals either during the R, T, and P waves,
or at different times after the R wave, but
they were unable to repeat the earlier finding
by Birren et al. (1963) of a relation between
reaction time and the phase of heart cycle in
which the reaction signal was presented. Even
though Callaway and Layne (1964) had also
found such a relation, Birren (1965) failed
to find one among aged subjects, and the
present author agrees with the suggestion of
Thompson and Botwinick that “the relation-
ship between RT [reaction time] and phase
in the cardiac cycle, if real, is reproducible
only under extraordinarily unique circum-
stances [p. 64].”

Obrist, Webb, Sutterer, and Howard 
(1970a) blocked the cardiac decelerative re- 
sponse pharmacologically in a group of sub- 
jects and found that such blockage did not 
affect their reaction time performance relative 
to a group of normal controls. 

Johnson and May (1969) compared the 
heart rate response during a reaction time 
task having a fixed 4-second foreperiod with 
the heart rate response during a 4-second 
time-estimation task. They reasoned that since
a subject estimating time should be rejecting 
the environment and concentrating on some 
sort of internal cognitive process (probably 
counting), the heart rate ought to have gone 
up. On the contrary, however, it showed reli-
able deceleration, as did the heart rate of
the reaction time subjects, shortly before the
key-press response common to both groups.
Interestingly enough, Lacey and Lacey (1970)
have recently reported a deceleration “much
the same as that found during the anticipatory
period of the RT [reaction time] experiment
[p. 222]” in a serial time-estimation situa-
tion in which the subject, “with no external
stimulation except his perception of the
elapsed period [p. 222],” must decide when
to press. This would seem to be an explicit
denial that the task requires attention to the
external environment or an “intake” attitude.
They invoked “response intention” as an
explanatory hypothetical term distinct from
attention or intake, but since all three states
appear to have similar effects on heart rate,
and since, as it will be shown, it is difficult to
see how intention to respond (something like
set, presumably) can be operationally or con-
ceptually distinguished from attention, the
finding of deceleration in time-estimation
tasks would seem awkward for the Lacey
hypothesis to handle.

To return to the reaction time task, the
pattern of results having to do with the heart
rate deceleration response just prior to the
reaction signal raises another difficult issue.
Subjects in such a task are trying, so they
report, to focus on the reaction signal, and
nothing else, particularly when they are under
incentive conditions, and they become quiet
both generally throughout the series (Elliott,
1964, 1966b) and prior to each signal (Obrist,
Webb, & Sutterer, 1969; Obrist, Webb,
Sutterer, & Howard, 1970b). Trying to shut
out task-irrelevant stimuli is said (see above)
to be characteristic of “rejection,” and should
be manifest in accelerating heart rate. It
might be argued that the task is so simple
that little cognitive elaboration is involved,
but, in this case, one of Lacey’s paradigm
tasks becomes, in fact, the occasion for com-
peting heart rate tendencies: decelerative, be-
cause the task is simple, and accelerative,
because it requires rejection of distracting
stimuli. It is also true in the reaction time
case that the course of spontaneous movement
is opposite to what Lacey expected: it goes
down with heart rate deceleration, not up.

There is another dilemma for the hypothesis
in reaction time data. Webb and Obrist
(1970) have shown that reaction times are
slow following the shortest of four preparatory
intervals (2, 4, 8, and 16 seconds), and also that
the heart rate deceleration is smallest during 
the 2-second preparatory interval. Such a re- 
sult is easily accommodated by the hypothe- 
sis. The problem lies in the fact that the result 
occurred only when the preparatory intervals 
were presented in random, or irregular order. 
When they were presented in regular order, 
with a series of reactions at each preparatory 
interval so that the subject knew which delay 
to expect, then the heart rate deceleration was 
still smallest at the shortest preparatory inter- 
val, but now the reaction times were fastest 
at that same preparatory interval. Thus, the 
degree of heart rate deceleration appears to 
have little to do with this characteristic dif- 
ference in speed to the shortest preparatory 
interval of a series depending on whether 
presentation is regular or irregular. 

The studies just reviewed have been con- 
cerned with the phasic heart rate response just 
before an anticipated point in time at which 
the subject will act or react. Some of the 
studies examined next concern tasks (series 
of stimulus situations demanding continued 
responses), and the measures reported are 
average heart rates representing all or part of 
task periods of 20-90 seconds—that is, tonic 
measures. It is natural that both sorts of 
experiments should be done: Lacey’s original 
data were tonic (1- or 2-minute tasks); his 
later reaction time data were phasic. How- 
ever, the two sorts do not always match. 
One of the present author’s students, Bruce 
Flaherty, showed in an undergraduate honors 
experiment that in 4-second and 8-second
fixed-foreperiod reaction time tasks, adding 
incentive significantly increased tonic heart 
rate and improved performance; but, at least 
in the 8-second foreperiod, the phasic heart 
rate deceleration superimposed on the higher 
tonic level was substantially greater, too, than 
it was under low incentive. That result 
matches one of Lacey and Lacey (1970), who 
showed that phasic heart rate responses, both 
accelerative and decelerative, were greater 
during the preparatory interval of “moti-
vated” reaction time trials than during
“unmotivated” trials, but no corresponding 
difference in tonic level was mentioned by 
them. It would seem useful to deal explicitly 
with the distinction between tonic and phasic 
effects, as Malmo and Belanger (1967) have 
suggested. 

Studies of shock also illustrate the impor- 
tance of a tonic-phasic distinction, and they 
also appear to present some difficulties for 
the Lacey hypothesis. It is a common observa-
tion that heart rate shows phasic deceleration
just prior to shock (Obrist, 1968; Obrist, 
Webb, & Sutterer, 1969; Zeaman & Smith, 
1965). One may ask, Why should the orga- 
nism wish to facilitate the “taking in” rather 
than the “rejection” of such a signal? How- 
ever, there is also good evidence that heart 
rate shows tonic acceleration in anticipation 
of shock (e.g., Deane, 1969; Elliott, 1966a;
Zeaman & Smith, 1965), at least for the first 
few trials, and one might say that subjects 
are rejecting the environment. Indeed, it has
been shown that both heart rate effects, tonic 
accelerative and phasic decelerative, occur 
in anticipation of discrete shock (see Zeaman 
& Smith, 1965). The point is, unless one has 
a theory about when one uses tonic, and when 
phasic data, one cannot have it both ways. 

The present author (Elliott, 1969) has also 
produced data that appear to contradict the 
tonic version of the Lacey hypothesis. 
Elliott’s subjects performed the three tasks 
of the Stroop test: reading the names of 
colors, then naming the hues of different 
rectangles of color, and finally naming the 
hue, not the word, in which a color word is 
printed, the word and hue not being the same.
The order of the tasks was reversed for half
of the 64 subjects to control for habituation
and other order effects. Though the final task
described is, by comparison with the first two,
very difficult, requiring continuous resolution
of response conflicts (cognitive processing)
and evoking tendencies on the part of sub-
jects to reject the textual environment, heart
rate was lowest on this task.


PROBLEMS OF TESTABILITY AND DEFINITION 


Studies of tonic heart rate responses to 
various tasks of the sort previously discussed 
fall within an established convention of assess-
ing motivation-performance relations with
psychophysiological data. There are a number
of studies that have been done with the Lacey
hypothesis as a point of reference or a frame-
work for interpretation, and some are cited
here to illustrate the difficulty of giving inde-
pendent definition of the intake-rejection
dimension, a difficulty that has resulted, as
can be seen, in awkward and probably ill-
advised distinctions among such terms as at-
tention, verbalization requirement, and task
difficulty. The lack of a clear, independent
definition of what an “intake” task is and
what a “rejection” task is makes it difficult
to make predictions, although post hoc use of
the intake-rejection dimension is, as with all
theories, alarmingly easy. Some of the dif-
ficulties are manifest in studies addressing the
question of “verbalization.”

Campos and Johnson (1966, 1967) per-
formed a series of investigations testing
whether the intake-rejection dimension is
confounded with a verbalization requirement,
one which might itself account for heart rate
increases for those tasks in which it was
present. For example, doing mental arithmetic
almost certainly involves a covert verbal
process, as do reverse spelling and making
up sentences, all tasks used by Lacey to
induce “rejection.” Campos and Johnson im-
posed either a requirement to describe, during
the task or after it, one of a variety of stimuli
varying in complexity or affective tone, or no
such requirement. Of the seven stimulus situ-
ations in which subjects were simply to look
at (take in) a slide (e.g., of a landscape,
maze, cartoon figure, or pictures of accidents),
there was a significant heart rate deceleration
relative to prestimulus levels for only one, and
that one, a scene from a highway accident,
was rated as very unpleasant and was, there-
fore, not the one most likely to qualify as an
“intake” stimulus. On the other hand, instruc-
tions to describe the viewed scenes routinely
raised heart rate.

Though it might be difficult for the intake-
rejection hypothesis to explain why the heart
rate failed to decelerate during simple view-
ing, it could explain the heart rate accelera-
tions under the verbalization requirement,
possibly by pointing out that a requirement
to verbalize requires “cognitive elaboration.”
Hence, the Campos and Johnson reports are
not clearly negative with respect to the hy-
pothesis, and certainly they did not succeed
in their effort to test “whether verbalization
instructions can produce shifts in direction
of heart rate with attention held constant
[1966, p. 308].” Instructions on how to inter-
act with stimuli (e.g., verbalization instruc-
tions) must surely have some effect on atten-
tion to them.

Edwards and Alsip (1969b) also tried to
disengage the verbalization and intake-rejec-
tion dimensions by defining tasks, a priori, as
intake (viewing scenes) or rejection (e.g.,
doing anagrams) tasks, and having each task
done with and without a verbalization require-
ment. Heart rate decelerated to three of the
four intake-no-verbalization tasks, but it rose
in every other combination. Therefore, the
verbalization factor made a difference only
with the intake tasks. From the point of view
of the intake-rejection hypothesis, one might
argue that the reason verbalization had no
effect on heart rate changes in the rejection
tasks was that heart rate rises anyway in
rejection tasks, so that a further rise in re-
sponse to verbalization would be attenuated
or actually masked. However, the result only
reemphasizes the difficulty in trying to sepa-
rate the intake-rejection dimension from such
factors as difficulty or verbalization require-
ment. Consider one of the rejection-verbaliza-
tion tasks employed by Edwards and Alsip
(1969b): “See how many three letter words
you can make from HALLMARK. Report
your number at the beginning of the rest
period [p. 9].” The so-called rejection-no-
verbalization form of this task merely leaves
out the second sentence. However, the first
sentence alone is a clear invitation to covert
verbal activity, cognitive elaboration, or what
have you. One has to conclude that the a
priori attempt to unconfound the two vari-
ables, rejection and verbalization, failed.

This kind of confusion among critical
terms has been best epitomized, perhaps, in a
recent paper by Adamowicz and Gibson
(1970). Their subjects listened either to
comedy recordings (Bill Cosby and others)
or to passages from sophisticated college texts
in mathematics and psychology; the subjects
were told either simply to listen only, or to 
listen and then summarize the passage to the 
experimenter after listening. Heart rate de- 
celerated in the listening-only condition, more 
to the text than the comedy; and heart rate 
rose in the listen-and-summarize condition, 
again, more to the text than to the comedy. 
The textual material was considered by pilot 
subjects to have been less interesting and less 
attention demanding than the recordings in 
the listen-only condition, and more interesting 
and more attention demanding in the listen- 
and-summarize condition. 

The authors offered as a possible explana- 
tion for their findings (lower tonic heart rate 
in the listen-only condition) the interpreta- 
tion that sensory intake was greatest in the 
listening-only conditions, despite their own 
data indicating that students found the text 
material boring under those circumstances. 
Two quotations will demonstrate a more seri- 
ous confusion. “A highly complex cognitive 
task without any need for its solution soon 
becomes boring [Adamowicz & Gibson, 1970, 
p. 246] . . .” and “The magnitude of cogni-
tive task energy demand is probably some
function of the interaction between cognitive 
task difficulty and external solution require- 
ments such as verbalization [Adamowicz & 
Gibson, 1970, p. 247].” We have here an
attempt to define task (not stimulus) com-
plexity and/or difficulty as something distinct
from solution requirements. However, a task 
that does not need to be done is not a task. 

Dahl and Spence (1971) had judges rate 
a variety of tasks on each of nine items of a 
“Task Demand Scale.” Judges showed good 
agreement on total scale scores, and total 
scores were highly correlated with tonic heart 
rate for the different tasks. Only one of the 
items in the scale failed to correlate with 
tonic heart rate significantly in any of the 
three groups assessed, and that was the item, 
derived from Lacey’s formulation, concerning 
direction of attention. The authors pointed 
out that on the tasks they used, the dimen- 
sion could be applied only with relative dif- 
ficulty, but the tasks were fairly common, 
like the three parts of the Stroop test, digit
symbol, listening to white noise, discrimination
of blank versus signal stimuli, and so on.

Porges and Raskin (1969) used a set of 
tasks in which subjects in different groups 
were required either to watch a flashing light, 
to count the number of its flashes in a trial, 
to match the rate of flashing to the rate of 
presentation of intermittent tones, or to 
match the rate of flashing to their own heart 
rate. They found that during the first few
seconds of the 26-second trials, average heart 
rate rose for the group estimating its own 
heart rate and fell for the other three groups. 
Their interpretation was that the estimation 
of own heart rate required internal observa- 
tion and rejection of the environment, thus 
evoking cardiac acceleration. It may be ob-
served, though, that estimating one’s own
heart rate, especially at the beginning of a
trial when one needs to “home in” on it, is 
harder than matching tones, a point noted by 
Porges and Raskin and consistent with the 
fact that errors of estimation were greater 
among those subjects estimating heart rate 
than among those estimating tone rate. 
Making the heart rate easy by amplifying it 
for the subject would be, of course, to make 
it external, also. One wonders what the out- 
come might be if an internal signal (a notice- 
able, but not aversive, pressure stimulus, say, 
from apparatus implanted somehow in the 
gut) were made very easy to track, or if 
an external signal—for example, a recording of 
a beating heart—were made difficult to track 
by masking so that it is a strain to make out. 
It is possible that the difficulty of a task 
might be represented by the amount of energy 
a person typically applies to its completion, 
as many of the items in the Dahl and Spence 
(1971) paper implied, and that possibility 
may serve to introduce the next section.


ALTERNATIVE EXPLANATIONS 


The foregoing sections can be summarized
as showing that Lacey’s hypothesis, in its
more testable implications having to do with
heart rate and sensitivity to stimuli, has
received only mixed and generally discour-
aging support, and that in wider applications
to more complex behaviors, the conceptual
confusion arising from lack of good indepen-
dent, prior definition of the critical terms
of the hypothesis has rendered it virtually
untestable. However, the basic observations
that seem to have prompted it remain to be
explained: phasic heart rate, for example,
does decelerate in anticipation of most discrete
stimuli of any importance, and tonic heart
rate is lower during tasks asking only rela-
tively passive intake, like looking at geo-
metric figures, than those requiring rapid and
focused thinking, like serial addition.

The kind of question asked by a cardio-
vascular feedback hypothesis is how change
in heart rate might facilitate taking in or
rejecting stimuli of many degrees of complex-
ity. Putting the question that way leaves out
of account questions about what caused the
heart rate to change in the first place. When
that latter question is considered, it may be
possible to account for the changes in heart
rate without any postulation of afferent
feedback effects.

The virtues of simplicity would not justify
such ignoring of feedback effects if there
were substantial evidence for them in the
kinds of human, perceptual-motor perform-
ance data reviewed, but there is none, at
least not of the sort that cannot be readily
employed to support a variety of other hy-
potheses, aside from the reaction time data
of Birren et al. (1963) and Callaway and
Layne (1964) and the sensitivity data of
Saxon (1970). Even if there were such evi-
dence, it would be essential to try to assess
the degree to which the kinds of heart rate
effects surveyed here can be accounted for
by viewing heart rate changes as responses
rather than causes. Such an assessment might
permit an estimate of how much might remain
for a visceral afferent feedback position to
account for uniquely.

Among the most prominent hypotheses about the control of
heart rate, and one that illustrates the conception of heart
rate changes as effects rather than causes, is one that has
been called by Obrist et al. (1970b) the cardiac-somatic
relationship. In a number of papers, using eye blinks and
eye movements, chin electromyogram, and gross bodily
movements in humans and gross activity in dogs as his
measures of unprompted (“spontaneous”) motor activity,
Obrist and his colleagues (Obrist, 1968; Obrist, Howard,
Sutterer, Hennis, & Murrel, 1969; Obrist & Webb, 1967;
Obrist, Webb, & Sutterer, 1969; Obrist et al., 1970a, 1970b;
Webb & Obrist, 1967, 1970) have shown repeatedly that there
are very high degrees of concomitance between one or more
such measures of somatic activity and heart rate change.
Situations have involved various reaction time tasks and
anticipation of aversive events in humans, and both operant
and classical conditioning in dogs. The correspondence
between indexes of movement and heart rate is sufficiently
close, for example, that the typical biphasic heart rate
response prior to signal stimuli is mirrored by biphasic
changes in such measures as electromyograph bursts and eye
movements.


Based on a review of these results and relevant
psychophysiological and neurophysiological [data],
Obrist et al. (1970b) suggested that


cardiovascular events, particularly the HR [heart rate]
changes, reflect what the organism is doing somatically
. . . [and] might be most simply viewed as representing one
of the visceral components of the organismic response in
any given situation [p.


In the case of the reaction time task, for example, the
heart rate deceleration is seen not as a feedback mechanism
but as part of a general quieting response involving a
variety of somatic and visceral events while the subject
awaits the tone.

It was even suggested (Obrist et al., 1969) that the
elevated heart rate of children might be related to the fact
that they show a much higher frequency of gross bodily
movements than do adults. The basic observation agrees with
the present author’s (Elliott, 1964, 1966b), and suggests
that indexes of gross movement and of general muscular
tension might be among the most useful measures of
attention. In this respect, one might reinterpret Lacey’s
(1967) citation of the Lewis et al. (1966) and Kagan and
Lewis (1965) reports, showing a relation between attention
(fixation time) and heart rate deceleration in infants. As
the latter report shows, there were a number of significant
correlations between fixation and either of two measures of
movement (vocalization, arm movement), so that the heart
rate deceleration, rather than being taken as providing
feedback to facilitate attention, might simply have been an
accompaniment of a general somatic quieting associated with
fixation. Certainly, as already mentioned, Lacey’s (1967,
p. 30) prediction of more spontaneous movement with cardiac
deceleration finds no support in Obrist’s data, which
always show less.

In the Obrist et al. (1970b) hypothesis, heart rate change
is related, in general, to metabolic requirements involved
in action, but the action includes everything the organism
is doing, so that if, in a reaction time task, the muscles
of the responding arm grow tense but the rest of the body
grows quiet, the net effect is less action and lower heart
rate. Or, as Black and deToledo (1972) put it in discussing
the fact that heart rate change was more highly correlated
with measures of general activity than of bar pressing
alone: “changes in heart rate and bar pressing should be
positively correlated only when changes in bar pressing and
general activity are correlated [p. 300].” Black and
deToledo’s major conclusion, based on studies of the
organization of heart rate, hippocampal theta, skeletal
activity, and muscle tension during conditioning, is that
“heart rate . . . seems to be a correlate of some variable
representing the total amount of skeletal activity
[p. 308].” 
(1970) have also come to a similar conclusion 
based on their observations of the close rela- 
tion between heart rate and ongoing molar 
patterns of motor activity in the rat, and the 
absence of any direct relation between heart 
rate and deprivation operations. One of the 
impressive demonstrations in the Black and 
deToledo chapter was that the heart rate of 
dogs in response to an aversive conditioned 
stimulus was predictable not on the basis of 
whether they were taking in or rejecting the 
signal, but on the basis of how they had 
been trained to avoid signaled shock: either 
by remaining still (in which case the heart 
rate stayed level or went down), or by press- 
ing a pedal (in which case heart rate rose). 

Obrist et al. (1970b) stressed the evidence for correlation
between somatic and cardiac activity, and theirs is a
response-response hypothesis to that degree. The formulation
also postulates that both kinds of responses are linked by
common initiating or controlling mechanisms in the central
nervous system. Their covariation should be apparent unless
one or the other response is blocked, as by curare or
atropine.³ Therefore, in measuring somatic activity, one is
not measuring the independent variable controlling heart
rate, but the behavior which is, to state the case for
plausible biological relevance, subserved by a good blood
supply made available by appropriate changes in cardiac
output. The somatic activity—that is, the motor
behavior—needs to be accounted for in the usual way, but at
least in the case of intact organisms, the Obrist et al.
formulation asserts that only those changes in heart rate
that are independent of changes in somatic activity are
likely to require explanations independent of those made in
accounting for the somatic events themselves.

The present author’s version of a hypothesis about activity
control of heart rate (Elliott, 1969) was invented to
account for changes in tonic heart rate in intact human
subjects, and emphasized the sheer amount of the overt
responding being done, or about to be done. It was an
empirically derived notion, without the theoretical and
physiological elaboration of the cardiac-somatic view
(Obrist et al., 1970b), and it concerned somewhat more
sustained motivational periods, as the word tonic was meant
to imply. However, there are some problems when this
formulation is applied to existing tonic data. Changes
upward in incentive usually result in heart rate elevation
(Elliott, 1969). However, at least in the present author’s
data (Elliott, 1964, 1966b), they also result in reduced
general bodily movement in a reaction time task. In order
to account for the increase in tonic heart rate, one would
need to show increases in muscular tension that would more
than compensate for declines in overt movement. There were
significant electromyogram increases accompanying the rise
in heart rate with incentive in adults, but there were none
in 5-year-old children. Such a discrepancy might be resolved
with more careful and extensive analysis of both overt
movement and muscular tension. In these two studies,
electromyogram was measured from the nonresponding arm only,
and general bodily movement was indexed by artifacts in the
electroencephalogram and respiration polygraph recordings.


³ Black and deToledo (1972) have shown, for example, that
heart rate change will go up or down under curare, depending
on whether the discriminative stimulus presented was one
that in the nondrugged state was conditioned to movement or
to inactivity. Therefore, they concluded, heart rate change
is related to central components of the movement control
system.

It is apparent that a hypothesis that emphasizes both
movement and muscular tension as the strongest correlates of
heart rate will have to say something more definite about
how much tension (measured in how many different muscle
groups) would equal how much movement. Otherwise, it would
be as
vulnerable to the kind of post hoc use pre- 
viously illustrated in explaining some of the 
author’s own data, as the Lacey or activa- 
tion hypotheses. The problem has not arisen 
with Obrist’s phasic data, in which tension 
and movement scores (and heart rate) all 
follow similar courses. But, where it does 
arise, in tonic data, empirical and parametric 
studies will have to be brought to bear on 
the issue, and investigators will have to take 
pains to use a wide variety of measures of 
movement (e.g., stabilimetric, ratings of films) 
and to record electromyogram from a 
sufficient sample of muscle groups. 

Furthermore, no one argues that heart rate change can be
accounted for exclusively in terms of change in general
activity and, by inference, the needs of the musculature for
blood. In addition to the homeostatic phenomena already
referred to, heart rate shows characteristic habituation
effects and orienting and defensive responses which may or
may not be movement and tension related.
Heart rate is under autonomic control, and 
it may be to some degree under the control 
of the sorts of arousing variables—chiefly 
collative and classically conditioned—that af- 
fect other responses under autonomic control, 
like skin conductance. 

There are, for example, reports of heart rate effects
related to variations in uncertainty (e.g., Bow 1971;
Epstein & Roupenian, 1970) and fear or anxiety in
anticipation of shock (Deane, 1961; Elliott, 1966a), and
until movement and muscular tension are adequately sampled
in such situations, it cannot be said whether heart rate
changes occur in the absence of skeletal-motor changes. If
they do, of course, they might still be accounted for
without reference to the cardiovascular feedback effects.

Heart rate change is also known to anticipate change in
overt movement, as when it goes up prior to actual speech in
an interview situation (Murray, 1963), or up in dogs just
before exercise (Rushmer, Smith, & Lasher, 1960), or up
during the minute prior to a task during which the subject
has been “alerted” that the task will be coming (Lacey
et al., 1963); again, if a thorough attempt to sample
tension and movement (or some other correlate of cardiac
output like oxygen consumption) yields no relation to heart
rate, then other variables will be required to account for
its variation. The same remarks apply to heart rate changes
under curare and heart rate changes as a result of direct
operant conditioning. In all these cases a conditioning
history will undoubtedly be required, along 
with a measure of current metabolic demand, 
to render a reasonably complete account of 
heart rate change in the tasks that have been 
discussed. Such a history is, again, observable 
in a sense in which “intention” to take in or 
reject the environment, along with cardio- 
vascular feedback effects, are not. There are 
awkward measurement problems with the 
cardiac-somatic formulation, but they are diffi- 
culties of method, not conception, and they are 
presumably tractable. There seems nothing in 
the formulation as difficult as the problems of 
independent definition of such terms as intake 
and rejection. 

Partly because of their observability, one of the most
important contributions of taking measures of movement,
tension, or metabolic activity may be to reduce the number
of interpretations of one’s results. For example, Kahneman,
Tursky, Shapiro, and Crider (1969) found that heart rate
rose when their subjects were listening to a short series of
numbers, to each of which they were to add a previously
announced constant integer and then report the series of
sums. An interpretation of this result appeared in a
subsequent paper (Tursky, Schwartz, & Crider, 1970), in
which the authors suggested that the subjects had not only
listened to the numbers (intake, hence deceleration), but
had begun to work on them (rejection, hence acceleration).
In that second paper, Tursky et al. showed that hearing the
numbers prior to being told whether or not to add something
to them resulted in heart rate deceleration in a purer
instance of information intake. However, in the 4 seconds
between hearing the numbers and hearing the constant to be
added to them, heart rate rose. This period was considered
by the authors to be one of cognitive processing
(“rehearsal”), with the subjects presumably anticipating
various tasks. That interpretation may be correct, and a
report of interviews with the subjects might have
substantiated it. As it is, two other periods of consistent
and predominantly accelerative heart rate responses went
unexplained: one occurred before the subjects heard the one
set of numbers, and one occurred after the task response was
completed. It is not easy to invoke “rehearsal” or other
cognitive processing for those periods, but measures of
concurrent somatic or metabolic events might have shown
concomitant variations with heart rate. Had they done so,
the changes in movement and tension could provide a
plausible account of heart rate change, though they
themselves would still require explanation.


CONCLUDING REMARKS 


The chief characteristic of the afferent feed- 
back model when compared with the cardiac- 
somatic model or others signifying heart rate 
change as a dependent variable is the role of 
the heart, and of cardiovascular activity in 
general. In the afferent feedback model, the 
heart works in ways that facilitate the inter- 
action (intake or rejection) that the organism 
intends with the environment; it is thus given 
causal status in the chain of events leading to 
the outcome of the interaction: for example, 
if heart rate decelerates, reaction time will be 
faster. In the cardiac-somatic model, the
heart rate response is just one of the many
responses that make up the outcome of the
interaction, so that relations between heart 
rate deceleration and reaction time would be 
response-response, linked centrally as com- 
ponents of a molar biological act involving 
behavior and its energy requirements; or, in 
the operant conditioning model, heart rate 
change is sometimes conceptualized as a re- 
sponse functionally related to discriminative 
and reinforcing stimuli with no particular re- 
lation to other motor or visceral responses. No 
one is denying that most responses do have 
feedback effects (i.e., act as stimuli), but
there is a real and considerable difference be- 
tween the two kinds of hypotheses concerning 
the status and importance of cardiovascular 
feedback. 

The afferent feedback position is automati- 
cally less parsimonious. Its complexity would 
be justified if there were reliable data for 
which it were especially appropriate, but there 
seem to be very few. All that is observed is 
that antecedent events (various settings, in- 
structions, task materials, apparatuses, and 
events) result in various concurrently mea- 
sured responses including receptor-orientating, 
other perceptual-motor, and physiological re- 
sponses. Studies concerning activation theory 
worked with just those classes of observation, 
and the present author (Elliott, 1964), in 
criticizing activation theory, came to the fol- 
lowing conclusion, which seems as pertinent 
here: 


perhaps the simplest way to conceptualize the re- 
sults of . . . any study in which covert physiological 
and overt motor variables are concurrently measured, 
is to assume that both the overt and covert behav- 
iors are affected chiefly not by each other, but by 
whatever antecedent manipulations have been made 


[p. 18]. 


In the 1967 paper in which his hypothesis 
was most elaborately presented, Lacey also 
criticized activation theory on a number of
grounds, but his own hypothesis has perpetu- 
ated activation theory’s chief characteristic 
and chief weakness, which is to infer without 
warrant a causative internal state (instead of 
drive or arousal, it is now cognition and atten- 
tion) from a peripherally indexed response. 
Neither the provenance of the hypothesis, the definition and
rigor of its terms, nor the empirical data that bear on it
are so far so compelling as to preclude a vigorous
skepticism.


TEST ANXIETY AND THE MODEL WHO FAILS¹


IRWIN G. SARASON 2 
University of Washington 


Two experiments were performed in which subjects, differing in test anxiety,
performed on a verbal learning task following the opportunity to observe a
model. In the first experiment, groups differing in test anxiety differed in
performance levels but not in their reactions to observing a self-derogatory
model. In the second experiment, the model was not self-derogatory, but rather
was authoritatively failed by the experimenter. Under this condition, high and
low test anxiety showed opposite reactions. High-test-anxiety scorers were
detrimentally affected by observing the failed

In recent years the relationship of observational or
modeling opportunities to subsequent behavior has received
increased attention. Dependent measures in experiments
that have been conducted include aspects of 
social and motor behavior and intellectual 
achievement (Bandura, 1969; Mischel, 1971). 
There is evidence that these types of measures 
are sensitive to experimental manipulation of 
the opportunity to observe a model and, be- 
yond that, to specific components of the mod- 
el’s behavior. This article describes two ex- 
periments whose results add to knowledge 
of the interacting roles in intellective per- 
formance played by (a) observational oppor- 
tunities given to subjects prior to performing 
on a task and (b) a personality characteristic,
test anxiety.

Available evidence indicates that verbal 
learning is influenced significantly by the test 
anxiety of the learner and the stressfulness of 
conditions associated with his performance 
(Sarason, 1960; Wine, 1971). The perform- 
ance of high-test-anxious subjects is more 
detrimentally affected by situational stress 
than is the performance of less anxious sub- 
jects. There is also reason to believe that 
high- and low-test-anxious persons differ in 
the degree to which observation of a model influences their
behavior. Modeling has been used successfully as a
facilitative factor in subjects’ performance, especially by
those who are high in test anxiety (Sarason, 1971). Might
the negative effects of stress and the positive effects of
observing models be accounted for within one theoretical
framework?

Test anxiety has been conceptualized as a 
tendency to emit personalized task-irrelevant 
responses when the individual experiences 
heightened awareness that his performance is 
being evaluated. From this perspective, the 
highly test-anxious individual is especially 
attentive to cues concerning the evaluative 
component of his performance, and his fear of 
failure can be interpreted in terms of self-centered
thoughts that interfere with concentration on the task at
hand. If the highly test-anxious individual is particularly
vigilant, he might attend as much to the success or failure
experienced by the model he is observing as to the actual
responses emitted by the model in performing the task. In
the experiments reported here, the question was asked: Do
persons differing in test anxiety respond differentially to
the vicarious experience of observing either a model who
simply performs a task or a model who fails?


EXPERIMENT I 
Method 


. . . large group of students prior to and independent of
the experiment. The high- and low-TAS groups were drawn from
approximately the upper and lower 15% of the score
distribution. The score ranges for the three TAS groups
were: high TAS, 22-37; middle TAS, 10-21; low TAS, 0-9.

Models. There were four models, female students at the
University of Washington, who were trained to respond to a
verbal learning task in a prescribed fashion. Assignment of
subjects to models was random.

Procedure. The materials used in the learning experiment
were 17 dissyllable words of low meaningfulness (Noble,
1952). These stimuli were presented on a Lafayette memory
drum, using a 2-second exposure and a 6-second intertrial
interval. The anticipation method was employed over 25
learning trials.

The 36 subjects in each of the high-, middle-, and low-TAS
groups were assigned to one of three conditions. Under the
control (no-modeling) condition, the subject was brought
into the laboratory and given the necessary directions and
instructions. All subjects worked for 25 trials on the
subjects’ list, List S, and then performed on List M for 15
trials. List M was the list to which the model responded in
the other two experimental groups. Lists S and M were
comparable in terms of their meaningfulness.

Under the two modeling conditions, the subject was told:

You are going to perform a memory drum task in a few
minutes. It would be helpful for you to watch someone else
performing the same task before you do it yourself. You
will also be able to hear what she says. After she
finishes, I will excuse her and take you into that room and
give you a different list of words to learn.

. . . with List M for 15 trials. The model “learned”
according to a predetermined response pattern that resembled
the typical serial position curve, and she was then
dismissed. While the model was being run, the subject sat in
the observation room, observing through a one-way mirror.
The observation room contained a memory drum which ran
synchronously with the memory drum in the experimental room
(the same experimental room was used for all conditions; for
the control condition, the one-way mirror was covered). At
the end of the experimental session, the subject was given a
series of rating scales dealing with how she reacted to the
experimental situation.

The two modeling conditions differed only in certain
comments made by the model. Under the neutral condition, the
model simply carried out the task assigned and made no
self-derogatory comments. Under the self-derogatory modeling
condition, the model, at designated points throughout the 15
trials, made comments indicating dissatisfaction with her
level of performance. Specifically, the model made six
self-derogatory statements: “I haven’t gotten even one right
so far”; “I hope I’m not ruining your experiment”; “Maybe
I’d better give up”; “I just don’t seem to be able to
improve”; “Sometimes I just seem to go blank”; “I’m sorry
I’m so dumb, I must have failed miserably.”


Results and Discussion


The effects of the experiment were evaluated by means of an
analysis of variance. In addition to the organismic variable
of test anxiety and the experimental treatment, trial blocks
(five blocks of five trials each) were examined as a
repeated-measurements factor. In addition to the expected
learning effect, there were two significant results. For
List S, a significant F for TAS (p < .01) resulted from the
fact that the number of correct responses over all trials
was negatively related to TAS scores. The mean numbers of
correct responses were as follows: high TAS, 181.94; middle
TAS, 202.08; and low TAS, 222.75. The treatments effect was
also significant (p < .01). The control (no-model) mean was
176.22, while the subjects who observed the neutral model
who did not engage in self-derogation had a mean of 209.72,
and the self-derogatory model group had a mean of 220.83.
The results for the subjects’ performance on List M, the
list on which the model had performed, essentially
paralleled the List S results.

Noteworthy by its absence was a significant TAS × Treatments
interaction. It had been expected that the opportunity to
observe a model would have a facilitative effect on all of
the subjects, especially on those who are high in test
anxiety. It had also been expected that the high-TAS group
would show this facilitative effect to less of an extent
under the condition involving the self-derogatory model.
While an overall facilitative modeling effect was evident,
exposure to a self-derogatory model did not differentially
affect the three test-anxiety groups.

Along with the absence of a TAS × Treatments effect, the
presence of a significant main effect for TAS had not been
predicted. Previous research has suggested that under
neutral conditions, subjects high in test anxiety typically
perform at levels comparable to those of lower TAS scorers
(Sarason, 1960). Post hoc review of the self-derogatory
model condition led to the observation that while the model
had derogated herself, her performance had not been
proclaimed by an authority figure (in this case, the
experimenter) to be a failure. Since there might be
discernible differences attributable to observing someone
who is self-debasing and thinks he is failing and someone
who is told authoritatively that he has failed, it was
decided to replicate the experiment with one whose
conditions would include observation of a model who receives
explicit notification of poor performance.

Another possibility concerned the effect of the
experimenter’s manner and characteristics on the subjects’
performance level. The generally low level of performance of
the high-TAS group—even under the control condition—might
have been due to the subject’s perception of the
experimenter. Although a basis for drawing firm conclusions
was lacking, it was decided to use a different experimenter
in the second experiment. Because it was possible to elicit
both postexperimental ratings in the two experiments, it was
felt that some inferences might be drawn about the role of
the experimenter. Later, some differences are noted between
the results of the two experiments that are relevant to
this point.


EXPERIMENT II
Method 


Subjects. The subjects were 108 females enrolled in
introductory psychology classes at the University of
Washington. They were selected in the same manner as were
those used in Experiment I. The TAS cutoff points for high-,
middle-, and low-anxious groups were also the same.

Procedure. The experimenter was female, and the general
procedure was the same as that used in Experiment I.
One-third of the subjects received a condition in which they
observed a model, and one-third were not given an
observational opportunity. The remaining subjects were
administered a condition in which a model did not make
self-derogatory comments but was informed that her
performance level on the serial learning task was unusually
low. At the end of the model’s performance, the experimenter
added up the “score” and told her that she had made many
fewer correct responses than would be expected of college
students. The model was, in effect, told that she had
failed, and the experimenter expressed concern that if her
performance on the learning task was a representative sample
of her abilities, her academic success was questionable.


Results and Discussion 


An analysis of variance was performed on the number of
correct responses on List S. Although the direction of the
results for the TAS main effect resembled that obtained in
the first experiment, they failed to reach a statistically
significant level. The effect of treatments was significant
(p < .01), with the two modeling conditions leading to
better performance than did the control (no-model)
condition.

In addition to the expected differences among trial blocks,
the TAS × Treatments interaction was the only other
significant effect (p < .01). Table 1 presents the means
involved in this interaction. The three test-anxiety groups
did not differ under the control condition. For the
condition under which the model did not fail, high-TAS
performance was superior to that of the low- and
middle-scoring groups (p < .05). For the condition under
which the model was failed, the high-TAS group performed at
a significantly lower level (X = 170.13) than did the
high-TAS group that observed a nonfailing model (X = 221.11,
p < .01). The middle-TAS groups under the two modeling
conditions did not differ significantly in their performance
levels, with means of 210.14 and 201.54 for the failed and
nonfailed modeling conditions, respectively. The low-TAS
group that observed the failed model performed at a higher
level (X = 232.11) than did the one that observed a
nonfailing model (X = 193.44, p < .01). The results for
List M, the one on which the model had performed, were
similar to those for List S.


TABLE 1

MEAN NUMBER OF CORRECT RESPONSES IN EXPERIMENT II
ON THE SERIAL LEARNING TASK (LIST S)

Test anxiety        No. correct responses

High
   Model                   221.11
   Model fails             170.13
   Control                 166.31
Middle
   Model                   201.54
   Model fails             210.14
   Control                 174.55
Low
   Model                   193.44
   Model fails             232.11
   Control                 176.32

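
The crossover described for Experiment II can be made concrete by subtracting the nonfailing-model mean from the failed-model mean within each test-anxiety group. The following is only a descriptive sketch using the cell means reported in Table 1; the dictionary layout and the helper name `failed_minus_nonfailed` are illustrative assumptions and are not part of the original analysis, and the sketch does not reproduce the analysis of variance or its significance tests.

```python
# Cell means copied from Table 1 (Experiment II, List S).
# The data layout and function name are hypothetical, for illustration only.
TABLE_1_MEANS = {
    "high":   {"model": 221.11, "model_fails": 170.13, "control": 166.31},
    "middle": {"model": 201.54, "model_fails": 210.14, "control": 174.55},
    "low":    {"model": 193.44, "model_fails": 232.11, "control": 176.32},
}


def failed_minus_nonfailed(means):
    """Per test-anxiety group, return mean(model fails) - mean(model)."""
    return {
        group: round(cells["model_fails"] - cells["model"], 2)
        for group, cells in means.items()
    }


if __name__ == "__main__":
    # A negative value means the group did worse after watching the failed model.
    for group, diff in failed_minus_nonfailed(TABLE_1_MEANS).items():
        print(f"{group:>6}-TAS: {diff:+.2f}")
```

The sign of each difference mirrors the text: the high-TAS difference is negative, the low-TAS difference is positive, and the middle-TAS difference is small by comparison.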
Following both experiments, the subject 
was asked to fill out a series of rating scales 
indicating her reactions to the experiment. In 
addition, a number of open-ended questions 
were asked. The subjects in the second study described the
experiment as being more interesting than did those in the
first one (p < .05). High-TAS scorers reported being less
able to concentrate (p < .05) and less certain of their
ability to perform the task at hand (p < .05) in the first
than in the second experiment. Comparable differences were
not found for middle- and low-TAS groups. These findings,
together with comments in responses to the open-ended
questions, suggest that the
high-TAS subjects in the first experiment per- 
ceived their experimenter as being more anxi- 
ety provoking (or, perhaps, less anxiety allay- 
ing) than did the high-TAS subjects in the 
second experiment. 

The results of the two experiments suggest 
that observing someone derogating himself 
~ and observing someone who is declared by an 
authority figure to be a failure can have quite 
different effects on performance. In both ex- 
periments, the best performance was attained 
by low-TAS scorers who observed either a 
self-derogating model (Experiment I) or a 
model who is failed (Experiment II). These 
conditions appear to have functioned as moti- 
, vators for low-TAS subjects, High-TAS sub- 
$ ~< jects responded quite differently to these two 
j conditions, with observation’ of a failure re- 
- port given by the experimenter to a model 
having a more negative effect than observa- 
tion of a self-derogating model. Observing 
someone who is dissatisfied with himself does 
not necessarily mean that there is an objec- 
tive basis for that reaction, Observing a 
teacher, experimenter, or other figure nega- 
` tively evaluate one student’s performance 
makes it clear that he is capable of doing the 
same with the performance of the next student.

The findings support the conclusion that 
persons differing in test anxiety differ in the 
stimuli to which they are especially attentive 
and how they react to the stimuli. Evaluative 
cues seem to motivate persons low in test 
anxiety. For those who are highly anxious, 
evaluative cues may be quite threatening dan- 
ger flags. The subjects’ impressions of and 
subjective reactions to the experiment sug- 
gest that the experimenter is a stimulus sus- 
ceptible to varying analyses depending on the 
perspective of observers, the subjects. 

The results of these experiments show that 
when poor performance occurs, it is not neces- 
sarily attributable to inadequate intellective 
wherewithal. The results are consistent with 
a learning or habit interpretation of test anxi- 
ety. That high-test-anxious subjects reacted 
so differently to a model’s self-derogation and
his declared failure by the experimenter sug- 
gests that it is as necessary in analyzing anxi- 
ety to focus on the eliciting cues as it is to 
attend to the anxiety responses themselves.
Important problems for future research con- 
cern the types of cues to which persons dif- 
fering in test anxiety are responsive and the 
nature of their reactions to these cues. 
BLACK POWER IDEOLOGY AND COLLEGE STUDENTS’
ATTITUDES TOWARD THEIR OWN AND 
OTHER RACIAL GROUPS: 


A CORRECTION 


ELISE E. LESSING¹ AND SUSAN W. ZAGORIN
Institute for Juvenile Research 


The data for this study which were initially 
reported in the January 1972 issue of the 
Journal of Personality and Social Psychology 
were re-run as a test problem to check the 
conversion of a computer program for use on 
a larger machine. It was then discovered that 
in the original data analysis, the first of the 
three IBM cards for the first subject in the 
white sample had at some point become dis- 
placed later in the deck so that the computer 
read incorrect data for many of the white 
subjects. The results of this particular data
analysis were presented in Tables 3 and 4 of
the original paper. The corrected tables are 
as follows. 

The Results section of the original paper 
requires the following changes. Hypothesis c, 
which predicted that subjects high in black 
power orientation would perceive a significantly
smaller distance between black person
and ideal person and between black person and 
friend than would subjects low in black power 
orientation, received full rather than partial
support among white subjects. The anomaly
of white subjects high in black power orienta- 
tion perceiving black person and ideal person 
as more distant than did white subjects low in 
black power orientation was an error.

Hypothesis e, which predicted that students
high in black power orientation would perceive
a significantly smaller distance between
white person and enemy than subjects low in
black power orientation, was upheld. Initially 
it was stated that the high-black-power- 
oriented white subjects tended to view all 
types of persons as more distant from enemy 
than they were viewed by low black power 
subjects. In the corrected analysis, this trend
was no longer statistically significant. Hypothesis
i, which predicted that students low
in black power ideology would produce a lower
evaluation factor score for black person than
for Negro person or colored person was upheld
in the white sample.

1 Requests for reprints should be sent to Elise E.
Lessing, Institute for Juvenile Research, 1140 S.
Paulina Street, Chicago, Illinois 60612.

The converse of Hypothesis g, namely
Hypothesis j, which predicted that low black
power subjects would perceive a significantly
larger distance between black person and
friend and ideal person than between other
nonwhite concepts and friend and ideal person
was upheld in the white sample. The initial
finding of a significant trend among whites in
a direction opposite to that [predicted was an]
error. The fact that black person [was seen as]
closer than Negro or colored person [to ideal]
(in potency) and to friend (in activity) [did]
not, contrary to the original statement, outweigh
the fact that black person was seen
as more distant than Negro or colored
person from friend or ideal in regard to the
evaluation factor.

Hypothesis k, which predicted that students 
low in black power orientation would perceive 
a significantly smaller distance between black 
person and enemy than between the other two 
nonwhite concepts and enemy was not upheld 
in the white sample. White subjects who were
low in black power orientation produced a 
slightly, but not significantly, smaller black 
person versus enemy than colored or Negro 
person versus enemy D (2.23 versus 2.43
and 2.44). 

With respect to the Discussion section of
the original paper, it should be noted that the
general effect of the corrections in the data
was to increase the number of hypotheses upheld
in the white sample. The second paragraph
of the Discussion section (p. 69) should
now summarize the findings as follows: In
both the white and the Negro samples, those
college students who were high in black power
orientation placed a lower evaluation upon
white person and saw white person as closer
to enemy than did students who were low in
black power orientation. In addition, white
and black students who were high in black
power orientation evaluated black person more
favorably, saw black person as further from
the concept of enemy, saw black person as
closer to ideal person, and (in the case of
white subjects only) saw black person as
closer to friend than did students low in black
power orientation.


TABLE 3

RESULTS OF SIX TWO-WAY ANALYSES OF VARIANCE WITH LEVEL OF BLACK POWER ORIENTATION
(BPIS SCORES) AND TYPE OF ETHNIC CONCEPT (COLORED, BLACK, WHITE, OR NEGRO) AS
INDEPENDENT VARIABLES AND SEMANTIC DIFFERENTIAL DISTANCE (D) SCORES COMPUTED
VERSUS IDEAL PERSON, FRIEND, AND ENEMY AS DEPENDENT VARIABLES

                      F values for white sample                   F values for Negro sample
Reference concepts
for D scores          BPIS          Ethnic concept  Interaction   BPIS      Ethnic concept  Interaction
Ideal person          9.81**        16.57**         16.29**       3.12      46.04**         13.51**
Friend                7.94**        [illegible]     10.98**       8.47**    29.08**         [illegible]
Enemy                 [illegible]   11.45**         12.05**       .30       34.87**         9.23**

Note.—For the white sample, df = 1/234 for the F for BPIS and 3/702 for the F for ethnic concept and interaction; for the
Negro sample, df = 1/114 for the F for BPIS and 3/342 for the F for ethnic concept and interaction.
* p < .05.
** p < .01.


TABLE 4

D SCORE MEANS FOR HIGH AND LOW BLACK POWER (BPIS) SUBGROUPS
OF THE WHITE AND NEGRO SAMPLES

                   Colored person    Black person      White person      Negro person
Sample             Low     High      Low     High      Low     High      Low     High
                   BPIS    BPIS      BPIS    BPIS      BPIS    BPIS      BPIS    BPIS

White
  Ideal person     2.51    2.09      2.81a   2.11a     2.01    2.10      2.49    2.01
  Friend           2.14    1.85      2.48a   1.87a     1.75    1.78      2.16    1.74
  Enemy            2.43    2.60      2.23a   2.56a     2.86a   2.57a     2.44    2.64

Negro
  Ideal person     1.78    2.38b     2.12a   1.56a     2.71    3.84      1.92    2.06
  Friend           1.70    2.30      1.99    1.90      2.42    3.44      1.76    2.08
  Enemy            3.26    3.36b     3.27    3.95a     2.66a   2.02a     3.16    3.58

Note.—D score represents distance between concept in lefthand column and concept in top row. Significant differences not
relevant to hypotheses are not marked.
a Difference between D score means of low and high BPIS subgroups differed significantly; p for Scheffé’s test < .05.
b Difference between this D score mean and D score mean for black person differed significantly for subjects within this BPIS
level.


The major difference in interpretation as
a result of the corrections is that there is
now little support for the statement:


However, in the case of the white students, the data 
suggest that the high and low endorsers of black 
power ideology are quite different types of people 
with level of black power orientation following from, 
rather than causing, the extensive personal differences [p.
70 in original paper]. 


Though the evaluation factor score mean for
ideal person earned by low-black-power-oriented
white subjects did not exceed that
earned by both high- and low-black-power-oriented
black subjects, the fact that it exceeded
the mean of low-black-power-oriented
white subjects in the context of generally
more misanthropic attitudes toward all persons
was used as a basis for suggesting that
authoritarianism was the underlying personality
trait differentiating the white subjects low
in black power orientation from those who
were high. The data are not sufficient to support
this line of reasoning. The statement
regarding the general misanthropy of the
low-black-power-oriented white subjects is false
on the basis of the corrected data. Instead,
there was evidence that the low-black-power-oriented
white subjects were specifically prejudiced
against nonwhites. Thus, whites low in
black power ideology saw colored person,
Negro person, and black person (but not
white person, as had been originally stated)
as significantly further from the concept of
ideal person and friend than did whites high
in black power ideology. Finally, while as was
originally stated (p. 70), there was agreement
among white and black subjects that [black]
person was most potent and active [with Negro]
person next, it was colored rather than white
person who came next, with white person
being seen as least potent and active (.

by low-black-power-oriented whites w 
white person ahead of colored in ac 